An AI agent that automates desktop tasks by simulating user interactions with mouse and keyboard inputs.
An open-source framework automating desktop workflows using large multimodal models.