Automate desktop actions like mouse control, keyboard input, and taking screenshots.
A Model Context Protocol (MCP) server that provides desktop automation through RobotJS, along with screenshot capture. It lets LLMs control mouse movements, send keyboard input, and capture screenshots of the desktop environment.
Here's how to configure Claude Desktop to use the MCP Desktop Automation server:
{
  "mcpServers": {
    "desktop-automation": {
      "command": "npx",
      "args": ["-y", "mcp-desktop-automation"]
    }
  }
}
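If the Claude Desktop configuration already lists other servers, the entry above needs to be merged into it rather than pasted over the whole file. The following is a minimal TypeScript sketch of that merge, assuming the configuration has already been parsed into an object. The `McpConfig` type and the `addDesktopAutomation` helper are hypothetical names used for illustration; they are not part of this server or of Claude Desktop.

```typescript
// Hypothetical types describing the shape of the configuration shown above.
interface McpServerEntry {
  command: string;
  args: string[];
}

interface McpConfig {
  mcpServers?: Record<string, McpServerEntry>;
  [key: string]: unknown;
}

// Returns a copy of `config` with the desktop-automation entry added.
// Existing servers and unrelated top-level keys are preserved.
function addDesktopAutomation(config: McpConfig): McpConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      "desktop-automation": {
        command: "npx",
        args: ["-y", "mcp-desktop-automation"],
      },
    },
  };
}

// Example: merging into a config that already has another (made-up) server.
const existing: McpConfig = {
  mcpServers: { "other-server": { command: "node", args: ["server.js"] } },
};
const merged = addDesktopAutomation(existing);
console.log(JSON.stringify(merged, null, 2));
```

The helper returns a new object instead of mutating its input, so the original configuration stays available if the result needs to be discarded.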
This server requires system-level permissions to control the mouse, send keyboard input, and capture the screen.
When first running Claude Desktop with this server, you may need to grant these permissions in your operating system's security settings.
While this server works with various MCP clients, it has been primarily tested with Claude Desktop.
Important: The current implementation has a 1MB response size limit. For screen captures, this means a capture whose encoded size exceeds the limit may not be returned.
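To get a rough sense of what the 1MB limit allows, the sketch below estimates the base64-encoded size of an image and compares it against the limit. It rests on two assumptions that the text above does not confirm: that "1MB" means 1,048,576 bytes, and that the limit applies to the base64-encoded image data (the encoding MCP uses for image content). `base64Length` and `fitsResponseLimit` are hypothetical helper names.

```typescript
// Assumption: "1MB" means 1024 * 1024 bytes; the server may count differently.
const RESPONSE_LIMIT_BYTES = 1024 * 1024;

// Length of the padded base64 encoding of `rawBytes` bytes:
// every group of 3 input bytes becomes 4 output characters.
function base64Length(rawBytes: number): number {
  return Math.ceil(rawBytes / 3) * 4;
}

// True if an image of `rawBytes` bytes would fit under the limit once encoded.
// Ignores the JSON envelope around the data, so treat a near miss as too big.
function fitsResponseLimit(rawBytes: number): boolean {
  return base64Length(rawBytes) <= RESPONSE_LIMIT_BYTES;
}

console.log(base64Length(600000));      // 800000
console.log(fitsResponseLimit(600000)); // true
console.log(fitsResponseLimit(900000)); // false
```

Under these assumptions, the encoded image has to stay below roughly 768 KB of raw data, since base64 adds about one third to the size.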
get_screen_size
screen_capture
keyboard_press
    key (string, required): Key to press (e.g., 'enter', 'a', 'control')
    modifiers (array of strings, optional): Modifier keys to hold while pressing the key. Possible values: "control", "shift", "alt", "command"
keyboard_type
    text (string, required): Text to type
mouse_click
    button (string, optional, default: "left"): Mouse button to click. Possible values: "left", "right", "middle"
    double (boolean, optional, default: false): Whether to perform a double click
mouse_move
    x (number, required): X coordinate
    y (number, required): Y coordinate

The server provides access to screenshots:
Screenshot List (screenshot://list)
Screenshot Content (screenshot://{id})
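The tools and resources above are reached through standard MCP JSON-RPC requests: `tools/call` for tools and `resources/read` for resources. The sketch below builds such requests in TypeScript, with argument types copied from the parameter descriptions above. It is illustrative only: `toolCall`, `readResource`, and the `ToolArguments` map are hypothetical names, and the arguments for `get_screen_size` and `screen_capture` are left open because they are not listed here. In practice an MCP client library would normally construct and send these messages.

```typescript
type Modifier = "control" | "shift" | "alt" | "command";

// Argument shapes taken from the tool descriptions above.
interface ToolArguments {
  get_screen_size: Record<string, unknown>; // parameters not documented above
  screen_capture: Record<string, unknown>;  // parameters not documented above
  keyboard_press: { key: string; modifiers?: Modifier[] };
  keyboard_type: { text: string };
  mouse_click: { button?: "left" | "right" | "middle"; double?: boolean };
  mouse_move: { x: number; y: number };
}

let nextId = 1;

// Builds an MCP `tools/call` JSON-RPC request for one of the tools above.
function toolCall<T extends keyof ToolArguments>(name: T, args: ToolArguments[T]) {
  return {
    jsonrpc: "2.0" as const,
    id: nextId++,
    method: "tools/call" as const,
    params: { name, arguments: args },
  };
}

// Builds an MCP `resources/read` JSON-RPC request for a screenshot URI.
function readResource(uri: string) {
  return {
    jsonrpc: "2.0" as const,
    id: nextId++,
    method: "resources/read" as const,
    params: { uri },
  };
}

// Press Control+A, double-click with the default (left) button, list screenshots.
const selectAll = toolCall("keyboard_press", { key: "a", modifiers: ["control"] });
const doubleClick = toolCall("mouse_click", { double: true });
const listShots = readResource("screenshot://list");

console.log(JSON.stringify(selectAll));
```

Typing the arguments per tool lets the compiler reject a misspelled modifier or a missing coordinate before any request reaches the server.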
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.