Automate desktop actions like mouse control, keyboard input, and taking screenshots.
A Model Context Protocol (MCP) server that provides desktop automation using RobotJS, together with screen capture. It lets LLMs move and click the mouse, send keyboard input, and capture screenshots of the desktop environment.
Here's how to configure Claude Desktop to use the MCP Desktop Automation server:
{
  "mcpServers": {
    "desktop-automation": {
      "command": "npx",
      "args": ["-y", "mcp-desktop-automation"]
    }
  }
}
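If the client configuration already lists other servers, the entry above has to be merged in rather than pasted over them. The following is a minimal sketch of such a merge, not part of the project itself: the `ClientConfig` and `ServerEntry` types, the `withDesktopAutomation` helper, and the pre-existing `other` server are all hypothetical names used for illustration.

```typescript
// Hypothetical shapes for the config file; only the fields shown above are modeled.
interface ServerEntry {
  command: string;
  args: string[];
}

interface ClientConfig {
  mcpServers?: Record<string, ServerEntry>;
  [key: string]: unknown;
}

// Returns a copy of `config` with the desktop-automation entry added.
// Existing servers and unrelated top-level keys are preserved; the input is not mutated.
function withDesktopAutomation(config: ClientConfig): ClientConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      "desktop-automation": {
        command: "npx",
        args: ["-y", "mcp-desktop-automation"],
      },
    },
  };
}

// Example: a config that already contains one (made-up) server.
const existing: ClientConfig = {
  mcpServers: { other: { command: "node", args: ["server.js"] } },
};
const merged = withDesktopAutomation(existing);
console.log(JSON.stringify(merged, null, 2));
```

The helper works on plain objects only; reading and writing the actual config file is left out because its location depends on the operating system and client.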
This server requires system-level permissions to control the mouse and keyboard and to capture the screen.
When first running Claude Desktop with this server, you may need to grant these permissions in your operating system's security settings.
While this server works with various MCP clients, it has been primarily tested with Claude Desktop.
Important: The current implementation has a 1MB response size limit. For screen captures, this means that a screenshot whose encoded size exceeds the limit may fail to be returned.
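To get a feel for what the limit means in practice, the sketch below estimates whether an image of a given raw size would still fit once encoded. It rests on several assumptions that the text above does not confirm: that "1MB" means 1,048,576 bytes, that the image is transferred as base64 (as MCP does for binary content), and that about 1 KB is enough headroom for the surrounding response. `fitsResponseLimit` is a hypothetical helper, not part of the server.

```typescript
// Assumption: "1MB" is read as 1,048,576 bytes; the server may mean 1,000,000 instead.
const RESPONSE_LIMIT_BYTES = 1024 * 1024;

// Base64 encodes every 3 raw bytes as 4 characters, padding the final group.
function base64Length(rawBytes: number): number {
  return Math.ceil(rawBytes / 3) * 4;
}

// Hypothetical helper: true if an image of `rawBytes` bytes, once base64-encoded,
// stays within the limit after reserving `overheadBytes` for the rest of the response.
function fitsResponseLimit(rawBytes: number, overheadBytes = 1024): boolean {
  return base64Length(rawBytes) + overheadBytes <= RESPONSE_LIMIT_BYTES;
}

console.log(fitsResponseLimit(500_000)); // true: about 667 KB after encoding
console.log(fitsResponseLimit(900_000)); // false: about 1.2 MB after encoding
```

Under these assumptions the usable raw image size is roughly three quarters of the limit, since base64 adds about a third on top of the original bytes.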
The server provides the following tools:
get_screen_size
screen_capture
keyboard_press
    key (string, required): Key to press (e.g., 'enter', 'a', 'control')
    modifiers (array of strings, optional): Modifier keys to hold while pressing the key. Possible values: "control", "shift", "alt", "command"
keyboard_type
    text (string, required): Text to type
mouse_click
    button (string, optional, default: "left"): Mouse button to click. Possible values: "left", "right", "middle"
    double (boolean, optional, default: false): Whether to perform a double click
mouse_move
    x (number, required): X coordinate
    y (number, required): Y coordinate
The server provides access to screenshots:
Screenshot List (screenshot://list)
Screenshot Content (screenshot://{id})
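MCP clients invoke tools with a JSON-RPC `tools/call` request and read resources with `resources/read`. The sketch below shows how the parameters listed above could be modeled as types and turned into such requests. It is an illustration rather than the server's own code: `ToolArgs`, `toolCall`, and `readScreenshot` are hypothetical names, and the empty argument objects for `get_screen_size` and `screen_capture` are an assumption based on no parameters being listed for them.

```typescript
type Modifier = "control" | "shift" | "alt" | "command";
type MouseButton = "left" | "right" | "middle";

// Argument shapes taken from the parameter list above.
// Assumption: tools with no listed parameters take an empty object.
interface ToolArgs {
  get_screen_size: Record<string, never>;
  screen_capture: Record<string, never>;
  keyboard_press: { key: string; modifiers?: Modifier[] };
  keyboard_type: { text: string };
  mouse_click: { button?: MouseButton; double?: boolean };
  mouse_move: { x: number; y: number };
}

interface JsonRpcRequest<P> {
  jsonrpc: "2.0";
  id: number;
  method: string;
  params: P;
}

let nextId = 1;

// Builds a `tools/call` request; the compiler rejects unknown tools or wrong argument types.
function toolCall<K extends keyof ToolArgs>(
  name: K,
  args: ToolArgs[K],
): JsonRpcRequest<{ name: K; arguments: ToolArgs[K] }> {
  return { jsonrpc: "2.0", id: nextId++, method: "tools/call", params: { name, arguments: args } };
}

// Builds a `resources/read` request for one screenshot, following the screenshot://{id} pattern.
function readScreenshot(id: string): JsonRpcRequest<{ uri: string }> {
  return { jsonrpc: "2.0", id: nextId++, method: "resources/read", params: { uri: `screenshot://${id}` } };
}

// Example: press Ctrl+S, then move the pointer to (100, 200).
const save = toolCall("keyboard_press", { key: "s", modifiers: ["control"] });
const move = toolCall("mouse_move", { x: 100, y: 200 });
console.log(JSON.stringify(save));
console.log(JSON.stringify(move));
```

In normal use an MCP client library builds and sends these messages for you; the sketch only makes the shape of the arguments explicit.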
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.