MCP QEMU VM Control

Give your AI full computer access — safely. Let Claude (or any MCP-compatible LLM) see your screen, move the mouse, type on the keyboard, and run commands — all inside an isolated QEMU virtual machine. Perfect for AI-driven automation, testing, and computer-use experiments without risking your host system.

GitHub

MCP QEMU VM Control

Give your AI full computer access — safely.

Let Claude (or any MCP-compatible LLM) see your screen, move the mouse, type on the keyboard, and run commands — all inside an isolated QEMU virtual machine. Perfect for AI-driven automation, testing, and computer-use experiments without risking your host system.

A Model Context Protocol (MCP) server for controlling QEMU virtual machines via SSH. This server enables LLMs to interact with VMs through mouse/keyboard control, screenshots, and SSH command execution.

Features
Prerequisites
QEMU/libvirt Setup
Installation
Configuration
Usage
Tools Reference
Typical Workflow
Best Practices for LLM Automation
Architecture
Troubleshooting

Features

Mouse Control - Move cursor and click buttons
Keyboard Input - Type text and send key combinations
Action Batching - Execute sequences of UI actions in one call
Screenshots - Capture and retrieve VM screenshots
SSH Command Execution - Run shell commands on the VM
File Transfer - Upload and download files via SFTP
Project Management - Organize outputs into project folders with logs, results, and advice
Advice System - Save and retrieve tips for future LLM sessions

Prerequisites

Host System

Python 3.12+
uv (recommended) or pip
QEMU/KVM with libvirt
virt-manager (optional, for GUI management)

VM Requirements

Linux with X11 desktop environment
SSH server enabled
Required packages: openssh, xdotool, scrot, xrandr, xinput

QEMU/libvirt Setup

1. Install virtualization packages

Arch/Manjaro:

sudo pacman -S qemu-full libvirt virt-manager dnsmasq iptables-nft

Debian/Ubuntu:

sudo apt install qemu-kvm libvirt-daemon-system libvirt-clients virt-manager bridge-utils

Fedora:

sudo dnf install @virtualization

2. Configure libvirt

# Enable and start libvirtd
sudo systemctl enable --now libvirtd

# Add your user to libvirt group
sudo usermod -aG libvirt $USER

# Log out and back in, then verify
groups  # should show 'libvirt'

3. Set up the default network

libvirt provides a default NAT network (192.168.122.0/24) that VMs use to communicate with the host:

# Check network status
virsh -c qemu:///system net-list --all

# If 'default' is not active, start it
virsh -c qemu:///system net-start default

# Enable autostart
virsh -c qemu:///system net-autostart default

The default network configuration:

Bridge: virbr0
Host IP: 192.168.122.1
DHCP range: 192.168.122.2 - 192.168.122.254
Mode: NAT (VMs can access internet, host can access VMs)

4. Create a VM with virt-manager

Launch virt-manager
Create a new VM (File → New Virtual Machine)
Select installation media (ISO)
Allocate resources:
- Memory: 4096 MB recommended
- CPUs: 2+ recommended
Important: Under "Network selection", choose "Virtual network 'default': NAT"
Complete installation

5. Configure the VM

After installing the guest OS:

# Inside the VM - Install required packages

# Arch/Manjaro
sudo pacman -S --needed openssh xdotool scrot xorg-xrandr xorg-xinput

# Debian/Ubuntu
sudo apt install openssh-server xdotool scrot x11-xserver-utils xinput

# Enable SSH
sudo systemctl enable --now sshd

6. Create the automation user

On the VM:

# Create vmrobot user
sudo useradd -m -s /bin/bash vmrobot
sudo passwd vmrobot

# Set up SSH key authentication
sudo -u vmrobot mkdir -p /home/vmrobot/.ssh
sudo -u vmrobot chmod 700 /home/vmrobot/.ssh

On the host:

# Copy your public key to the VM
ssh-copy-id vmrobot@192.168.122.XX

# Or manually add to /home/vmrobot/.ssh/authorized_keys on VM

7. Grant X11 access to vmrobot

The vmrobot user needs permission to access the X display. On the VM, as the user who owns the desktop session:

# Quick fix (run once per session)
xhost +local:vmrobot

# Permanent fix - add to ~/.xprofile or ~/.xinitrc
echo "xhost +local:" >> ~/.xprofile

8. Find your VM's IP address

# From the host
virsh -c qemu:///system domifaddr manjaro

# Or from inside the VM
ip addr show | grep "inet 192.168.122"

9. Test the connection

# Test SSH
ssh vmrobot@192.168.122.XX

# Test X11 automation
ssh vmrobot@192.168.122.XX 'DISPLAY=:0 xdotool getmouselocation'

# Test screenshot
ssh vmrobot@192.168.122.XX 'DISPLAY=:0 scrot /tmp/test.png && echo Success'

Installation

1. Clone the repository

git clone https://github.com/Neanderthal/mcp-qemu-vm.git
cd mcp-qemu-vm

2. Install dependencies

Using uv (recommended):

uv venv && source .venv/bin/activate
uv pip install -r requirements.txt

Using pip:

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Configuration

Set environment variables or create a .env file:

Variable	Default	Description
`VM_HOST`	`192.168.122.79`	VM IP address
`VM_USER`	`vmrobot`	SSH username
`VM_PORT`	`22`	SSH port
`VM_DISPLAY`	`:0`	X11 display
`VM_IDENTITY`	(empty)	SSH private key path (optional)

Example .env file:

VM_HOST=192.168.122.79
VM_USER=vmrobot
VM_PORT=22
VM_DISPLAY=:0

Usage

MCP Client Configuration

Add to your MCP client config (e.g., Claude Desktop claude_desktop_config.json):

{
  "qemu-vm-control": {
    "command": "python3",
    "args": ["/path/to/mcp-qemu-vm/server.py"],
    "env": {
      "VM_HOST": "192.168.122.79",
      "VM_USER": "vmrobot",
      "VM_PORT": "22",
      "VM_DISPLAY": ":0"
    }
  }
}

Config file locations:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json
Linux: ~/.config/Claude/claude_desktop_config.json

Development with MCP Inspector

uv run mcp dev server.py

# With custom environment
VM_HOST=192.168.122.79 VM_USER=vmrobot uv run mcp dev server.py

Running Standalone

python server.py

Tools Reference

Project Management

Projects organize all outputs (screenshots, logs, results, advice) into timestamped folders under data/projects/.

Tool	Description
`project_init(name, description)`	Create a new project (required before screenshots)
`project_load(project_path)`	Load an existing project
`project_list()`	List all projects
`project_info()`	Get current project statistics
`project_log(message, level)`	Add a log entry
`project_read_logs(lines, level_filter)`	Read project logs
`project_save_result(filename, content)`	Save a result file
`project_save_advice(title, content)`	Save tips for future sessions
`project_read_advice()`	Read all saved advice

Mouse & Keyboard

Tool	Description
`move_mouse(x, y, mode)`	Move cursor (mode: "absolute" or "relative")
`click(button, count)`	Click mouse button (left/middle/right)
`type_text(text)`	Type text
`press_keys(keys)`	Press key combo, e.g., `["Ctrl", "L"]`
`wait(seconds)`	Pause execution
`run_actions(actions)`	Execute a sequence of actions in one call

Batch Actions Example

[
  {"action": "press_keys", "keys": ["Ctrl", "Shift", "p"]},
  {"action": "wait", "seconds": 0.5},
  {"action": "type_text", "text": "Terminal: Focus Terminal"},
  {"action": "press_keys", "keys": ["Return"]}
]

SSH Operations

Tool	Description
`ssh_execute(command)`	Run a shell command on the VM
`ssh_upload(local_path, remote_path)`	Upload file to VM
`ssh_download(remote_path, local_path)`	Download file from VM
`ssh_connection_info()`	Get connection status

Screenshots

Tool	Description
`take_screenshot()`	Capture screenshot (requires active project)

Screenshots are saved to the project's screenshots/ folder and exposed as MCP resources at vm://screenshot/{id}.

Typical Workflow

1. project_init("my-task", "Description")
2. take_screenshot()
3. ... perform VM operations ...
4. project_read_logs()
5. project_save_result("output.txt", data)
6. project_save_advice("Title", "Lessons learned...")

For continuing work:

1. project_list()
2. project_load("data/projects/...")  # Shows any saved advice
3. ... continue work ...

Best Practices for LLM Automation

These lessons were learned from real-world usage and help avoid common pitfalls.

1. Always Screenshot Before Actions

Before ANY interaction:

take_screenshot()
Analyze the image
Identify current focus (which window/field is active)
Only then proceed with actions

Never skip screenshots to "save time" - blind actions lead to errors.

2. Don't Trust Mouse Clicks for Focus

Clicking on a window/terminal does NOT reliably switch focus, especially in:

Nested environments (Citrix, remote desktop)
High-latency connections
Applications with multiple panels (VS Code, IDEs)

Use keyboard shortcuts instead:

[
  {"action": "press_keys", "keys": ["Ctrl", "Shift", "p"]},
  {"action": "wait", "seconds": 0.5},
  {"action": "type_text", "text": "Terminal: Focus Terminal"},
  {"action": "wait", "seconds": 0.3},
  {"action": "press_keys", "keys": ["Return"]},
  {"action": "wait", "seconds": 0.5}
]

Then take_screenshot() to verify before typing.

3. Required Wait Times

After This Action	Wait Time
Opening Command Palette	0.5s
Typing search text	0.3s
Pressing Enter/Return	0.5-1.0s
Command execution	1.0-2.0s
Window/focus switch	0.5s

Never rapid-fire actions - they may arrive out of order.

4. Use Batch Actions

Use run_actions() instead of separate tool calls to reduce latency and ensure ordering:

# Instead of 5 separate calls:
run_actions([
    {"action": "press_keys", "keys": ["Ctrl", "Shift", "p"]},
    {"action": "wait", "seconds": 0.5},
    {"action": "type_text", "text": "command"},
    {"action": "wait", "seconds": 0.3},
    {"action": "press_keys", "keys": ["Return"]}
])

5. SSH Scope Limitation

ssh_execute only reaches the first VM layer. For nested environments (VM → Citrix → Windows), use UI automation to type commands in the visible terminal.

6. Recovery Commands

Problem	Solution
Typed in wrong window (few chars)	`Escape` → `u` (undo in Vim)
Multiple lines in wrong place	`Escape` → `uuuuuuu`
File corrupted	`Escape` → `:e!` → `Enter` (reload)
VS Code revert	`Ctrl+Shift+P` → "Revert File"

7. Common Mistakes to Avoid

Typing immediately after clicking terminal (focus may not have switched)
Skipping screenshots to "save time"
Using ssh_execute for nested environment commands
Not waiting between actions
Assuming focus switched without verification

Architecture

┌─────────────┐         SSH          ┌──────────────┐
│             │ ◄──────────────────► │              │
│  MCP Server │                      │   QEMU VM    │
│   (Host)    │                      │   (Linux)    │
│             │                      │              │
└──────┬──────┘                      └──────────────┘
       │                                    │
       │ MCP Protocol                       │
       │ (stdio)                            │
       │                                    │
       ▼                                    ▼
┌─────────────┐                      xdotool, scrot
│  LLM Client │                      X11 automation
│  (Claude)   │
└─────────────┘

Network topology:

┌────────────────────────────────────────────────────┐
│  Host (192.168.122.1)                              │
│  ┌──────────┐                                      │
│  │ virbr0   │◄── NAT bridge                        │
│  └────┬─────┘                                      │
│       │                                            │
│  ┌────┴─────┐                                      │
│  │ QEMU VM  │ 192.168.122.79                       │
│  │ (manjaro)│                                      │
│  └──────────┘                                      │
└────────────────────────────────────────────────────┘

Project Structure

mcp-qemu-vm/
├── server.py           # Main MCP server (single file)
├── requirements.txt    # Python dependencies
├── data/
│   └── projects/       # Project folders
│       └── YYYYMMDD-HHMMSS_name/
│           ├── screenshots/
│           ├── logs/
│           ├── results/
│           └── advice/
└── README.md

Troubleshooting

Cannot connect to VM

Check VM is running:
```
virsh -c qemu:///system list
```

Check network is active:

virsh -c qemu:///system net-list
# If default is inactive:
virsh -c qemu:///system net-start default

Check VM has IP:

virsh -c qemu:///system domifaddr <vm-name>

Test SSH connectivity:
```
ssh vmrobot@192.168.122.XX
```

Mouse/keyboard not working

Verify xdotool is installed on VM: which xdotool
Check X11 display: echo $DISPLAY (should be :0)
Test manually: DISPLAY=:0 xdotool getmouselocation

Screenshots failing / X11 Authorization Error

If you see Authorization required, but no authorization protocol specified:

Quick fix (run as X session owner on VM):

xhost +local:vmrobot

Permanent fix - Add to ~/.xprofile:

xhost +local:

Verify access:

# Check current xhost settings
DISPLAY=:0 xhost

# Should show:
# access control enabled, only authorized clients can connect
# LOCAL:

VM network issues

# Restart the default network
virsh -c qemu:///system net-destroy default
virsh -c qemu:///system net-start default

# Check virbr0 bridge exists
ip addr show virbr0

License

MIT

Related Servers

Scout Monitoring MCP

sponsor

Put performance and error data directly in the hands of your AI assistant.

Alpha Vantage MCP Server

sponsor

Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more

DeepWiki by Devin

Remote, no-auth MCP server providing AI-powered codebase context and answers

Translator AI

Translate JSON i18n files using Google Gemini or local Ollama models, with incremental caching support.

Squads MCP

A secure MCP implementation for Squads multisig management on the Solana blockchain.

Plugged.in

A comprehensive proxy that combines multiple MCP servers into a single MCP. It provides discovery and management of tools, prompts, resources, and templates across servers, plus a playground for debugging when building MCP servers.

LAML MCP Server

Loads and validates LAML (YAML-based markup language) documents via the Model Context Protocol.

Kapture

A Chrome DevTools Extension that enables browser automation through the Model Context Protocol (MCP) for AI applications.

Qase MCP Server

An MCP server for interacting with the Qase test management platform.

Apktool

A server for Android APK analysis and reverse engineering using Apktool.

Odoo XML-RPC MCP Server

Interact with Odoo instances using the XML-RPC API. Requires configuration via environment variables or config files.

CC Token Saver

Use a local LLM for smaller or specialized tasks within Claude to save tokens.

MCP QEMU VM Control

MCP QEMU VM Control

Table of Contents

Features

Prerequisites

Host System

VM Requirements

QEMU/libvirt Setup

1. Install virtualization packages

2. Configure libvirt

3. Set up the default network

4. Create a VM with virt-manager

5. Configure the VM

6. Create the automation user

7. Grant X11 access to vmrobot

8. Find your VM's IP address

9. Test the connection

Installation

1. Clone the repository

2. Install dependencies

Configuration

Usage

MCP Client Configuration

Development with MCP Inspector

Running Standalone

Tools Reference

Project Management

Mouse & Keyboard

Batch Actions Example

SSH Operations

Screenshots

Typical Workflow

Best Practices for LLM Automation

1. Always Screenshot Before Actions

2. Don't Trust Mouse Clicks for Focus

3. Required Wait Times

4. Use Batch Actions

5. SSH Scope Limitation

6. Recovery Commands

7. Common Mistakes to Avoid

Architecture

Project Structure

Troubleshooting

Cannot connect to VM

Mouse/keyboard not working

Screenshots failing / X11 Authorization Error

VM network issues

License

Related

Related Servers

Scout Monitoring MCP

Alpha Vantage MCP Server

DeepWiki by Devin

Translator AI

Squads MCP

Plugged.in

LAML MCP Server

Kapture

Qase MCP Server

Apktool

Odoo XML-RPC MCP Server

CC Token Saver