ScreenPilot

Mtehabsim/ScreenPilot

🔗 Latest commit:349180d

🕒 Updated:Sep 9, 2025, 01:06 PM

Python

Browser Automation

MCP server to let LLM take full control on your device by providing screen automation toolkit for controlling and interacting with graphical user interface

MCP Trust Score

Based on our comprehensive evaluation criteria

🤖 Evaluated by gemini-2.5-flashFix

Trust Score24/100

• Basic MCP protocol features implemented (12/40)
• Room for improvement in GitHub community
• Moderate dependency usage (10/20)
• Room for improvement in deployment maturity
• Documentation (8/8)
• Archestra MCP Trust badge (2/2)

GitHub Metrics

Repository statistics and activity

⭐ GitHub Stars:38

👥 Contributors:3

📋 Total Issues:2

📦 Has Releases:No

🔧 Has CI/CD Pipeline:No

Configuration

Configuration example extracted from README.md for Claude Desktop and other clients.

🤖 Evaluated by gemini-2.5-flashFix

{
  "device-controll": {
    "command": "pathToEnv\\venv\\Scripts\\python.exe",
    "args": [
      "pathToProject\\ScreenPilot\\main.py"
    ],
    "env": {}
  }
}

MCP Protocol Support

Implemented MCP protocol features

🤖 Evaluated by gemini-2.5-flashFix

Tools:✓

Prompts:✗

Resources:✗

Sampling:✗

Roots:✗

Logging:✗

STDIO Transport:✓

HTTP Transport:✗

OAuth2 Auth:✗

Dependencies

39 dependencies

Libraries and frameworks used by this MCP server

🤖 Evaluated by gemini-2.5-flashFix

Main

mcp

starlette

PyAutoGUI

uvicorn

MouseInfo

PyGetWindow

PyMsgBox

pyperclip

PyRect

PyScreeze

pytweening

Medium

pydantic

pydantic-settings

pydantic_core

sse-starlette

httpx-sse

pillow

httpx

requests

typer

click

anyio

sniffio

h11

httpcore

Light

python-dotenv

rich

markdown-it-py

mdurl

Pygments

colorama

shellingham

annotated-types

typing-inspection

typing_extensions

certifi

charset-normalizer

idna

urllib3

README.md

ScreenPilot

MCP server to let LLM take full control on your device by providing screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education and having fun.

Main Features

📷 Screen capture and analysis
🖱️ Mouse control (clicking, positioning)
⌨️ Keyboard input (typing, key presses, hotkeys)

watch demo

https://github.com/user-attachments/assets/c18380c0-b3dd-4b7c-925d-28ef205ca11f

Installation

Install python 3.12

Clone the repository:

git clone https://github.com/Mtehabsim/ScreenPilot.git

create virtiual environment


python -m venv venv

activate the env

venv\Scripts\activate

Install the required packages:
```
pip install -r requirements.txt
```
Open Claude AI desktop
file -> settings -> developer -> edit config
open config file and paste this

{
    "mcpServers": {
        "device-controll": {
            "command": "pathToEnv\\venv\\Scripts\\python.exe",
            "args": [
                "pathToProject\\ScreenPilot\\main.py"
            ]
        }
    }
}

Replace
"pathToEnv\venv\Scripts\python.exe" → with the full path to your python.exe
"pathToProject\ScreenPilot\main.py" → with the full path to your main.py file
Save the config file.
Open Claude AI Desktop.
Go to File → Exit
You can now open Claude AI Desktop and enjoy ScreenPilot.

Available Tools

Screen Capture: Take screenshots and get screen information
Mouse Control: Move the mouse and perform clicks
Keyboard Actions: Type text, press keys, and use hotkey combinations
Scrolling: Scroll in different directions and to specific positions
Element Detection: Check if elements exist on screen and wait for them to appear
Action Sequences: Perform multiple actions in sequence

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Resources

GitHub Repository

Add Quality Badge

Show your MCP trust score in your README

[![Trust Score](https://archestra.ai/mcp-catalog/api/badge/quality/Mtehabsim/ScreenPilot)](https://archestra.ai/mcp-catalog/mtehabsim__screenpilot)

Edit This Server Add New MCP Server Report an Issue

README.md

ScreenPilot

Main Features

📷 Screen capture and analysis
🖱️ Mouse control (clicking, positioning)
⌨️ Keyboard input (typing, key presses, hotkeys)

watch demo

https://github.com/user-attachments/assets/c18380c0-b3dd-4b7c-925d-28ef205ca11f

Installation

Install python 3.12

Clone the repository:

git clone https://github.com/Mtehabsim/ScreenPilot.git

create virtiual environment


python -m venv venv

activate the env

venv\Scripts\activate

Install the required packages:
```
pip install -r requirements.txt
```
Open Claude AI desktop
file -> settings -> developer -> edit config
open config file and paste this

{
    "mcpServers": {
        "device-controll": {
            "command": "pathToEnv\\venv\\Scripts\\python.exe",
            "args": [
                "pathToProject\\ScreenPilot\\main.py"
            ]
        }
    }
}

Replace
"pathToEnv\venv\Scripts\python.exe" → with the full path to your python.exe
"pathToProject\ScreenPilot\main.py" → with the full path to your main.py file
Save the config file.
Open Claude AI Desktop.
Go to File → Exit
You can now open Claude AI Desktop and enjoy ScreenPilot.

Available Tools

Screen Capture: Take screenshots and get screen information
Mouse Control: Move the mouse and perform clicks
Keyboard Actions: Type text, press keys, and use hotkey combinations
Scrolling: Scroll in different directions and to specific positions
Element Detection: Check if elements exist on screen and wait for them to appear
Action Sequences: Perform multiple actions in sequence

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.