mcp-local-rag

nkapila6/mcp-local-rag

🔗 Latest commit:2e40b33

🕒 Updated:Sep 9, 2025, 01:06 PM

Python

AI Tools

"primitive" RAG-like web search model context protocol (MCP) server that runs locally. ✨ no APIs ✨

MCP Trust Score

Based on our comprehensive evaluation criteria

🤖 Evaluated by gemini-2.5-flashFix

Trust Score57/100

• Core MCP protocol features implemented (20/40)
• Limited GitHub community activity (8/20)
• Optimal dependency management (20/20)
• Moderate deployment maturity (5/10)
• Documentation (8/8)
• Archestra MCP Trust score badge is missing

GitHub Metrics

Repository statistics and activity

⭐ GitHub Stars:66

👥 Contributors:4

📋 Total Issues:1

📦 Has Releases:No

🔧 Has CI/CD Pipeline:Yes

Configuration

Configuration example extracted from README.md for Claude Desktop and other clients.

🤖 Evaluated by gemini-2.5-flashFix

{
  "mcp-local-rag": {
    "command": "uvx",
    "args": [
      "--python=3.10",
      "--from",
      "git+https://github.com/nkapila6/mcp-local-rag",
      "mcp-local-rag"
    ],
    "env": {}
  },
  "ghcr.io-nkapila6-mcp-local-rag-docker": {
    "command": "docker",
    "args": [
      "run",
      "--rm",
      "-i",
      "--init",
      "-e",
      "DOCKER_CONTAINER=true",
      "ghcr.io/nkapila6/mcp-local-rag:latest"
    ],
    "env": {
      "DOCKER_CONTAINER": "true"
    },
    "docker_image": "ghcr.io/nkapila6/mcp-local-rag:latest"
  }
}

MCP Protocol Support

Implemented MCP protocol features

🤖 Evaluated by gemini-2.5-flashFix

Tools:✓

Prompts:✗

Resources:✓

Sampling:✗

Roots:✗

Logging:✗

STDIO Transport:✓

HTTP Transport:✗

OAuth2 Auth:✗

Dependencies

6 dependencies

Libraries and frameworks used by this MCP server

🤖 Evaluated by gemini-2.5-flashFix

Main

fastmcp

beautifulsoup4

ddgs

mediapipe

aiohttp

requests

README.md

mcp-local-rag

"primitive" RAG-like web search model context protocol (MCP) server that runs locally. ✨ no APIs ✨

%%{init: {'theme': 'base'}}%%
flowchart TD
    A[User] -->|1.Submits LLM Query| B[Language Model]
    B -->|2.Sends Query| C[mcp-local-rag Tool]
    
    subgraph mcp-local-rag Processing
    C -->|Search DuckDuckGo| D[Fetch 10 search results]
    D -->|Fetch Embeddings| E[Embeddings from Google's MediaPipe Text Embedder]
    E -->|Compute Similarity| F[Rank Entries Against Query]
    F -->|Select top k results| G[Context Extraction from URL]
    end
    
    G -->|Returns Markdown from HTML content| B
    B -->|3.Generated response with context| H[Final LLM Output]
    H -->|5.Present result to user| A

    classDef default stroke:#333,stroke-width:2px;
    classDef process stroke:#333,stroke-width:2px;
    classDef input stroke:#333,stroke-width:2px;
    classDef output stroke:#333,stroke-width:2px;

    class A input;
    class B,C process;
    class G output;

Installation

Locate your MCP config path here or check your MCP client settings.

Run Directly via `uvx`

This is the easiest and quickest method. You need to install uv for this to work.

Add this to your MCP server configuration:

{
  "mcpServers": {
    "mcp-local-rag":{
      "command": "uvx",
        "args": [
          "--python=3.10",
          "--from",
          "git+https://github.com/nkapila6/mcp-local-rag",
          "mcp-local-rag"
        ]
      }
  }
}

Using Docker (recommended)

Ensure you have Docker installed.

Add this to your MCP server configuration:

{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "docker",
      "args": [
        "run",
        "--rm",
        "-i",
        "--init",
        "-e",
        "DOCKER_CONTAINER=true",
        "ghcr.io/nkapila6/mcp-local-rag:latest"
      ]
    }
  }
}

Security audits

MseeP does security audits on every MCP server, you can see the security audit of this MCP server by clicking here.

MCP Clients

The MCP server should work with any MCP client that supports tool calling. Has been tested on the below clients.

Claude Desktop
Cursor
Goose
Others? You try!

Examples on Claude Desktop

When an LLM (like Claude) is asked a question requiring recent web information, it will trigger mcp-local-rag.

When asked to fetch/lookup/search the web, the model prompts you to use MCP server for the chat.

In the example, have asked it about Google's latest Gemma models released yesterday. This is new info that Claude is not aware about.

Result

mcp-local-rag performs a live web search, extracts context, and sends it back to the model—giving it fresh knowledge:

Contributing

Have ideas or want to improve this project? Issues and pull requests are welcome!

License

This project is licensed under the MIT License.

Resources

GitHub Repository

Add Quality Badge

Show your MCP trust score in your README

[![Trust Score](https://archestra.ai/mcp-catalog/api/badge/quality/nkapila6/mcp-local-rag)](https://archestra.ai/mcp-catalog/nkapila6__mcp-local-rag)

Edit This Server Add New MCP Server Report an Issue

README.md

mcp-local-rag

"primitive" RAG-like web search model context protocol (MCP) server that runs locally. ✨ no APIs ✨

%%{init: {'theme': 'base'}}%%
flowchart TD
    A[User] -->|1.Submits LLM Query| B[Language Model]
    B -->|2.Sends Query| C[mcp-local-rag Tool]
    
    subgraph mcp-local-rag Processing
    C -->|Search DuckDuckGo| D[Fetch 10 search results]
    D -->|Fetch Embeddings| E[Embeddings from Google's MediaPipe Text Embedder]
    E -->|Compute Similarity| F[Rank Entries Against Query]
    F -->|Select top k results| G[Context Extraction from URL]
    end
    
    G -->|Returns Markdown from HTML content| B
    B -->|3.Generated response with context| H[Final LLM Output]
    H -->|5.Present result to user| A

    classDef default stroke:#333,stroke-width:2px;
    classDef process stroke:#333,stroke-width:2px;
    classDef input stroke:#333,stroke-width:2px;
    classDef output stroke:#333,stroke-width:2px;

    class A input;
    class B,C process;
    class G output;

Installation

Locate your MCP config path here or check your MCP client settings.

Run Directly via `uvx`

This is the easiest and quickest method. You need to install uv for this to work.

Add this to your MCP server configuration:

{
  "mcpServers": {
    "mcp-local-rag":{
      "command": "uvx",
        "args": [
          "--python=3.10",
          "--from",
          "git+https://github.com/nkapila6/mcp-local-rag",
          "mcp-local-rag"
        ]
      }
  }
}

Using Docker (recommended)

Ensure you have Docker installed.

Add this to your MCP server configuration:

{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "docker",
      "args": [
        "run",
        "--rm",
        "-i",
        "--init",
        "-e",
        "DOCKER_CONTAINER=true",
        "ghcr.io/nkapila6/mcp-local-rag:latest"
      ]
    }
  }
}

Security audits

MseeP does security audits on every MCP server, you can see the security audit of this MCP server by clicking here.

MCP Clients

The MCP server should work with any MCP client that supports tool calling. Has been tested on the below clients.

Claude Desktop
Cursor
Goose
Others? You try!

Examples on Claude Desktop

When an LLM (like Claude) is asked a question requiring recent web information, it will trigger mcp-local-rag.

When asked to fetch/lookup/search the web, the model prompts you to use MCP server for the chat.

In the example, have asked it about Google's latest Gemma models released yesterday. This is new info that Claude is not aware about.

Result

mcp-local-rag performs a live web search, extracts context, and sends it back to the model—giving it fresh knowledge:

Contributing

Have ideas or want to improve this project? Issues and pull requests are welcome!

License

This project is licensed under the MIT License.

Related MCP Servers

mcp-shrimp-task-manager

88/100

Shrimp Task Manager is a task tool built for AI Agents, emphasizing chain-of-thought, reflection, and style consistency. It converts natural language into structured dev tasks with dependency tracking and iterative refinement, enabling agent-like developer behavior in reasoning AI systems.

cognee-mcp

86/100

Memory for AI Agents in 5 lines of code

shadcn-ui-mcp-server

82/100

A mcp server to allow LLMS gain context about shadcn ui component structure,usage and installation,compaitable with react,svelte 5,and vue

mobile-mcp

81/100

Model Context Protocol Server for Mobile Automation and Scraping (iOS, Android, Emulators, Simulators and Real Devices)

HuggingFace

80/100

Remote MCP server for HuggingFace services - access models, datasets, and spaces

markitdown-mcp

80/100

Python tool for converting files and office documents to Markdown.

mcp-local-rag

Main

mcp-local-rag

Installation

Run Directly via uvx

Using Docker (recommended)

Security audits

MCP Clients

Examples on Claude Desktop

Result

Contributing

License

mcp-local-rag

Installation

Run Directly via uvx

Using Docker (recommended)

Security audits

MCP Clients

Examples on Claude Desktop

Result

Contributing

License

Related MCP Servers

mcp-shrimp-task-manager

cognee-mcp

shadcn-ui-mcp-server

mobile-mcp

HuggingFace

markitdown-mcp

Run Directly via `uvx`

Run Directly via `uvx`