Doc Scraper MCP Server

A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.

Features

Scrapes documentation from any web URL
Converts HTML documentation to markdown format
Saves the converted documentation to a specified output path
Integrates with the Model Context Protocol (MCP)

Installation

Installing via Smithery

To install Doc Scraper for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude

Clone the repository:

git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows, use: venv\Scripts\activate

Install the dependencies:

pip install -e .

Usage

The server can be run using Python:

python -m mcp_doc_scraper

Tool Description

The server provides a single tool:

Name: scrape_docs
Description: Scrape documentation from a URL and save as markdown
Input Parameters:
- url: The URL of the documentation to scrape
- output_path: The path where the markdown file should be saved

Project Structure

doc_scraper/
├── __init__.py
├── __main__.py
└── server.py

Dependencies

aiohttp
mcp
pydantic

Development

To set up the development environment:

Install development dependencies:

pip install -r requirements.txt

The server uses the Model Context Protocol. Make sure to familiarize yourself with MCP documentation.

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
mcp_doc_scraper		mcp_doc_scraper
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
smithery.yaml		smithery.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Doc Scraper MCP Server

Features

Installation

Installing via Smithery

Usage

Tool Description

Project Structure

Dependencies

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

askjohngeorge/mcp-doc-scraper

Folders and files

Latest commit

History

Repository files navigation

Doc Scraper MCP Server

Features

Installation

Installing via Smithery

Usage

Tool Description

Project Structure

Dependencies

Development

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages