Skip to content

teoobarca/perplexity-mcp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

35 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation


Logo
Perplexity MCP Server



The only Perplexity MCP server with multi-account pooling, an admin dashboard, and zero-cost monitoring.
No API keys. No per-query fees. Uses your existing Perplexity Pro session.


MIT LicenseΒ  Python 3.10+Β  MCP CompatibleΒ  Token PoolΒ  Admin Panel

Features Β· Quick Start Β· Admin Panel Β· Configuration Β· Architecture



🎯 Why This One?

Most Perplexity MCP servers are single-account wrappers around the paid Sonar API. This one is different:

  • πŸ†“ No API costs β€” uses session cookies, not the paid API. Same features, zero per-query fees
  • 🏊 Multi-account pool β€” round-robin across N accounts with automatic failover
  • πŸ“Š Admin dashboard β€” React UI to monitor quotas, manage tokens, tail logs in real-time
  • ❀️ Zero-cost health checks β€” monitors all accounts via rate-limit API without consuming queries
  • πŸ›‘οΈ Downgrade protection β€” detects when Perplexity silently returns a regular result instead of deep research
  • πŸ“± Telegram alerts β€” get notified when tokens expire or quota runs out

✨ Features

πŸ” Smart Search

  • Pro Search β€” fast, accurate answers with citations
  • Reasoning β€” multi-model thinking for complex decisions
  • Deep Research β€” comprehensive 10-30+ citation reports
  • Multi-source β€” web, scholar, and social

πŸ€– 9 Models Available

  • sonar Β· gpt-5.2 Β· claude-4.5-sonnet Β· grok-4.1
  • gpt-5.2-thinking Β· claude-4.5-sonnet-thinking
  • gemini-3.0-pro Β· kimi-k2-thinking Β· grok-4.1-reasoning

🏊 Token Pool Engine

  • Round-robin rotation across accounts
  • Exponential backoff on failures (60s β†’ 120s β†’ ... β†’ 1h cap)
  • 3-level fallback β€” Pro β†’ auto (exhausted) β†’ anonymous
  • Smart quota tracking β€” decrements locally, verifies at zero
  • Hot-reload β€” add/remove tokens without restart

πŸ›‘οΈ Production Hardened

  • Silent deep research downgrade detection
  • Atomic config saves (no corruption on crash)
  • Connection drop handling
  • Cross-process state sharing via pool_state.json
  • 53 unit tests

πŸ–ΌοΈ Screenshots

Token Pool Dashboard

Token Pool Dashboard

Stats grid, monitor controls, sortable token table with per-account quotas (Pro / Research / Agentic), filter pills, and one-click actions.



Log Viewer

Log Viewer

Live log streaming with auto-refresh, level filtering, search highlighting, follow mode, and line numbers.


πŸš€ Quick Start

1. Clone & Install

git clone https://github.com/teoobarca/perplexity-mcp.git
cd perplexity-mcp
uv sync

2. Add to Your AI Tool

🟣 Claude Code
claude mcp add perplexity -s user -- uv --directory /path/to/perplexity-mcp run perplexity-mcp
🟒 Cursor

Go to Settings β†’ MCP β†’ Add new server and paste:

{
  "command": "uv",
  "args": ["--directory", "/path/to/perplexity-mcp", "run", "perplexity-mcp"]
}
πŸ”΅ Windsurf / VS Code / Other MCP clients

Add to your MCP config file:

{
  "mcpServers": {
    "perplexity": {
      "command": "uv",
      "args": ["--directory", "/path/to/perplexity-mcp", "run", "perplexity-mcp"]
    }
  }
}

That's it. Works immediately with anonymous sessions. Add your tokens for Pro access β€” see Authentication.


πŸ› οΈ Tools

Two MCP tools with LLM-optimized descriptions so your AI assistant picks the right one automatically:

perplexity_ask

AI-powered answer engine for tech questions, documentation lookups, and how-to guides.

Parameter Type Default Description
query string required Natural language question with context
model string null Model selection (see models)
sources array ["web"] Sources: web, scholar, social
language string en-US ISO 639 language code

Mode auto-detection: Models with thinking or reasoning in the name automatically switch to Reasoning mode.

"gpt-5.2"          β†’ Pro Search
"gpt-5.2-thinking"  β†’ Reasoning Mode  ← auto-detected

perplexity_research

Deep research agent for comprehensive analysis. Returns extensive reports with 10-30+ citations.

Parameter Type Default Description
query string required Detailed research question with full context
sources array ["web", "scholar"] Sources: web, scholar, social
language string en-US ISO 639 language code

Tip

Deep research takes 2-5 minutes per query. Provide detailed context and constraints for better results. The server has a 15-minute timeout to accommodate this.


πŸ–₯️ Admin Panel

A built-in web dashboard for managing your token pool. Start it with:

perplexity-server

Opens automatically at http://localhost:8123/admin/

Feature Description
πŸ“Š Stats Grid Total clients, Online/Exhausted counts, Monitor status
πŸ“‹ Token Table Sortable columns, filter pills (Online/Exhausted/Offline/Unknown), icon actions
πŸ’° Quota Column Per-token breakdown β€” Pro remaining, Research quota, Agentic research
❀️ Health Monitor Zero-cost checks via rate-limit API, configurable interval
πŸ“± Telegram Alerts Notifications on token state changes (expired, exhausted, back online)
πŸ”„ Fallback Toggle Enable/disable automatic Pro β†’ free fallback
πŸ“₯ Import/Export Bulk token management via JSON config files
πŸ“ Log Viewer Live streaming, level filter (Error/Warning/Info/Debug), search, follow mode
πŸ§ͺ Test Button Run health check on individual tokens or all at once

πŸ” Authentication

By default, the server uses anonymous Perplexity sessions (rate limited). For Pro access, add your session tokens.

How to Get Tokens

  1. Sign in at perplexity.ai
  2. Open DevTools (F12) β†’ Application β†’ Cookies
  3. Copy these two cookies:
    • next-auth.csrf-token
    • next-auth.session-token

Single Token

Create token_pool_config.json in the project root:

{
  "tokens": [
    {
      "id": "my-account",
      "csrf_token": "your-csrf-token-here",
      "session_token": "your-session-token-here"
    }
  ]
}

Multi-Token Pool

Add multiple accounts for round-robin rotation with automatic failover:

{
  "monitor": {
    "enable": true,
    "interval": 6,
    "tg_bot_token": "optional-telegram-bot-token",
    "tg_chat_id": "optional-chat-id"
  },
  "fallback": {
    "fallback_to_auto": true
  },
  "tokens": [
    { "id": "account-1", "csrf_token": "...", "session_token": "..." },
    { "id": "account-2", "csrf_token": "...", "session_token": "..." },
    { "id": "account-3", "csrf_token": "...", "session_token": "..." }
  ]
}

Note

Session tokens last ~30 days. The monitor detects expired tokens and alerts you via Telegram.


βš™οΈ Configuration

Environment Variables

Variable Default Description
PERPLEXITY_TIMEOUT 900 Request timeout in seconds (15 min for deep research)
SOCKS_PROXY β€” SOCKS5 proxy URL (socks5://host:port)

Token States

Token state is computed automatically from session_valid + rate_limits (never set manually):

State Meaning Badge Behavior
🟒 normal Session valid, pro quota available Online Used for all requests
🟑 exhausted Session valid, pro quota = 0 Exhausted Skipped for Pro, used as auto fallback
πŸ”΄ offline Session invalid/expired Offline Not used for any requests
πŸ”΅ unknown Not yet checked Unknown Used normally (quota assumed available)

Fallback Chain

When a Pro request fails, the server tries progressively:

1. βœ… Next client with Pro quota (round-robin)
2. βœ… Next client with Pro quota ...
3. 🟑 Any available client (auto mode)
4. πŸ”΅ Anonymous session (auto mode)
5. ❌ Error returned to caller

πŸ—οΈ Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Your AI Assistant (Claude Code / Cursor / Windsurf)    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                       β”‚ MCP (stdio)
                       β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  perplexity-mcp                                          β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚
β”‚  β”‚  tools.py       β”‚  β”‚  server.py                     β”‚  β”‚
β”‚  β”‚  β€’ ask          │──│  β€’ Pool state sync             β”‚  β”‚
β”‚  β”‚  β€’ research     β”‚  β”‚  β€’ Timeout handling            β”‚  β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                       β”‚
                       β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Backend Engine (perplexity/)                            β”‚
β”‚                                                          β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚
β”‚  β”‚  client.py   β”‚  β”‚  client_pool β”‚  β”‚  admin.py      β”‚  β”‚
β”‚  β”‚  β€’ Search    β”‚  β”‚  β€’ Rotation  β”‚  β”‚  β€’ REST API    β”‚  β”‚
β”‚  β”‚  β€’ Upload    β”‚  β”‚  β€’ Backoff   β”‚  β”‚  β€’ Static      β”‚  β”‚
β”‚  β”‚  β€’ Validate  β”‚  β”‚  β€’ Monitor   β”‚  β”‚    files       β”‚  β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚  β€’ Fallback  β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚
β”‚         β”‚          β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜           β”‚          β”‚
β”‚         β–Ό                                     β–Ό          β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”                    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”‚
β”‚  β”‚ Perplexity  β”‚                    β”‚ React Admin UI β”‚   β”‚
β”‚  β”‚ (web API)   β”‚                    β”‚ :8123/admin/   β”‚   β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
Component File Role
MCP Server src/server.py Stdio transport, pool state sync, timeout handling
Tool Definitions src/tools.py 2 MCP tools with LLM-optimized descriptions
API Client perplexity/client.py Perplexity API via curl_cffi (bypasses Cloudflare)
Client Pool perplexity/server/client_pool.py Round-robin, backoff, monitor, state persistence
Query Engine perplexity/server/app.py Rotation loop, 3-level fallback, validation
Admin API perplexity/server/admin.py REST endpoints + static file serving
Admin UI perplexity/server/web/ React + Vite + Tailwind dashboard

πŸ§ͺ Development

# Install in development mode
uv pip install -e ".[dev]" --python .venv/bin/python

# Run unit tests (53 tests)
.venv/bin/python -m pytest tests/ -v

# Frontend development
cd perplexity/server/web
npm install
npm run dev      # Dev server with proxy to :8123
npm run build    # Production build

Project Structure

src/                          # MCP stdio server (thin wrapper)
  server.py                   #   Entry point, pool state sync
  tools.py                    #   Tool definitions

perplexity/                   # Backend engine
  client.py                   #   Perplexity API client (curl_cffi)
  config.py                   #   Constants, endpoints, model mappings
  exceptions.py               #   Custom exception hierarchy
  logger.py                   #   Centralized logging
  server/
    app.py                    #   Starlette app, query engine
    client_pool.py            #   ClientPool, rotation, monitor
    admin.py                  #   Admin REST API
    utils.py                  #   Validation helpers
    main.py                   #   HTTP server entry point
    web/                      #   React admin frontend (Vite + Tailwind)

tests/                        # 53 unit tests

⚠️ Limitations

  • Unofficial β€” uses Perplexity's web interface, may break if they change it
  • Cookie-based auth β€” session tokens expire after ~30 days
  • Rate limits β€” anonymous sessions have strict query limits
  • Deep research β€” takes 2-5 minutes per query (this is normal)

πŸ“„ License

MIT

About

Free Perplexity AI MCP server with multi-account pooling, React admin dashboard, zero-cost health monitoring, and Telegram alerts. No API keys needed.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors