A powerful Python wrapper and REST API for Google's Gemini CLI, designed to manage free-tier limitations seamlessly with intelligent rate limiting, caching, and enterprise-ready features.
- Smart Rate Limiting - Automatic tracking and management of the 1000 requests/hour limit
- Response Caching - Reduce redundant API calls with intelligent TTL-based caching
- Retry Logic - Automatic retries with exponential backoff for failed requests
- Async Processing - Queue requests for non-blocking background processing
- REST API - Full-featured HTTP API with Flask and CORS support
- Usage Analytics - Comprehensive tracking and historical statistics
- Batch Processing - Efficiently process multiple prompts with progress tracking
- CLI Tool - Feature-rich command-line interface for all operations
- Docker Support - Ready-to-deploy containerized solution
- Extensive Logging - Detailed logging for debugging and monitoring
```bash
# Clone the repository
git clone https://github.com/falkensmz/GeminiProxy.git
cd GeminiProxy

# Install with pip
pip install -e .

# Or install from PyPI (when published)
pip install geminiproxy
```

```bash
# Send a simple prompt
geminiproxy "Explain quantum computing in simple terms"
# Check usage statistics
geminiproxy --stats
# Process multiple prompts
geminiproxy --batch prompts.txt --output results.json
# Start the REST API server
geminiproxy --server --port 5000
```

```python
from geminiproxy import GeminiClient
# Initialize client
client = GeminiClient()
# Send a prompt
response = client.prompt("Write a haiku about Python")
if response["success"]:
    print(response["output"])
# Check usage
stats = client.get_usage()
print(f"Remaining requests: {stats['remaining_this_hour']}")
# Batch processing
prompts = ["What is AI?", "Explain ML", "Define NLP"]
results = client.batch_prompts(prompts)
```

```bash
# Start the server
geminiproxy --server
# Send a prompt
curl -X POST http://localhost:5000/prompt \
-H "Content-Type: application/json" \
-d '{"prompt": "Hello, Gemini!"}'
# Check usage
curl http://localhost:5000/usage
# Async request
curl -X POST http://localhost:5000/prompt/async \
-H "Content-Type: application/json" \
-d '{"prompt": "Complex analysis task"}'
```

| Endpoint | Method | Description |
|---|---|---|
| `/` | GET | API documentation |
| `/health` | GET | Health check with usage stats |
| `/usage` | GET | Detailed usage statistics |
| `/prompt` | POST | Send prompt (synchronous) |
| `/prompt/async` | POST | Queue prompt (asynchronous) |
| `/job/<id>` | GET | Check async job status |
| `/batch` | POST | Process multiple prompts |
| `/stream` | POST | Stream responses (SSE) |
| `/jobs` | GET | List all jobs |
| `/cache/clear` | POST | Clear response cache |
| `/stats/history` | GET | Historical usage data |
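For async jobs, a client typically submits the prompt and then polls the job endpoint until it finishes. Below is a rough sketch using the `requests` library against the endpoints above; the `job_id` field and the `status` values are assumptions about the response shape, so adjust them to match what the server actually returns.

```python
import time
import requests

BASE_URL = "http://localhost:5000"

# Queue a prompt for background processing
job = requests.post(
    f"{BASE_URL}/prompt/async",
    json={"prompt": "Summarize the history of Python"},
    timeout=30,
).json()
job_id = job["job_id"]  # assumed field name; check your server's response

# Poll the job endpoint until the result is ready (give up after ~2 minutes)
for _ in range(60):
    status = requests.get(f"{BASE_URL}/job/{job_id}", timeout=30).json()
    if status.get("status") in ("completed", "failed"):  # assumed status values
        print(status)
        break
    time.sleep(2)
```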
```python
from geminiproxy import GeminiClient

client = GeminiClient(
    auto_approve=True,        # Auto-approve tool calls
    checkpointing=True,       # Enable checkpointing
    max_retries=3,            # Retry attempts
    rate_limit_per_hour=950,  # Conservative limit
    cache_ttl=3600,           # Cache TTL in seconds
    timeout=300               # Command timeout
)
```

```bash
# Build the image
docker build -t geminiproxy .
# Run the container
docker run -p 5000:5000 geminiproxy
# With environment variables
docker run -p 5000:5000 \
-e RATE_LIMIT=900 \
-e AUTO_APPROVE=true \
  geminiproxy
```

```bash
# Start services
docker-compose up -d
# View logs
docker-compose logs -f
# Stop services
docker-compose down
```

```
GeminiProxy/
├── geminiproxy/
│   ├── __init__.py      # Package initialization
│   ├── client.py        # Core client implementation
│   ├── server.py        # REST API server
│   ├── database.py      # SQLite rate limiting
│   ├── exceptions.py    # Custom exceptions
│   └── cli.py           # CLI interface
├── tests/               # Test suite
├── docs/                # Documentation
├── examples/            # Usage examples
└── docker/              # Docker configuration
```
The system tracks API usage in a local SQLite database (`~/.geminiproxy/rate_limit.db`) and enforces a conservative limit of 950 requests/hour. When the limit is reached:
- Synchronous calls return an error that includes the wait time (see the sketch below)
- Async calls are automatically queued
- Batch processing pauses intelligently
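Because the synchronous error response reports how long to wait, a caller can sleep and retry. Here is a minimal sketch of that pattern, assuming the error payload exposes the wait as a `wait_seconds` key and an `error` message (the real key names may differ; inspect an actual rate-limited response to confirm):

```python
import time
from geminiproxy import GeminiClient

client = GeminiClient()

def prompt_with_backoff(text, max_wait=3600):
    """Send a prompt; if the hourly limit is hit, wait out the limit and retry once."""
    response = client.prompt(text)
    if response["success"]:
        return response["output"]

    wait = response.get("wait_seconds")  # hypothetical key for the reported wait time
    if wait is not None and wait <= max_wait:
        time.sleep(wait)
        return client.prompt(text).get("output")

    raise RuntimeError(f"Request failed: {response.get('error')}")

print(prompt_with_backoff("Give me one fun fact about SQLite"))
```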
Responses are cached with a configurable TTL:
- In-memory cache for fast retrieval
- MD5-based cache keys (illustrated below)
- Automatic cache invalidation
- Manual cache clearing available
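Conceptually, the cache hashes the prompt with MD5 to form a key and stores the response alongside a timestamp; entries older than the TTL are invalidated on lookup. The sketch below illustrates the idea only and is not the library's internal implementation:

```python
import hashlib
import time

class TTLCache:
    """In-memory cache keyed by the MD5 of the prompt, with per-entry expiry."""

    def __init__(self, ttl=3600):
        self.ttl = ttl
        self._store = {}  # key -> (timestamp, value)

    @staticmethod
    def make_key(prompt: str) -> str:
        return hashlib.md5(prompt.encode("utf-8")).hexdigest()

    def get(self, prompt: str):
        entry = self._store.get(self.make_key(prompt))
        if entry is None:
            return None
        timestamp, value = entry
        if time.time() - timestamp > self.ttl:
            del self._store[self.make_key(prompt)]  # expired: invalidate automatically
            return None
        return value

    def set(self, prompt: str, value) -> None:
        self._store[self.make_key(prompt)] = (time.time(), value)

    def clear(self) -> None:
        self._store.clear()
```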
Comprehensive error handling is built in:
- Custom exception hierarchy
- Detailed error messages
- Automatic retry logic (see the backoff sketch below)
- Graceful degradation
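The retry behaviour follows the usual exponential-backoff pattern: double the delay after each failed attempt, up to the configured number of retries. A standalone sketch of that pattern, illustrative only and not the client's internal code:

```python
import time

def retry_with_backoff(func, max_retries=3, base_delay=1.0):
    """Call func(), retrying on failure with exponentially growing delays."""
    for attempt in range(max_retries + 1):
        try:
            return func()
        except Exception as exc:
            if attempt == max_retries:
                raise  # out of retries: surface the original error
            delay = base_delay * (2 ** attempt)  # 1s, 2s, 4s, ...
            print(f"Attempt {attempt + 1} failed ({exc}); retrying in {delay:.0f}s")
            time.sleep(delay)

# Example: wrap any flaky call
# result = retry_with_backoff(lambda: client.prompt("Define NLP"))
```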
Track your usage with built-in analytics:
```bash
# View current usage
geminiproxy --stats
# Get historical data (API)
curl "http://localhost:5000/stats/history?days=30"
# Clean old data
geminiproxy --cleanup
```

We welcome contributions! Please see our Contributing Guide for details.
- Fork the repository
- Create your feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Google for providing the Gemini CLI tool
- The Python community for excellent libraries
- All contributors and users of this project
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Email: contact@falkensmz.dev
- WebSocket support for real-time streaming
- Multi-user authentication system
- Prometheus metrics integration
- GraphQL API endpoint
- Browser extension
- Mobile SDK (iOS/Android)
- Kubernetes Helm charts
- Advanced prompt templates
Made with ❤️ by falkensmz