Lighthouse.ai

Voice-driven web navigator for blind and low-vision users

Lighthouse.ai is a voice-controlled assistant that lets blind and low-vision users browse and operate websites hands-free. Users speak commands; the agent controls the browser and announces what's on screen after every action—reliably and safely.

🚀 Quick Start

Prerequisites

Python 3.9+
Chrome/Chromium browser
macOS, Linux, or Windows

Installation

# Clone the repository
git clone https://github.com/lighthouse-ai/lighthouse.git
cd lighthouse

# Quick setup (recommended)
make quickstart

# Or manual setup
make install
make download-models
make check-chrome

Running Lighthouse.ai

CLI Mode (Voice Interface)

make run-cli
# or
python cli.py

API Mode (Testing/Development)

make run-api
# or
uvicorn main:app --reload

Visit http://localhost:8000/docs for API documentation.

🎯 Core Features

Voice Commands

Navigate: "Go to google.com"
Click: "Click the search button"
Type: "Type hello world"
Submit: "Submit the form"
Describe: "Describe this page"
List: "List all buttons"
Stop: "Stop" or "Cancel"

Safety Features

Domain Allowlist: Only navigate to approved domains
Confirmation Gates: Destructive actions require confirmation
Local Processing: All speech processing happens locally by default

Accessibility

Screen Descriptions: Clear, concise page summaries
Element Disambiguation: Numbered lists when multiple matches
Change Detection: Reports what changed after each action

🔧 Configuration

Environment Variables

Copy .env.example to .env and configure:

# Domain allowlist (comma-separated)
ALLOWED_DOMAINS=google.com,amazon.com,github.com,wikipedia.org

# Browser settings
HEADLESS_MODE=false
BROWSER_TIMEOUT=10

# Audio settings
AUDIO_DEVICE=default
VAD_AGGRESSIVENESS=2

# Privacy settings
LOCAL_PROCESSING=true
LOG_LEVEL=INFO

Domain Allowlist

Edit config/domains.yaml to manage allowed domains:

allowed_domains:
  - google.com
  - amazon.com
  - github.com
  - wikipedia.org
  - example.com

restricted_actions:
  - delete
  - purchase
  - payment
  - account_change

🛡️ Privacy & Security

Privacy-First Design

Local Processing: Speech recognition and synthesis happen on your device
No Audio Storage: Audio is processed in real-time and discarded
Redacted Logs: Sensitive information is automatically redacted
Opt-in Cloud: Cloud services only used with explicit consent

Security Features

Domain Restrictions: Only navigate to approved websites
Action Confirmation: Destructive actions require verbal confirmation
Sandboxed Browser: Isolated browser profile for safety
Audit Trail: All actions are logged for review

Data Handling

No Personal Data: We don't collect or store personal information
Local Storage: All data stays on your device
Encrypted Logs: Session logs are encrypted locally
Transparent Processing: Open source code for full transparency

🧪 Testing

# Run all tests
make test

# Run specific test categories
pytest tests/test_cli.py -v
pytest tests/test_api.py -v
pytest tests/test_browser.py -v

# Test on target sites
make test-sites

# Check code quality
make lint
make format

📁 Project Structure

lighthouse/
├── cli.py                 # CLI entry point
├── main.py                # FastAPI service
├── config/                # Configuration files
├── core/                  # Core functionality
│   ├── asr.py            # Speech recognition
│   ├── nlu.py            # Natural language understanding
│   ├── tts.py            # Text-to-speech
│   ├── browser.py        # Browser automation
│   ├── safety.py         # Safety controls
│   └── state.py          # Session management
├── api/                   # REST API
├── utils/                 # Utilities
└── tests/                 # Test suite

🤝 Contributing

Fork the repository
Create a feature branch: git checkout -b feature/amazing-feature
Make your changes and add tests
Run tests: make test
Commit your changes: git commit -m 'Add amazing feature'
Push to the branch: git push origin feature/amazing-feature
Open a Pull Request

Development Setup

# Install development dependencies
make dev-install

# Setup pre-commit hooks
pre-commit install

# Run development server
make dev

📋 Roadmap

v1.0 (Current MVP)

v1.1 (Planned)

Hotword detection
Advanced form handling
Table navigation
Multi-step workflows

v2.0 (Future)

Cloud TTS integration
Advanced error recovery
Custom command training
Mobile app

🆘 Troubleshooting

Common Issues

"Chrome not found"

make install-chrome
make check-chrome

"Audio device not working"

# Check available audio devices
python -c "import sounddevice; print(sounddevice.query_devices())"

"Whisper model not found"

make download-models

"Permission denied"

# On macOS, grant microphone permissions in System Preferences
# On Linux, add user to audio group
sudo usermod -a -G audio $USER

Getting Help

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI Whisper for speech recognition
Coqui TTS for text-to-speech
Selenium for browser automation
FastAPI for the web framework

Made with ❤️ for the accessibility community

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
lighthouse		lighthouse
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
MVP_PLAN.md		MVP_PLAN.md
Makefile		Makefile
PROTOTYPE_SUMMARY.md		PROTOTYPE_SUMMARY.md
README.md		README.md
TASKS.md		TASKS.md
cli.py		cli.py
demo.py		demo.py
env.example		env.example
main.py		main.py
main_simple.py		main_simple.py
prd_voice_driven_web_navigator_for_blind_low_vision_users.md		prd_voice_driven_web_navigator_for_blind_low_vision_users.md
pyproject.toml		pyproject.toml
test_api.py		test_api.py

Folders and files

Latest commit

History

Repository files navigation

Lighthouse.ai

🚀 Quick Start

Prerequisites

Installation

Running Lighthouse.ai

CLI Mode (Voice Interface)

API Mode (Testing/Development)

🎯 Core Features

Voice Commands

Safety Features

Accessibility

🔧 Configuration

Environment Variables

Domain Allowlist

🛡️ Privacy & Security

Privacy-First Design

Security Features

Data Handling

🧪 Testing

📁 Project Structure

🤝 Contributing

Development Setup

📋 Roadmap

v1.0 (Current MVP)

v1.1 (Planned)

v2.0 (Future)

🆘 Troubleshooting

Common Issues

Getting Help

📄 License

🙏 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages