A document intelligence system built for the Adobe India Hackathon 2025.
AcroLens helps users analyze PDFs in depth, connect insights across documents, and even generate podcast-style audio summaries.
- Multi-PDF Upload → Work with multiple documents in one session.
- AI-Powered Insights → Extract summaries, contradictions, overlaps, and examples.
- Semantic Search → Instantly find related context across your document library.
- Audio Summaries → Listen to your documents in podcast-style audio.
- Clean 3-Column UI → PDF viewer, insights, and related content side-by-side.
- Progressive Processing → Start interacting instantly while background analysis continues.
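
To make the Semantic Search feature concrete, here is a minimal sketch of ranking document sections by cosine similarity. It is an illustration only: the README does not say which embedding model or vector store AcroLens uses, so the three-dimensional vectors, the section ids, and the names `cosine_similarity`, `top_k_related`, and `LIBRARY` are all hypothetical.

```python
import math
from typing import Dict, List, Tuple


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as unrelated to everything
    return dot / (norm_a * norm_b)


def top_k_related(
    query_vec: List[float],
    library: Dict[str, List[float]],
    k: int = 3,
) -> List[Tuple[str, float]]:
    """Rank every section in the library against the query, best first."""
    scored = [
        (section_id, cosine_similarity(query_vec, vec))
        for section_id, vec in library.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings; a real system would get these from a model.
LIBRARY = {
    "report.pdf#summary": [1.0, 0.0, 0.0],
    "notes.pdf#risks": [0.0, 1.0, 0.0],
    "paper.pdf#methods": [0.7, 0.7, 0.0],
}

if __name__ == "__main__":
    for section_id, score in top_k_related([0.9, 0.1, 0.0], LIBRARY, k=2):
        print(f"{section_id}: {score:.3f}")
```

A brute-force scan like this is adequate for a handful of PDFs; a larger library would typically use an approximate nearest-neighbour index instead.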
- Frontend → React (Vite) SPA, served on port 8080 in Docker.
- Backend → FastAPI (Python 3.11) with semantic search, embeddings, and TTS/LLM integration.
- Processing → Background pipeline for PDF parsing + embeddings + insight generation.
- Deployment → Single Docker container (multi-stage build: Node for frontend, Python for backend).
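
The background pipeline described above (parsing, then embeddings, then insight generation) could be sketched roughly as follows. This is not the project's actual code: it uses a plain thread from the standard library rather than any FastAPI mechanism, and `DocumentJob`, `STAGES`, `run_pipeline`, and `submit` are hypothetical names. The point is only to show how a caller can get a handle back immediately and poll progress while work continues.

```python
import threading
import time
from dataclasses import dataclass, field
from typing import List

# Stage names mirror the pipeline order listed above.
STAGES = ("parse", "embed", "insights")


@dataclass
class DocumentJob:
    """Tracks the progress of one uploaded document."""
    doc_id: str
    stages_done: List[str] = field(default_factory=list)
    finished: threading.Event = field(default_factory=threading.Event)


def run_pipeline(job: DocumentJob) -> None:
    """Run each stage in order, recording progress as it goes."""
    for stage in STAGES:
        time.sleep(0.01)  # stand-in for the real work of this stage
        job.stages_done.append(stage)
    job.finished.set()


def submit(doc_id: str) -> DocumentJob:
    """Start processing in the background and return immediately."""
    job = DocumentJob(doc_id)
    threading.Thread(target=run_pipeline, args=(job,), daemon=True).start()
    return job


if __name__ == "__main__":
    job = submit("example.pdf")
    # The caller is free to serve the PDF viewer here while stages complete.
    job.finished.wait(timeout=5)
    print(job.doc_id, job.stages_done)
```

Because `submit` returns before any stage has run, a UI can show the document right away and fill in insights as `stages_done` grows.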
- Docker Desktop
- (Optional for local dev) Node.js 20+, Python 3.11+
- Build the image
  docker build --platform linux/amd64 -t acrolens .
- Run the container
  docker run -v /path/to/credentials:/credentials -e ADOBE_EMBED_API_KEY=<YOUR_ADOBE_EMBED_API_KEY> -e LLM_PROVIDER=gemini -e GOOGLE_APPLICATION_CREDENTIALS=/credentials/adbe-gcp.json -e GEMINI_MODEL=gemini-2.5-flash -e TTS_PROVIDER=gcp -p 8080:8080 acrolens
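
The run command passes its configuration through environment variables. As a rough sketch of how a Python backend might read and validate them at startup, the snippet below collects the five variables into one object. The variable names come from the command above; the `Settings` class, the `load_settings` function, and the choice of which variables count as required are assumptions made for illustration, not the project's actual behaviour.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Assumption: these three are treated as mandatory in this sketch.
REQUIRED = ("ADOBE_EMBED_API_KEY", "LLM_PROVIDER", "TTS_PROVIDER")


@dataclass(frozen=True)
class Settings:
    adobe_embed_api_key: str
    llm_provider: str
    tts_provider: str
    gemini_model: Optional[str]
    google_application_credentials: Optional[str]


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read configuration from the environment, failing fast on gaps."""
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing environment variables: " + ", ".join(missing)
        )
    return Settings(
        adobe_embed_api_key=env["ADOBE_EMBED_API_KEY"],
        llm_provider=env["LLM_PROVIDER"],
        tts_provider=env["TTS_PROVIDER"],
        gemini_model=env.get("GEMINI_MODEL"),
        google_application_credentials=env.get(
            "GOOGLE_APPLICATION_CREDENTIALS"
        ),
    )


if __name__ == "__main__":
    try:
        print(load_settings())
    except RuntimeError as exc:
        print(exc)
```

Failing at startup with a list of the missing names tends to be easier to debug than a later error from deep inside an LLM or TTS call.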