PDF Text Cleaner for Text-to-Speech

A web application that uses Google's Gemini API to clean and optimize text extracted from PDFs for text-to-speech (TTS) applications.

Features

AI-powered text cleaning (fixes line breaks, removes headers/footers, corrects spacing)
Drag-and-drop PDF upload
Download cleaned PDF
Modern web interface
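
To make the three cleaning tasks concrete, here is a minimal rule-based sketch of what "fixes line breaks, removes headers/footers, corrects spacing" can mean in practice. It is not the project's method: the application delegates this work to the Gemini API, and every function name, regex, and threshold below is an assumption chosen for illustration.

```python
import re
from collections import Counter


def drop_repeated_lines(pages, min_fraction=0.6):
    """Remove lines that recur on most pages (likely headers or footers)."""
    counts = Counter()
    for page in pages:
        # Count each distinct non-empty line once per page.
        counts.update({line.strip() for line in page.splitlines() if line.strip()})
    threshold = max(2, int(len(pages) * min_fraction))
    repeated = {line for line, n in counts.items() if n >= threshold}
    return [
        "\n".join(l for l in page.splitlines() if l.strip() not in repeated)
        for page in pages
    ]


def fix_line_breaks(text):
    """Join words split across lines and unwrap hard-wrapped lines."""
    # Caveat: this also joins genuine compounds that happen to break at a hyphen.
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # A single newline becomes a space; blank lines (paragraph breaks) are kept.
    return re.sub(r"(?<!\n)\n(?!\n)", " ", text)


def fix_spacing(text):
    """Collapse runs of spaces and remove spaces before punctuation."""
    text = re.sub(r"[ \t]+", " ", text)
    text = re.sub(r" +([,.;:!?])", r"\1", text)
    return text.strip()


def clean_for_tts(pages):
    """Apply the three steps to a list of per-page strings."""
    text = "\n\n".join(drop_repeated_lines(pages))
    return fix_spacing(fix_line_breaks(text))
```

Rules like these are brittle on real documents, which is one reason to hand the task to a language model instead.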

Prerequisites

Python 3.8+
Gemini API key

Quick Start

Clone/download the repository
Create virtual environment: python -m venv venv
Activate: venv\Scripts\activate (Windows) or source venv/bin/activate (Mac/Linux)
Install: pip install -r requirements.txt
Create a .env file (you can copy .env.example) and add your Gemini API key: GEMINI_API_KEY=your_key_here
Run: python app.py
Open: http://localhost:5000
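
If the app fails at startup, it can help to confirm that the key from the .env step is actually readable. The sketch below uses only the standard library; the helper names are hypothetical, and the application itself may load the file differently (for example with a dotenv package).

```python
import os
from pathlib import Path


def read_env_file(path=".env"):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_gemini_api_key(path=".env"):
    """Return the key from the environment or the .env file, or raise."""
    key = os.environ.get("GEMINI_API_KEY") or read_env_file(path).get("GEMINI_API_KEY")
    if not key or key == "your_key_here":
        raise RuntimeError("GEMINI_API_KEY is missing or still set to the placeholder")
    return key
```

Calling get_gemini_api_key() from the project directory either returns the key or raises an error that names the problem.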

How It Works

Extract text from PDF (PyPDF2)
Clean with Google's Gemini API
Generate new PDF (ReportLab)
Download cleaned PDF
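
The steps above can be sketched as a small pipeline. This is an illustration under stated assumptions, not the code in app.py, ai_cleaner.py, or pdf_processor.py: the function names, the prompt wording, and the model name are hypothetical, PdfReader assumes a recent PyPDF2 release, and the Gemini call assumes the google-generativeai package.

```python
from xml.sax.saxutils import escape


def extract_text(pdf_path):
    """Step 1: pull raw text from each page with PyPDF2."""
    from PyPDF2 import PdfReader  # third-party, imported lazily

    reader = PdfReader(pdf_path)
    # extract_text() can return an empty result for image-only pages.
    return "\n\n".join((page.extract_text() or "") for page in reader.pages)


def clean_text(raw_text, api_key, model_name="gemini-1.5-flash"):
    """Step 2: ask Gemini to clean the text (model name is an assumption)."""
    import google.generativeai as genai  # third-party, imported lazily

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel(model_name)
    prompt = (
        "Clean this PDF text for text-to-speech. Fix line breaks, remove "
        "headers and footers, and correct spacing. Return only the text.\n\n"
        + raw_text
    )
    return model.generate_content(prompt).text


def render_pdf(text, out_path):
    """Step 3: write the cleaned text to a new PDF with ReportLab."""
    from reportlab.lib.styles import getSampleStyleSheet
    from reportlab.platypus import Paragraph, SimpleDocTemplate

    style = getSampleStyleSheet()["BodyText"]
    # Paragraph parses XML-like markup, so escape &, < and > first.
    story = [
        Paragraph(escape(para.strip()), style)
        for para in text.split("\n\n")
        if para.strip()
    ]
    SimpleDocTemplate(out_path).build(story)


def run_pipeline(pdf_path, out_path, api_key,
                 extract=extract_text, clean=clean_text, render=render_pdf):
    """Chain the steps; each one can be swapped out, e.g. for testing."""
    raw = extract(pdf_path)
    cleaned = clean(raw, api_key)
    render(cleaned, out_path)
    return out_path  # step 4: the web app would serve this file for download
```

Passing the steps as parameters keeps the orchestration testable without network access or the third-party packages installed.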

Troubleshooting

API key error: Make sure the .env file exists and contains a valid GEMINI_API_KEY
PDF extraction fails: Scanned (image-only) PDFs contain no extractable text and need OCR first
File upload fails: Check that the file is a valid PDF and under 16 MB
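
Two of these checks can be run by hand before uploading. The sketch below is an assumption-laden illustration rather than the app's own validation: the helper names are hypothetical, the 16 MB limit is taken to mean 16 * 1024 * 1024 bytes, and the minimum character count for the OCR check is arbitrary.

```python
MAX_UPLOAD_BYTES = 16 * 1024 * 1024  # assumed meaning of "16 MB"


def check_pdf_upload(data, filename):
    """Return a list of problems found; an empty list means the file looks fine."""
    problems = []
    if not filename.lower().endswith(".pdf"):
        problems.append("file name does not end in .pdf")
    if len(data) > MAX_UPLOAD_BYTES:
        problems.append("file is larger than 16 MB")
    # PDF files carry a %PDF- marker at or near the start of the file.
    if b"%PDF-" not in data[:1024]:
        problems.append("missing %PDF- header")
    return problems


def needs_ocr(page_texts, min_chars=20):
    """Guess that a PDF is scanned if every page yields almost no text."""
    return all(len((text or "").strip()) < min_chars for text in page_texts)
```

For example, check_pdf_upload(open("paper.pdf", "rb").read(), "paper.pdf") lists any problems with a local file named paper.pdf.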

Security

Never commit .env to version control
Keep your API key secret
Temporary files are deleted after processing
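
One common way to guarantee that temporary files are removed, even when processing raises an error, is a context manager with a finally clause. This is a sketch of that pattern, not the project's actual cleanup code; temporary_upload is a hypothetical name.

```python
import os
import tempfile
from contextlib import contextmanager


@contextmanager
def temporary_upload(data, suffix=".pdf"):
    """Write uploaded bytes to a temp file and delete it when the block exits."""
    fd, path = tempfile.mkstemp(suffix=suffix)
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(data)
        yield path
    finally:
        # Runs on success and on error, so the file never outlives the request.
        if os.path.exists(path):
            os.remove(path)
```

Inside a `with temporary_upload(data) as path:` block the file exists on disk; once the block exits, it is gone.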

License

MIT License

Repository Files

templates
.env.example
.gitignore
README.md
ai_cleaner.py
app.py
pdf_processor.py
requirements.txt
run.bat
test_models.py

BrantisIsHacking/PDF-Cleaner