# RAG Chatbot

A complete end-to-end web application that lets users chat with an AI assistant restricted to the documents they upload. Built with an Angular frontend, a Spring Boot backend, and the Gemini Pro API.
## Features

- Document Upload: Support for PDF, DOCX, and TXT files
- Text Extraction: Automatic text extraction from uploaded documents
- Vector Search: Document chunks are embedded and stored for semantic search
- AI Chat: Chat interface that answers only from uploaded documents
- Smart Responses:
  - Answers from documents when relevant
  - "Out of scope" for unrelated questions
  - "No documents available" when no documents are uploaded
  - Highlights conflicts when documents contain conflicting information
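The three response modes above amount to a simple guard chain. A minimal sketch of that decision, where the class name, method name, and the 0.5 similarity threshold are all invented for illustration and are not the project's actual code:

```java
// ResponseMode.java - illustrative decision logic for the "Smart Responses"
// behaviors; the 0.5 threshold and all names here are assumptions.
public class ResponseMode {
    public static String decide(int documentCount, double bestSimilarity) {
        if (documentCount == 0) {
            return "No documents available";  // nothing uploaded yet
        }
        if (bestSimilarity < 0.5) {           // hypothetical relevance threshold
            return "Out of scope";            // question unrelated to the docs
        }
        return "Answer from documents";       // ground the reply in matched chunks
    }

    public static void main(String[] args) {
        System.out.println(decide(0, 0.0));   // No documents available
        System.out.println(decide(3, 0.2));   // Out of scope
        System.out.println(decide(3, 0.8));   // Answer from documents
    }
}
```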
## Tech Stack

- Frontend: Angular 17
- Backend: Spring Boot 3.2.0
- Database: H2 (in-memory)
- AI: Google Gemini Pro API
- Document Processing: Apache PDFBox, Apache POI
- Vector Storage: Custom implementation with cosine similarity
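The custom vector store ranks chunks by cosine similarity. A minimal sketch of that metric, assuming plain `double[]` embeddings (the class and method names are illustrative, not the project's actual code):

```java
// CosineSimilarity.java - illustrative sketch of the metric used for
// vector search; names are hypothetical, not the project's code.
public class CosineSimilarity {
    /** Returns cos(theta) between two equal-length vectors, in [-1, 1]. */
    public static double cosine(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("vectors must have the same length");
        }
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0; // convention: a zero vector matches nothing
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    public static void main(String[] args) {
        double[] v1 = {1.0, 2.0, 3.0};
        double[] v2 = {2.0, 4.0, 6.0}; // same direction as v1
        System.out.println(cosine(v1, v2)); // ~1.0
    }
}
```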
## Prerequisites

- Java 17 or higher
- Node.js 18 or higher
- npm or yarn
- Google Gemini API key (optional - the app works without it using dummy responses)
## Quick Start

### Backend

```bash
cd backend
./mvnw spring-boot:run
```

The backend will start on http://localhost:8080.

### Frontend

```bash
cd frontend
npm install
npm start
```

The frontend will start on http://localhost:4200.
## Gemini API Key Setup

Run the helper script to check your setup:

```bash
./setup-api-key.sh
```

1. Get your API key:
   - Go to: https://makersuite.google.com/app/apikey
   - Create a new API key
   - Copy the API key
2. Set the API key (choose one method):

   Option A - Environment variable (recommended):

   ```bash
   export GEMINI_API_KEY=your_api_key_here
   ```

   Option B - Add to application.properties:

   ```properties
   gemini.api.key=your_api_key_here
   ```

   Option C - Set temporarily for testing:

   ```bash
   GEMINI_API_KEY=your_api_key_here ./start-backend.sh
   ```

3. Restart the backend after setting the API key:

   ```bash
   cd backend && mvn spring-boot:run
   ```
## How It Works

- Document Upload: Users upload PDF, DOCX, or TXT files through the web interface
- Text Extraction: The backend extracts text from documents using Apache PDFBox and POI
- Chunking: Text is split into manageable chunks (1000 characters with 200-character overlap)
- Embedding: Each chunk is converted to a vector embedding using Gemini's embedding API
- Storage: Embeddings are stored in the database with the original text
- Query Processing: When users ask questions:
  - The query is converted to an embedding
  - Similar chunks are found using cosine similarity
  - Relevant context is sent to Gemini Pro for response generation
  - The response is returned with source information
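The chunking step above (1000-character windows with a 200-character overlap) can be sketched as a sliding window. This is a hedged illustration of the approach, not the project's actual implementation; the class and parameter names are invented:

```java
// Chunker.java - illustrative fixed-size chunking with overlap, matching
// the 1000/200 figures described above; names here are hypothetical.
import java.util.ArrayList;
import java.util.List;

public class Chunker {
    public static List<String> chunk(String text, int size, int overlap) {
        if (size <= overlap) {
            throw new IllegalArgumentException("size must exceed overlap");
        }
        List<String> chunks = new ArrayList<>();
        int step = size - overlap; // each window starts 800 chars after the last
        for (int start = 0; start < text.length(); start += step) {
            int end = Math.min(start + size, text.length());
            chunks.add(text.substring(start, end));
            if (end == text.length()) break; // reached the end of the document
        }
        return chunks;
    }

    public static void main(String[] args) {
        String doc = "x".repeat(2200);
        // windows: [0,1000), [800,1800), [1600,2200)
        System.out.println(chunk(doc, 1000, 200).size()); // 3
    }
}
```

The overlap keeps a sentence that straddles a chunk boundary fully visible in at least one chunk, which improves recall at query time.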
## API Endpoints

### Documents

- `POST /api/documents/upload` - Upload a document
- `GET /api/documents` - Get all documents
- `DELETE /api/documents/{id}` - Delete a document

### Chat

- `POST /api/chat/message` - Send a chat message
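As an example of calling the chat endpoint from Java, here is a sketch using the standard `java.net.http.HttpClient`. The JSON field name `message` is an assumption about the request shape; check the backend DTOs before relying on it:

```java
// ChatClient.java - hypothetical client for POST /api/chat/message; the
// "message" field name is an assumption, not taken from the backend code.
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class ChatClient {

    /** Builds the POST request; real code should JSON-escape the message. */
    static HttpRequest buildRequest(String baseUrl, String message) {
        String body = "{\"message\": \"" + message + "\"}";
        return HttpRequest.newBuilder()
                .uri(URI.create(baseUrl + "/api/chat/message"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
    }

    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = buildRequest("http://localhost:8080",
                "What does the uploaded document say about refunds?");
        // Requires the backend to be running on localhost:8080.
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode());
        System.out.println(response.body());
    }
}
```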
## Configuration

```properties
# Server
server.port=8080

# Database
spring.datasource.url=jdbc:h2:mem:testdb
spring.datasource.username=sa
spring.datasource.password=password

# File upload
spring.servlet.multipart.max-file-size=10MB
spring.servlet.multipart.max-request-size=10MB

# Gemini API
gemini.api.key=${GEMINI_API_KEY:}
```

## Development

Backend:

```bash
cd backend
./mvnw spring-boot:run
```

Frontend:

```bash
cd frontend
npm start
```

## Building for Production

Backend:

```bash
cd backend
./mvnw clean package
java -jar target/document-chat-backend-0.0.1-SNAPSHOT.jar
```

Frontend:

```bash
cd frontend
npm run build
```

## Troubleshooting

- Port already in use: Change the port in `application.properties`
- CORS errors: Ensure the frontend URL is correct in the CORS configuration
- File upload fails: Check file size limits and supported formats
- No AI responses or "Out of scope" for valid questions:
  - Most common cause: a missing or invalid Gemini API key
  - Run `./setup-api-key.sh` to check your API key setup
  - Verify the API key is set: `echo $GEMINI_API_KEY`
  - Restart the backend after setting the API key
- Poor response quality: The dummy embedding system has limitations; use a real Gemini API key for better results
## Logging

Backend logs are available in the console. For more detailed logging, modify `logback-spring.xml`.
## License

This project is for educational purposes. Please ensure you comply with Google's Gemini API terms of service when using the AI features.