Local LLM RAG System

A local Retrieval-Augmented Generation (RAG) system that lets you chat with your PDF documents using large language models (LLMs) running on your own machine through Ollama.

Overview

This project implements a full-stack RAG system with the following components (a sketch of how they fit together follows the list):

  • Web UI: A Streamlit-based interface for uploading documents, managing your document library, and chatting with your documents
  • API Backend: A FastAPI service that handles document processing, vector storage, and LLM interactions
  • Database Storage: MongoDB for storing documents and conversation history
  • Vector Storage: ChromaDB for storing and searching document embeddings
  • LLM Integration: Uses Ollama to run local language models for embeddings and completion
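
For orientation, here is a minimal sketch of how a query flows through these components. It assumes Ollama's HTTP API and a ChromaDB server on their default addresses; the collection name, ChromaDB port, and prompt wording are illustrative, not taken from this repo's code:

```python
# Hypothetical sketch of the RAG query flow; not the project's actual code.
# Assumptions: ChromaDB reachable on localhost:8000, a collection named
# "documents", and Ollama at its default address.
import chromadb
import requests

OLLAMA = "http://localhost:11434"

def answer(question: str) -> str:
    # Embed the question with the same model used to embed the documents.
    emb = requests.post(
        f"{OLLAMA}/api/embeddings",
        json={"model": "nomic-embed-text", "prompt": question},
    ).json()["embedding"]

    # Retrieve the most similar document chunks from ChromaDB.
    collection = chromadb.HttpClient(host="localhost", port=8000).get_collection("documents")
    chunks = collection.query(query_embeddings=[emb], n_results=4)["documents"][0]

    # Ask the local LLM to answer from the retrieved context.
    prompt = "Answer using only this context:\n" + "\n---\n".join(chunks) + f"\n\nQuestion: {question}"
    result = requests.post(
        f"{OLLAMA}/api/generate",
        json={"model": "llama2", "prompt": prompt, "stream": False},
    ).json()
    return result["response"]
```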

Prerequisites

  • Docker and Docker Compose (for the containerized setup, and for the MongoDB/ChromaDB containers when running locally)
  • Python with Poetry, plus Ollama (when running the API and UI locally)

Running in Docker

This section explains how to run the app using containerized services.

Installation

  1. Clone this repository:
git clone <repository-url>
cd local-llm-rag

Run

  1. Start the application:
docker-compose up --build

Running locally

This section explains how to run the API and UI components locally while using containerized MongoDB and ChromaDB.

Installation

  1. Install Poetry if you haven't already:
pip install poetry
  2. Install API dependencies:
cd api
poetry install
cd ..
  3. Install UI dependencies:
cd ui
poetry install
cd ..
  4. Install Ollama and pull the required models (a verification snippet follows these steps):
# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull the required models
ollama pull llama2
ollama pull nomic-embed-text
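
Before starting the services, you can confirm that Ollama is running and both models are available by querying its tags endpoint (a convenience check, not part of the repo):

```python
# Check that Ollama is up and the two required models have been pulled.
import requests

tags = requests.get("http://localhost:11434/api/tags").json()
names = [model["name"] for model in tags["models"]]
for required in ("llama2", "nomic-embed-text"):
    ok = any(name.startswith(required) for name in names)
    print(f"{required}: {'ok' if ok else 'MISSING - run: ollama pull ' + required}")
```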

Run

  1. Start the MongoDB and ChromaDB containers:
docker-compose up mongodb chroma --build
  2. Start the API server:
cd api
poetry run uvicorn src.main:app --reload --host 0.0.0.0 --port 8000
  3. Start the UI server (in another terminal):
cd ui
poetry run streamlit run src/app.py
  4. Access the application: the UI is served at http://localhost:8501 and the API at http://localhost:8000.

Note: Make sure Ollama is installed and running locally with your desired models. The API connects to Ollama at its default address (http://localhost:11434).
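
Once everything is up, a quick smoke test can confirm that all three services respond. This assumes the default ports above and that FastAPI's interactive /docs page has not been disabled:

```python
# Smoke test: verify the API, the UI, and Ollama are all reachable.
import requests

services = [
    ("API", "http://localhost:8000/docs"),   # FastAPI's default docs page
    ("UI", "http://localhost:8501"),         # Streamlit's default port
    ("Ollama", "http://localhost:11434"),    # responds "Ollama is running"
]
for name, url in services:
    try:
        code = requests.get(url, timeout=5).status_code
        print(f"{name}: HTTP {code}")
    except requests.exceptions.ConnectionError:
        print(f"{name}: not reachable at {url}")
```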

Usage

  1. Open your browser and navigate to http://localhost:8501 to access the UI

  2. Upload PDF documents using the file uploader in the sidebar

  3. Ask questions about your documents in the chat interface

  4. Start a new conversation or delete documents as needed using the sidebar controls

Project Structure

  • api/: FastAPI backend service
    • config/: Configuration settings
    • src/: Source code for the API components
  • ui/: Streamlit front-end application

Customization

You can modify the default LLM settings in the api/config/settings.py file (a sketch of the file follows this list):

  • Change the embedding model: embedding_model
  • Change the LLM model: llm_model
  • Adjust chunk size and overlap for document processing
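
For reference, the settings file might look roughly like the sketch below. The embedding_model and llm_model fields are named in this README; the chunking fields, the Ollama URL field, and the use of pydantic-settings are assumptions, so check the actual file:

```python
# Hypothetical sketch of api/config/settings.py; the real file may differ.
from pydantic_settings import BaseSettings

class Settings(BaseSettings):
    embedding_model: str = "nomic-embed-text"   # model used for document embeddings
    llm_model: str = "llama2"                   # model used for chat completions
    chunk_size: int = 1000                      # characters per chunk (illustrative)
    chunk_overlap: int = 200                    # overlap between chunks (illustrative)
    ollama_base_url: str = "http://localhost:11434"

settings = Settings()  # field values can be overridden via environment variables
```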
