🧠 RAG-HEALTH: WHO-focused Retrieval-Augmented Generation System

Project link: https://drive.google.com/file/d/1e15xKkPw1KtUhfN2a43LJ2b7T8nXV2RV/view?usp=drive_link

RAG-HEALTH is an end-to-end intelligent question-answering system designed for the healthcare domain. It leverages Retrieval-Augmented Generation (RAG) to retrieve information from trusted documents and generate accurate, explainable answers using state-of-the-art LLMs via the Groq API.

This system supports multimodal content (text, tables, and images) and is built to enable public access to healthcare knowledge using AI in a safe, transparent, and modular way.

🌐 Key Features

🔍 Vector-based document retrieval using ChromaDB and MiniLM embeddings
🤖 LLM-powered answer generation with LLaMA3 models served through Groq
🖼️ Image-to-text captioning: images are hosted on ImgBB and captioned by a vision LLM
📊 PDF extraction for tables, text, and images
⚙️ Microservice architecture using FastAPI
🧩 Workflow orchestration using n8n

📁 Directory Structure

RAG-HEALTH/
│
├── data/                         # Source PDF files
├── helpers/                      # Core modules for embedding, retrieval, and LLM
│   ├── context_builder.py
│   ├── db_init.py
│   ├── image_llm.py
│   ├── prompt_builder.py
│   └── text_llm.py
│
├── scripts/                      # Utility and automation scripts
│   ├── data_ingestion.py         # Extracts content and indexes it
│   └── RAG_HEALTH.json           # n8n workflow config
│
├── services/                     # API layer (FastAPI microservices)
│   ├── retriever.py              # Context fetcher
│   └── main.py                   # LLM responder
│
├── ui/                           # Streamlit frontend
│   └── chat.py
│
├── vector_store/                 # Persisted Chroma vector DB
├── .env                          # API keys and environment variables
├── requirements.txt
└── README.md

🧪 How It Works

PDF Ingestion:
- scripts/data_ingestion.py extracts text, images, and tables.
- Images are uploaded to ImgBB, and the hosted image is captioned by a vision LLM served through Groq.
- All content is split into semantic chunks and stored in Chroma with metadata.
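
The chunk-and-tag step above can be sketched as follows. This is a minimal illustration, not the code in scripts/data_ingestion.py: `Chunk`, `chunk_text`, the `max_chars` limit, and the metadata keys are all hypothetical, and paragraph grouping stands in for whatever semantic splitting the project actually uses.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    metadata: dict


def chunk_text(text: str, source: str, page: int, max_chars: int = 500) -> list:
    """Group blank-line-separated paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars is kept whole rather than cut
    mid-sentence. All names and metadata keys here are assumptions.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    groups = []
    current = ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if current and len(candidate) > max_chars:
            # Adding this paragraph would overflow: close the current chunk.
            groups.append(current)
            current = para
        else:
            current = candidate
    if current:
        groups.append(current)
    return [
        Chunk(
            text=group,
            metadata={"source": source, "page": page, "chunk_index": i, "type": "text"},
        )
        for i, group in enumerate(groups)
    ]
```

Each `Chunk` carries the text to embed plus the metadata that would be stored next to it in Chroma, so an answer can later cite its source file and page.
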
Query Flow:
- User enters a healthcare query (e.g., "What are sterilization guidelines before surgery?").
- The retriever service searches Chroma for relevant document chunks.
- The main service builds a prompt from the question and the retrieved context, sends it to the LLaMA3 model through the Groq API, and returns the answer.
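
Conceptually, the retrieval step is a nearest-neighbour search over embedding vectors. The sketch below is a pure-Python stand-in for what the vector store does internally. It uses toy 3-dimensional vectors in place of the 384-dimensional `all-MiniLM-L6-v2` embeddings, and cosine similarity is only one common choice of metric; which metric the project's Chroma collection uses is not stated here. The function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, indexed, k=2):
    """Return the texts of the k chunks most similar to the query.

    indexed is a list of (chunk_text, embedding_vector) pairs.
    """
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in indexed]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]


# Toy index with made-up vectors, for illustration only.
toy_index = [
    ("hand hygiene", [1.0, 0.0, 0.0]),
    ("sterilization", [0.9, 0.1, 0.0]),
    ("nutrition", [0.0, 0.0, 1.0]),
]
nearest = top_k([1.0, 0.0, 0.0], toy_index, k=2)
```
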
Multimodal Understanding:
- Tables and images from PDFs are treated as context via captions or markdown rendering.
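
One way to turn an extracted table into markdown context is sketched below. The input shape (a list of rows, each a list of cell strings, with `None` for empty cells) mirrors what pdfplumber's table extraction returns; the function name and the sample rows are made up for illustration and are not taken from the project.

```python
def table_to_markdown(rows):
    """Render a list of rows as a markdown pipe table; the first row is the header."""
    if not rows:
        return ""

    def clean(cell):
        # Empty cells become "", and newlines/pipes are neutralised so the
        # table layout survives.
        if cell is None:
            return ""
        return str(cell).replace("\n", " ").replace("|", "\\|").strip()

    header, *body = [[clean(cell) for cell in row] for row in rows]
    width = len(header)
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join(["---"] * width) + " |",
    ]
    for row in body:
        padded = (row + [""] * width)[:width]  # pad or trim to the header width
        lines.append("| " + " | ".join(padded) + " |")
    return "\n".join(lines)


# Made-up sample rows, for illustration only.
sample = [["Method", "Duration"], ["Autoclave", None], ["Dry heat", "2 h"]]
rendered = table_to_markdown(sample)
```
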
n8n Workflow:
- The entire query → retrieval → response flow is orchestrated using n8n.

🔁 n8n Workflow (Visual Overview)

Trigger Node (Webhook /query)
    ↓
HTTP Node → POST /retrieve
    ↓
HTTP Node → POST /ask
    ↓
Respond to Webhook

Webhook Endpoint: /query
Expected Body:

{
  "query": "Explain laparoscopic sterilization steps."
}

Intermediate Responses:
- /retrieve → { context, question }
- /ask → { answer, sources, context }
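
The same chain can be expressed in a few lines of Python, which is handy for testing the flow without n8n. This is a sketch under assumptions: the request bodies sent to /retrieve and /ask are not documented above, so the shapes used here are guesses, and `run_pipeline` and the stub functions are hypothetical. The two services are passed in as callables so that real HTTP clients or stubs can be swapped in.

```python
def run_pipeline(body, retrieve, ask):
    """Mimic the webhook flow: validate the body, call /retrieve, then /ask.

    retrieve and ask are callables that take a dict and return a dict.
    """
    query = (body.get("query") or "").strip()
    if not query:
        raise ValueError("request body must contain a non-empty 'query'")
    # Assumed request shape for /retrieve; the documented response is {context, question}.
    retrieved = retrieve({"query": query})
    # Assumed request shape for /ask: the /retrieve response is forwarded as-is.
    return ask({"context": retrieved["context"], "question": retrieved["question"]})


# Stubs standing in for the two FastAPI services.
def stub_retrieve(payload):
    return {"context": "ctx about " + payload["query"], "question": payload["query"]}


def stub_ask(payload):
    return {"answer": "stub answer", "sources": [], "context": payload["context"]}
```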

💻 Running Locally

1. Clone the Repository

git clone https://github.com/yourname/RAG-HEALTH
cd RAG-HEALTH

2. Install Python Dependencies

pip install -r requirements.txt

3. Configure Environment

Create a .env file with the following:

GROQ_API_KEY=your_groq_key
IMGBB_API_KEY=your_imgbb_key
HUGGINGFACEHUB_API_TOKEN=your_huggingface_token
CHROMA_DB_DIR=./vector_store
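
A small startup check can catch a missing key before any service runs. The sketch below assumes the variables have already been loaded into the process environment (for example with python-dotenv), and it treats all three API keys as required, which is an assumption about this project. `load_settings` is a hypothetical helper.

```python
import os

REQUIRED_KEYS = ("GROQ_API_KEY", "IMGBB_API_KEY", "HUGGINGFACEHUB_API_TOKEN")


def load_settings(env=None):
    """Return the settings dict, or raise if a required key is missing or empty."""
    env = os.environ if env is None else env
    missing = [key for key in REQUIRED_KEYS if not env.get(key)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    settings = {key: env[key] for key in REQUIRED_KEYS}
    # Fall back to the directory used in the example .env above.
    settings["CHROMA_DB_DIR"] = env.get("CHROMA_DB_DIR", "./vector_store")
    return settings
```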

4. Ingest Documents

python scripts/data_ingestion.py

5. Start Backend Services

# In one terminal
uvicorn services.retriever:app --port 8000

# In another terminal
uvicorn services.main:app --port 8001
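
Once both services are up, a short smoke test can confirm they respond. The sketch below uses only the standard library. It assumes the retriever on port 8000 serves /retrieve and the main service on port 8001 serves /ask (inferred from the workflow overview, not confirmed), and the request bodies are guesses.

```python
import json
import urllib.request

RETRIEVER_URL = "http://localhost:8000/retrieve"  # assumed route
RESPONDER_URL = "http://localhost:8001/ask"       # assumed route


def build_json_post(url, payload):
    """Build a POST request carrying a JSON body."""
    data = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_json(url, payload, timeout=30):
    """Send the request and decode the JSON response."""
    with urllib.request.urlopen(build_json_post(url, payload), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def smoke_test(query="Explain laparoscopic sterilization steps."):
    """Call both services in sequence; requires them to be running locally."""
    retrieved = post_json(RETRIEVER_URL, {"query": query})
    return post_json(RESPONDER_URL, retrieved)
```

Calling `smoke_test()` from a Python shell should return a dictionary containing the answer if both services are reachable.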

6. Start n8n

- Import scripts/RAG_HEALTH.json into your n8n dashboard.
- Start the workflow manually or via an API trigger.

7. (Optional) Launch Streamlit UI

streamlit run ui/chat.py

🤖 Technologies Used

Layer	Technology
LLM	LLaMA3-70B on Groq, via Groq's OpenAI-compatible API
Embeddings	HuggingFace `all-MiniLM-L6-v2`
Vector Store	Chroma DB
Backend	FastAPI
Frontend	Streamlit
Image Upload	ImgBB
Orchestration	n8n
Parsing PDFs	PyMuPDF (fitz), pdfplumber

🧠 Prompt Design

[Context]
- Chunked and cleaned healthcare-relevant paragraphs
- Captions from PDF images and tables

[Question]
What are 12 healthy habits?

[Answer]
(Generated using Groq LLM)
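
A prompt in this layout could be assembled as sketched below. This is not the code in helpers/prompt_builder.py: the template text, the `max_context_chars` budget, and the sample chunks in the usage line are assumptions for illustration.

```python
PROMPT_TEMPLATE = """[Context]
{context}

[Question]
{question}

[Answer]
"""


def build_prompt(question, chunks, max_context_chars=4000):
    """Fill the template with retrieved chunks, stopping at a character budget."""
    parts = []
    used = 0
    for chunk in chunks:
        piece = chunk.strip()
        if not piece:
            continue  # skip empty chunks
        if used + len(piece) > max_context_chars:
            break  # keep the prompt within the assumed context budget
        parts.append("- " + piece)
        used += len(piece)
    return PROMPT_TEMPLATE.format(context="\n".join(parts), question=question.strip())


# Placeholder chunks, not real retrieved content.
example_prompt = build_prompt(
    "What are 12 healthy habits?",
    ["Eat a varied diet.", "   ", "Stay active."],
)
```

The trailing `[Answer]` line is left empty so that the model's completion starts exactly where the answer belongs.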
