Ollama Simple RAG Example in Javascript

This is a simple Retrieval-Augmented Generation (RAG) web application that allows you to query a contacts database using natural language. It uses Ollama for local AI models to classify queries, generate embeddings, and provide answers based on relevant contact information.

Prerequisites

Node.js installed
Ollama installed and running locally
Ollama models:
- llama3 (for chat and classification)
- mxbai-embed-large (for generating embeddings)

Installing Ollama Models

First, ensure Ollama is installed and running. Then pull the required models:

ollama pull llama3
ollama pull mxbai-embed-large

What This Does

The application provides a web interface where you can ask questions about contacts in natural language. For example:

"What's John's phone number?"
"Give me the email for the person named Sarah"

The system:

Classifies whether the query is about contact information
If it is, generates an embedding for the query
Finds the most similar contact using cosine similarity
Uses the relevant contact information as context for the LLM to generate an answer
If it's not a contact query, answers directly using the LLM

Building the Embeddings

Before running the application, you need to build embeddings for your contacts database.

Ensure you have a contacts.json file with your contact data. The format should be an array of objects like:

[
  {
    "name": "John Doe",
    "phone": "123-456-7890",
    "email": "john@example.com",
    "address": "123 Main St, Anytown, USA"
  }
]

Run the embedding builder:

node build_embeddings.js

This will:

Read the contacts from contacts.json
Generate embeddings for each contact using the mxbai-embed-large model
Save the contacts with their embeddings to contacts_with_embeddings.json

Running the Application

Start the Node.js server:

node server.js

The server will run on http://localhost:8000 by default.

Open your browser and navigate to http://localhost:8000
Type your query in the input field and click Send or press Enter

Configuration

Port: Set the PORT environment variable to change the server port (default: 8000)
Ollama URL: Set the OLLAMA_URL environment variable to point to a different Ollama instance (default: http://localhost:11434)

Example:

PORT=3000 OLLAMA_URL=http://localhost:11434 node server.js

Files

index.html — The main web page
styles.css — CSS styles for the interface
main.js — Frontend JavaScript for handling user input and displaying responses
server.js — Node.js server that serves static files and proxies requests to Ollama
build_embeddings.js — Script to generate embeddings for contacts
contacts.json — Your contacts database (input)

Security Notes

This server is intended for local development only
It sets permissive CORS headers for proxied responses
Do not expose this server on the public internet without proper security measures

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ollama Simple RAG Example in Javascript

Prerequisites

Installing Ollama Models

What This Does

Building the Embeddings

Running the Application

Configuration

Files

Security Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
build_embeddings.js		build_embeddings.js
contacts.json		contacts.json
index.html		index.html
main.js		main.js
server.js		server.js
styles.css		styles.css

Folders and files

Latest commit

History

Repository files navigation

Ollama Simple RAG Example in Javascript

Prerequisites

Installing Ollama Models

What This Does

Building the Embeddings

Running the Application

Configuration

Files

Security Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages