PREPARE Extraction Tool is an application that helps create mappings between coding systems and OHDSI standard Vocabulary concepts. The tool is an adaptation and extension of the OHDSI Usagi tool, focused on extracting relevant medical terms from unstructured text and mapping them to the OHDSI vocabularies available on OHDSI Athena.
Prerequisites:

- Docker and Docker Compose
- Node.js 18+ (for local frontend development)
- Python 3.10+ (for local backend development)
Running the full stack with Docker Compose is the easiest way to get started. Open a terminal in the project root and follow the steps below.
1. Clone the repository:

   ```bash
   git clone <repository-url>
   cd PREPARE-Extraction-Tool
   ```
2. Set up environment variables:

   ```bash
   cp .env.example .env
   # Edit .env with your configuration
   ```

   Key host configuration variables:

   | Variable | Default | Description |
   |----------|---------|-------------|
   | `FRONTEND_HOST` | `http://localhost:3000` | URL where the frontend is accessible |
   | `BACKEND_HOST` | `http://localhost:8000` | URL where the backend API is accessible |
   | `EXTRACT_HOST` | `http://localhost:5600` | URL where the extraction service is accessible |

   A sample `.env` sketch with these defaults is shown after these steps.
3. Place the GLiNER model files:

   Use the shared zip file named `model.zip`, extract it, and place the extracted `model` folder inside `bioner`. If you have a fine-tuned model, place that extracted model folder in the same location.

   Expected result:

   ```
   bioner/model/
   ```
4. Start all services:

   ```bash
   docker-compose up -d
   ```
5. Apply database migrations:

   ```bash
   docker compose exec backend alembic upgrade head
   ```
6. (Optional) Load Medical Vocabularies:

   This step populates PostgreSQL and Elasticsearch with the main medical vocabularies and concepts required for mapping.

   - Note: You can skip this step for now and manually upload these vocabularies through the application interface later.
   - Prerequisite: Ensure the required data files (`vocabulary.csv`, `concept.csv`, and the `es_repo` folder) are placed inside the `seed_data` directory.
   - Run the script:

     ```bash
     ./scripts/seed.sh
     ```
7. Access the application by opening http://localhost:3000 in your browser (using the default host values):

   - Frontend: http://localhost:3000 (configured via `FRONTEND_HOST`)
   - Backend API: http://localhost:8000 (configured via `BACKEND_HOST`)
   - API Documentation: http://localhost:8000/docs
   - Database Admin: http://localhost:8080
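For reference, here is a minimal `.env` sketch using the host variables and default values from the table in step 2 (your `.env.example` may contain additional variables):

```env
# Host configuration (defaults from the table in step 2)
FRONTEND_HOST=http://localhost:3000
BACKEND_HOST=http://localhost:8000
EXTRACT_HOST=http://localhost:5600
```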
If your containers are still running (for example, you have not restarted your computer), just open http://localhost:3000 in your browser; nothing else is needed.

If you restarted your computer or stopped Docker, simply run:

```bash
docker-compose up -d
```

Then open http://localhost:3000.
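To check whether the stack is up, or to stop it, the standard Docker Compose commands apply:

```bash
docker compose ps    # list the running services
docker compose down  # stop and remove the containers
```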
```
PREPARE-Extraction-Tool/
├── backend/                 # FastAPI backend service
│   ├── app/                 # Main application code
│   │   ├── core/            # Core configuration and utilities
│   │   ├── routes/          # API endpoints
│   │   ├── models.py        # Data models
│   │   ├── utils/           # Utility functions
│   │   └── tests/           # Backend tests
│   ├── requirements.txt     # Python dependencies
│   ├── pyproject.toml       # Project configuration
│   └── Dockerfile           # Backend container
├── frontend/                # React frontend application
│   ├── src/                 # Source code
│   │   ├── components/      # React components
│   │   ├── pages/           # Page components
│   │   ├── hooks/           # Custom React hooks
│   │   └── assets/          # Static assets
│   ├── package.json         # Node.js dependencies
│   └── Dockerfile           # Frontend container
├── scripts/                 # Build and deployment scripts
├── docker-compose.yaml      # Multi-container setup
└── .env                     # Environment variables (create from .env.example)
```
The backend is built with Python 3.10+ using the following main technologies:
- FastAPI: Modern, fast web framework for building APIs
- Uvicorn: ASGI server for running FastAPI applications
- SQLModel: SQL database integration with Pydantic models
- Pydantic: Data validation and settings management
- PostgreSQL: Primary database (via Docker)
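For local backend development outside Docker (see the Python 3.10+ prerequisite above), a minimal sketch of a typical workflow is shown below; the module path `app.main:app` is an assumption and may differ in this repository:

```bash
cd backend
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt            # dependencies listed in backend/requirements.txt
uvicorn app.main:app --reload --port 8000  # module path is an assumption; adjust to the actual FastAPI app
```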
The frontend is built with TypeScript and React 19 using:
- React 19: Latest React with concurrent features
- TypeScript: Type-safe JavaScript
- Vite: Fast build tool and dev server
- Storybook: Component development and documentation
- Vitest: Unit testing framework
- ESLint + Prettier: Code quality and formatting
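For local frontend development outside Docker (Node.js 18+ prerequisite), a minimal sketch assuming conventional Vite/Vitest/Storybook script names in `frontend/package.json` (the actual script names may differ):

```bash
cd frontend
npm install
npm run dev        # Vite dev server (script name is an assumption)
npm run test       # Vitest unit tests (assumption)
npm run storybook  # Storybook component explorer (assumption)
```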