AnyBio is a Streamlit-based application that allows users to interact with a bioinformatics knowledge base. It uses Google Generative AI and FAISS for document embedding and retrieval.
Before running the application, ensure you have the following installed:
- Python 3.8 or higher: Download Python
- pip: Python's package manager (comes with Python installation).
- Virtual Environment (optional): Recommended to isolate dependencies.
Clone the repository to your local machine:
git clone <repository-url>
cd AnyBiopython -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activateInstall the required Python packages:
pip install -r requirements.txtCreate a .env file in the root directory and add the following:
GOOGLE_API_KEY=<your-google-api-key>
Replace <your-google-api-key> with your actual Google AI API key.
Place the PDF files you want to process in the data directory. Create the directory if it does not exist:
mkdir dataStart the Streamlit app:
streamlit run app.pyOpen your browser and navigate to:
http://localhost:8501
- Multi-language Support: Supports English, Spanish, Amharic, and Arabic.
- PDF Processing: Extracts and indexes text from PDF files.
- Knowledge Base Management: Recreate or clear the FAISS vector store.
- Chat Interface: Ask bioinformatics-related questions and get responses based on the knowledge base.
- Missing API Key: Ensure the
.envfile contains a valid Google API key. - PDF Directory Not Found: Create the
datadirectory and add PDF files. - Dependencies Issue: Ensure all dependencies are installed using
pip install -r requirements.txt.
This project is licensed under the MIT License. See the LICENSE file for details.