Skip to content

user177013/lilac

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

953 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Osmanthus

Modernize. Organize. Osmanthus.

🔗 Maintained fork of the archived Lilac project.

License Apache 2.0

Osmanthus is a production-ready fork of the archived Lilac project. It is designed for exploration, curation, and quality control of datasets for LLMs, with a focus on modern embedding infrastructure and Windows stability.

Osmanthus continues the mission of providing "Better data, better AI" by maintaining the core registry-based signal architecture while decoupling from defunct hosted services.

✨ Key Features in this Fork

  • Modern GGUF Support: Enhanced llama-cpp-python integration for state-of-the-art GGUF embeddings.
  • Independent Identity: Full decoupling from the defunct "Lilac Garden" infrastructure.
  • Windows Optimized: Critical stability fixes for high-performance embedding pipelines on Windows systems.
  • Botanical UI: A premium, high-density design system focusing on "Cockpit Mode" utility.

Why Osmanthus?

  • Interactive Exploration: Search, filter, cluster, and annotate your data with an LLM-powered interface.
  • On-Device Performance: Runs entirely on your local machine using open-source LLMs.
  • Data Hygiene: Detect PII, remove duplicates, and analyze text statistics to lower training costs.
  • Centralized Insights: Understand how your data evolves across the entire ML lifecycle.

Osmanthus UI

🔥 Getting Started

💻 Install

# Install directly from the fork repository
pip install git+https://github.com/user177013/lilac.git

🌐 Start the Server

Start the Osmanthus webserver using the new CLI:

osmanthus start ~/my_project

Or from Python:

import osmanthus as osman
osman.start_server(project_dir='~/my_project')

The server will be available at http://localhost:5432/.

📊 Documentation

For detailed guides on loading datasets from HuggingFace, Parquet, JSON, and more, please refer to the docs/ folder.

⚖️ License

Osmanthus is licensed under the Apache License, Version 2.0. This project is an independent fork and is not affiliated with the original Lilac AI Inc. team.


Created with focus on performance and independence.

About

Curate better data for LLMs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 50.4%
  • Svelte 20.0%
  • Jupyter Notebook 17.8%
  • TypeScript 11.0%
  • Shell 0.4%
  • CSS 0.2%
  • Other 0.2%