A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
-
Updated
Apr 14, 2026 - Python
A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
Valentine scalable deployment for VLDB demo
Deterministic key and join discovery for structured datasets
Master thesis: Holistic Schema Matching at Scale
JCDL 2025 Paper "Multi-Disciplinary Dataset Discovery from Citation-Verified Literature Contexts" which matching research questions to cited datasets.
Search-first dataset discovery platform with DuckDB, FastAPI, background workers, and a reproducible local demo path.
Add a description, image, and links to the dataset-discovery topic page so that developers can more easily learn about it.
To associate your repository with the dataset-discovery topic, visit your repo's landing page and select "manage topics."