Add datasets query_imdb and query_krama by Ruiying-Ma · Pull Request #46 · ucbepic/DataAgentBench

Ruiying-Ma · 2026-05-05T22:59:58Z

Datasets Added

`imdb`

Domain: Movie/entertainment industry (IMDB JOB benchmark)
Databases: PostgreSQL (movies_db) + SQLite (people.sqlite)
Properties:
- Multi-DB integration: queries span PostgreSQL and SQLite
- Ill-formatted join keys: identifier columns use non-standard string encodings (e.g. `tt0000042`, `nm001`, `InfT~~7`); agents must strip prefixes and leading zeros before joining
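
To illustrate the key cleanup described above, here is a minimal sketch. It assumes every identifier ends in a run of digits that carries the actual join value; `normalize_key` is a hypothetical helper, not a function from this PR or the benchmark.

```python
import re

def normalize_key(raw: str) -> int:
    """Return the integer encoded by the trailing digits of an identifier.

    Assumption: the join value is the final run of ASCII digits; everything
    before it (e.g. 'tt', 'nm', 'InfT~~') is treated as a prefix to discard.
    """
    match = re.search(r"([0-9]+)$", raw.strip())
    if match is None:
        raise ValueError(f"no trailing digits in {raw!r}")
    # int() also drops the leading zeros.
    return int(match.group(1))

examples = ["tt0000042", "nm001", "InfT~~7"]
normalized = [normalize_key(key) for key in examples]
print(normalized)  # [42, 1, 7]
```

Applying the same normalization to both sides of a join (one column from PostgreSQL, one from SQLite) would let the rows be matched in Python after each database is queried separately.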

`krama`

Domain: Multi-domain scientific data (wildfire, environment, legal, biomedical, astronomy, archeology)
Databases: MongoDB (domain_docs.bson for csv/txt files) + SQLite (us_geo.db for gpkg files and a synthetic beach_water_temperature table) + SQLite (domain_assets.db for other files)
Properties:
- Multi-DB integration: queries span MongoDB, SQLite, and heterogeneous flat files
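- Ill-formatted join keys: beach codes appear as plain integers in one source and with a 'COM-XXX' prefix in another; UCEC patient IDs appear as 'S001' in one source and 's_1' in another
- Unstructured text transformation: GACC, location, and document info are merged into a natural-language description field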
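
As a rough sketch of how the two `krama` key formats could be reconciled, the snippet below maps each variant to one canonical form and joins on it. It assumes the part after `COM-` corresponds to the plain integer code and that patient IDs differ only in case, underscore, and zero padding. The helper names and the sample rows are made up for illustration and are not taken from the dataset.

```python
import re

def canonical_beach_code(raw) -> str:
    """Drop an optional 'COM-' prefix; strip leading zeros if the rest is numeric."""
    text = str(raw).strip()
    if text.upper().startswith("COM-"):
        text = text[len("COM-"):]
    return str(int(text)) if text.isdigit() else text

def canonical_patient_id(raw: str) -> str:
    """Map variants such as 'S001' and 's_1' to the same form, 's1'."""
    match = re.fullmatch(r"[sS]_?([0-9]+)", raw.strip())
    if match is None:
        raise ValueError(f"unrecognized patient id: {raw!r}")
    return f"s{int(match.group(1))}"

# Illustrative rows only: two sources keyed by differently formatted IDs.
source_a = {"S001": "record A1", "S002": "record A2"}
source_b = {"s_1": "record B1", "s_3": "record B3"}

a_by_key = {canonical_patient_id(k): v for k, v in source_a.items()}
b_by_key = {canonical_patient_id(k): v for k, v in source_b.items()}

# Inner join on the canonical key.
joined = {k: (a_by_key[k], b_by_key[k]) for k in a_by_key.keys() & b_by_key.keys()}
print(joined)  # {'s1': ('record A1', 'record B1')}
```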

Each dataset has 10 queries with ground-truth CSVs and validate.py scripts.
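
The PR does not show what the validate.py scripts do internally. The sketch below is only one plausible way to compare an agent's answer with a ground-truth CSV, assuming an order-insensitive, exact match on whitespace-trimmed cells. `load_rows` and `answers_match` are hypothetical names, not the repository's API.

```python
import csv
import io

def load_rows(csv_text: str) -> list[tuple[str, ...]]:
    """Parse CSV text, skip the header row, and return the data rows sorted."""
    reader = csv.reader(io.StringIO(csv_text))
    next(reader, None)  # skip the header if present
    # Ignore blank lines and trim whitespace around each cell.
    return sorted(tuple(cell.strip() for cell in row) for row in reader if row)

def answers_match(predicted_csv: str, truth_csv: str) -> bool:
    """True when both CSVs contain the same data rows, in any order."""
    return load_rows(predicted_csv) == load_rows(truth_csv)

truth = "id,name\n1,alpha\n2,beta\n"
predicted = "id,name\n2, beta\n1,alpha\n"
print(answers_match(predicted, truth))  # True
```

A real validator might also need numeric tolerance or column reordering; those are left out here to keep the comparison rule explicit.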

Validation Results (Claude Sonnet 4.6, 1 run/query)

| Dataset | Accuracy | Timeouts | Avg latency (successful runs) | Min | Max |
|---|---|---|---|---|---|
| `imdb` | 5/10 (50%) | 3 | 929.6s (~15.5 min) | 198s | 2214s |
| `krama` | 9/10 (90%) | 0 | 129.8s (~2.2 min) | 72s | 236s |
| Overall | 14/20 (70%) | 3 | n/a | n/a | n/a |

Ruiying-Ma added 9 commits May 4, 2026 00:29

add query_imdb (JOB) queries

493b5a8

add data

a25ca39

update

6634aa6

update

3f7e2f9

data lake

8b026a4

update

971a93d

fix file names

f57ecfa

rename hints

3ef6357

update

543ba8b

Ruiying-Ma changed the title ~~Ruiying datasets only~~ Add datasets query_imdb and query_krama May 6, 2026

Ruiying-Ma added 10 commits May 5, 2026 23:30

transformed

f07dd6b

update

9e76551

update db_description

2c13c36

update hints

fdaf28d

relaxed

bdbd86f

merge files

c8c3ea6

NL file meta

c52f909

update queries

560ec6d

update description

d2fe9af

merge docs 3-6

6943688

Ruiying-Ma wants to merge 19 commits into main from ruiying-datasets-only
