Rewrite paolina using DataFrames by gonzaponte · Pull Request #957 · next-exp/IC

gonzaponte · 2026-04-20T16:05:05Z

Paolina functions were written using the HitCollection and Hit classes, which are cumbersome. Pandas' Dataframes are a much better tool to handle this type of data and, in practice, we always use them. This PR rewrites some of the paolina functionality avoiding these unnecessary intermediate objects.

The idea is that it's easier to move this conversion down the workflow, step by step

Useful when we have written a hypothesis strategy to generate complex data, but we only need one example or a dummy value for property-based testing

1. Beersheba hits don't have Q, for some reason, so we need to fill in the value 2. We need to drop some unused columns and reorder the rest 3. We need to force a type change because there is something wrong in the input file 1 should probably be revisited, 2 will hopefully be worked out once the full event-model removal is done and 3 is a serious WTF with some continuous variables having integer type and needs to be fixed.

This is due to the terrible bookkeeping of our test files. For now, this is just a demonstration that the tests pass, i.e., the refactor does not modify the output. In the future, test files will be updated to prevent this from the beginning.

jwaiton

Good work! This is a really important change that I'm excited to see implemented.

I've commented commit-by-commit, so some of the commits may no longer be useful (and some outdated ones may still be, for example I still think the dhits_from_files() docstring is a bit barebones).

jwaiton

This PR removes the use of the HitCollection class from paolina, retrofitting all relevant functions to instead work with dataframes. General improvements to the docstrings and code clarity were made in parallel.

Good work!

gonzaponte added 14 commits April 20, 2026 15:31

Move df->hitc conversion to within compute_tracks

aac2894

The idea is that it's easier to move this conversion down the workflow, step by step

Rewrite filter_events_nohits with df as input

fdb9254

Rewrite Efield_copier with df as input

ef9ef54

Rewrite paolina functions using hits in df format

6142eeb

Add testing utility to generate a single example

5fde98a

Useful when we have written a hypothesis strategy to generate complex data, but we only need one example or a dummy value for property-based testing

Refactor paolina tests using an_instance_of

e32c2ce

Rewrite components to work with dataframes

5a297d7

Remove hitc_from_df

1a3d41f

Recover skipped test

66989c6

Clean-up: remove unused imports

3968445

Fix several lies in type hints and docs

ac5811e

Move hitc_to_df to the museum

689bb2e

jwaiton reviewed Apr 27, 2026

View reviewed changes

gonzaponte added 5 commits April 28, 2026 08:04

Improve docs and type hint for dhits_from_files

9970760

Improve bounding_box docstring

384246d

Improve commentary

07e4629

Add warning for too many hits

dd9f40c

Remove default argument in round_hits_positions_in_place

025c444

gonzaponte force-pushed the paolina-df branch from 78df6d0 to 025c444 Compare April 28, 2026 10:47

gonzaponte force-pushed the paolina-df branch from 3b05f83 to 025c444 Compare May 5, 2026 19:05

jwaiton approved these changes May 7, 2026

View reviewed changes

Ian0sborne merged commit 0037429 into next-exp:master May 11, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rewrite paolina using DataFrames#957

Rewrite paolina using DataFrames#957
Ian0sborne merged 19 commits into
next-exp:masterfrom
gonzaponte:paolina-df

gonzaponte commented Apr 20, 2026 •

edited

Loading

Uh oh!

jwaiton left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jwaiton left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

gonzaponte commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jwaiton left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jwaiton left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gonzaponte commented Apr 20, 2026 •

edited

Loading

jwaiton left a comment •

edited

Loading