Enhance print-metadata-fields to group by root JSON and modalities by karl-koschutnig · Pull Request #473 · PennLINC/CuBIDS

karl-koschutnig · 2025-10-18T09:44:34Z

Description:

This PR improves the output of the cubids print-metadata-fields command to provide a cleaner, more organized view of metadata fields in a BIDS dataset. Instead of listing all fields from every JSON file individually (which can be overwhelming), the command now groups fields as follows:

Root-level JSON files (e.g., dataset_description.json, participants.json) are listed with all their metadata fields.
Modalities (e.g., func, anat, dwi) group all nested JSON files by type and display only the metadata fields unique to that modality (fields that don't appear in other categories).
This reduces redundancy—fields like EchoTime that appear across many files are shown only once under their respective modality, avoiding clutter in the output.

Changes Made:

Modified get_all_metadata_fields() in cubids.py to collect and categorize fields by root files and modalities, filtering for uniqueness.
Updated print_metadata_fields() in workflows.py to display the grouped output.

Example Output (Before):

AcquisitionTime
BandwidthPerPixelPhaseEncode
BodyPartExamined
CoilString
ConversionSoftware
ConversionSoftwareVersion
DerivedVendorReportedEchoSpacing
DeviceSerialNumber
EchoTime
EchoTrainLength
EffectiveEchoSpacing

Example Output (After):

dataset_description.json
Acknowledgements
Authors
...

func
EchoTime
RepetitionTime
TaskName
...

participants.json
age
sex
...

This enhancement makes it easier to get an overview of metadata structure without duplication, addressing user feedback for a more structured and unique field listing.
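
For readers who want a concrete picture of the grouping, here is a minimal sketch under stated assumptions. The helper name group_metadata_fields, the sub-*/**/*.json glob pattern, and the throwaway dataset are all hypothetical and are not the code in cubids.py. The sketch collects every field per modality and does not apply any uniqueness filter.

```python
import json
import tempfile
from collections import defaultdict
from pathlib import Path


def group_metadata_fields(bids_root):
    """Group JSON keys by root-level file and by modality directory.

    Hypothetical helper for illustration only; not the CuBIDS implementation.
    Assumes every JSON file holds a top-level object.
    """
    bids_root = Path(bids_root)
    grouped = {}

    # Root-level JSON files are keyed by their file name.
    for json_path in sorted(bids_root.glob("*.json")):
        with json_path.open() as f:
            grouped[json_path.name] = sorted(json.load(f))

    # Nested sidecars are keyed by the name of their parent directory,
    # which is the modality (func, anat, dwi, ...) in a typical BIDS layout.
    # The pattern matches both sub-*/<modality>/ and sub-*/ses-*/<modality>/.
    modalities = defaultdict(set)
    for json_path in bids_root.glob("sub-*/**/*.json"):
        with json_path.open() as f:
            modalities[json_path.parent.name].update(json.load(f))

    for mod, fields in modalities.items():
        grouped[mod] = sorted(fields)
    return grouped


# Tiny throwaway dataset (made-up values) to exercise the helper.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "dataset_description.json").write_text(
        json.dumps({"Name": "demo", "Authors": []})
    )
    func_dir = root / "sub-01" / "ses-01" / "func"
    anat_dir = root / "sub-02" / "anat"
    func_dir.mkdir(parents=True)
    anat_dir.mkdir(parents=True)
    (func_dir / "sub-01_ses-01_task-rest_bold.json").write_text(
        json.dumps({"EchoTime": 0.03, "TaskName": "rest"})
    )
    (anat_dir / "sub-02_T1w.json").write_text(json.dumps({"EchoTime": 0.002}))
    grouped = group_metadata_fields(root)

for name, fields in grouped.items():
    print(name, fields)
```

Keying nested files by parent directory name keeps the sketch short; it handles both cross-sectional and longitudinal layouts because the glob does not care whether a ses-* level is present.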

Best, Karl

… unique ones

The previous implementation filtered to show only fields unique to each category, which caused common fields like 'Manufacturer' to disappear from all modalities. This fixes test failures in test_remove_fields and test_print_metadata_fields_command_with_test_dataset. Now all fields that exist in each modality are shown, which is the expected behavior. Fields are still deduplicated within each modality, and fields that appear in multiple modalities are now correctly shown in each one.
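
To illustrate why the uniqueness filter hid common fields, here is a small sketch on in-memory data. The function names unique_only and all_fields and the sample field sets are hypothetical; they only mirror the two behaviors described in this commit message.

```python
# Hypothetical per-modality field sets; not taken from a real dataset.
fields_by_modality = {
    "anat": {"EchoTime", "Manufacturer"},
    "func": {"EchoTime", "Manufacturer", "TaskName"},
}


def unique_only(fields_by_group):
    """Keep, per group, only the fields that appear in no other group."""
    result = {}
    for group, fields in fields_by_group.items():
        others = set()
        for other_group, other_fields in fields_by_group.items():
            if other_group != group:
                others |= other_fields
        result[group] = sorted(fields - others)
    return result


def all_fields(fields_by_group):
    """Keep every field seen in each group, deduplicated within the group."""
    return {group: sorted(fields) for group, fields in fields_by_group.items()}


filtered = unique_only(fields_by_modality)   # 'Manufacturer' vanishes everywhere
complete = all_fields(fields_by_modality)    # 'Manufacturer' listed under both
```

With the filter, a field shared by two modalities is subtracted from both, so it appears nowhere; without it, each modality simply reports its own set.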

The test was pointing to the wrong directory. The get_data() function copies cubids/tests/data/, which contains multiple BIDS datasets (BIDS_Dataset, complete, inconsistent, etc.). The test needs to specify which dataset to use, as other tests do with data_root / 'complete'. This fixes the test failure where metadata_fields was empty because no subjects were found at the root level.

The code was incorrectly searching for '_events.json' in the root_fields dictionary keys (which are filenames like 'dataset_description.json'). While BIDS does allow task-level files at the root (e.g., task-rest_events.json), these are already correctly captured by the bids_path.glob('*.json') operation and stored in root_fields with their full filename as the key.

The removed code was:
- Searching in the wrong structure (dictionary keys instead of file contents)
- Using incorrect pattern matching ('_events.json' in filename)
- Completely redundant, as root-level JSON files are already properly collected

Addresses feedback from cursor bot review.
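
As a quick check of the claim that a non-recursive glob already picks up root-level task files, here is a minimal sketch. The temporary dataset and its file contents are made up for illustration; only the names bids_path and root_fields come from the commit message above.

```python
import json
import tempfile
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    bids_path = Path(tmp)

    # Two root-level JSON files, one of them a task-level events sidecar.
    (bids_path / "dataset_description.json").write_text(json.dumps({"Name": "demo"}))
    (bids_path / "task-rest_events.json").write_text(json.dumps({"trial_type": {}}))

    # A nested sidecar that the root-level glob must not pick up.
    nested = bids_path / "sub-01" / "func"
    nested.mkdir(parents=True)
    (nested / "sub-01_task-rest_events.json").write_text(json.dumps({"onset": {}}))

    # '*.json' is not recursive, so only files directly under bids_path match.
    root_fields = {}
    for json_file in sorted(bids_path.glob("*.json")):
        root_fields[json_file.name] = sorted(json.loads(json_file.read_text()))

print(root_fields)
```

No extra '_events.json' handling is needed here: the root-level events file is stored under its full filename like any other root JSON.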

cursor · 2025-10-18T10:29:22Z

cubids/cubids.py

+        for mod, fields in modalities.items():
+            result[mod] = sorted(fields)
+
+        return result


Bug: Test Fails to Detect Field Removal

The test_remove_fields test no longer correctly verifies field removal. The get_all_metadata_fields() method now returns a dictionary, but the test's assertion uses set(new_fields), which creates a set of dictionary keys (filenames/modality names) instead of metadata field names. The intersection check therefore always passes, so the test reports success even if fields were not removed.

Additional Locations (1)

cubids/tests/test_bond.py#L986-L1004
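
A minimal sketch of the failure mode and one possible fix, assuming the grouped dictionary shape shown in the diff above. The sample values and the name fields_to_remove are hypothetical; the actual test code in test_bond.py may differ.

```python
# Hypothetical return value shaped like the grouped dictionary.
new_fields = {
    "dataset_description.json": ["Authors", "Name"],
    "func": ["EchoTime", "TaskName"],
}
# Pretend the removal silently failed: 'EchoTime' is still present.
fields_to_remove = ["EchoTime"]

# Iterating a dict yields its keys, so this set holds group names only.
keys_only = set(new_fields)
false_pass = not keys_only.intersection(fields_to_remove)

# Flattening the values compares actual metadata field names instead.
flattened = {field for fields in new_fields.values() for field in fields}
still_present = flattened.intersection(fields_to_remove)
```

Here false_pass is True even though the field was never removed, while the flattened comparison reports the leftover field.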

Enhance print-metadata-fields to group by root JSON and modalities with unique fields (b0ecab2)


Fix get_all_metadata_fields to handle both longitudinal and cross-sectional BIDS datasets (bbd8881)


karl-koschutnig added 3 commits October 18, 2025 12:02

cursor bot reviewed Oct 18, 2025

karl-koschutnig wants to merge 5 commits into PennLINC:main from karl-koschutnig:feature/group-metadata-fields
