1 rewrite of scan batch dir by bdgregg · Pull Request #2 · ulsdevteam/scan-batch-dir

bdgregg · 2026-02-26T15:59:30Z

This PR addresses issue #1.

This is a full rewrite of the original scan-batch-dir script to allow it to be more modular so that changes needed in the future should be easier to implement. This also added the ability to use PDF files as newspaper issues.

…e islandora model in the model map.

…e Images.

ctgraham

Initial readthrough with some questions and suggestions.

scan-batch-dir

ctgraham · 2026-02-26T17:07:38Z

scan-batch-dir

+        logger.info(f"File is model: {imodel}, TID: {field_model}")
+
+        # Process any .tif files.
+        if (file_ext.lower() == ".tif"):


Special processing by type would be a good candidate to break out into separate functions for readability.

The pattern of "Handle top level files" ... "Build row data" is also heavily repeated here.

Agreed with separate functions for readability. This would probably be a next step as I was building these out as I went along.

scan-batch-dir

ctgraham · 2026-02-26T17:17:24Z

scan-batch-dir

+            'resouce_type': 'Text',
+            'child': 'File',
+        },
+        'Publication Issue 1': {


Can "1" and "2" be given semantically meaningful names?

Would love to, have any suggestions? Maybe "Publication Issue Paged" vs "Publication Issue PDF"? Not sure if MAD would want different names.

ctgraham · 2026-02-26T17:19:00Z

scan-batch-dir

+        "field_weight","field_model","model","field_resource_type","transcript"]
+
+    # Global file patterns to skip over.
+    globals()['skip'] = ["ignore",".jp2",".metadata","meta",".opex",".fits",


Are these skip patterns documented outside of this code?

Probably not yet. Was thinking on adding the list to the config file to allow customization.

…as being the first value from the return column.

bdgregg added 11 commits February 18, 2026 10:50

Initial issue commit.

7cc0f7a

Adjusting Islandora Models.

7075968

A bit of cleanup.

fb43989

Built out the PDF model, updated the model map, and explicitly set th…

97078c7

…e islandora model in the model map.

Added additional fields for PDFs and added minimal row_data for Simpl…

2d83e10

…e Images.

Added tables to the README.md file.

0a92610

Updated the tables in the README.md file.

e946d26

Updated the tables in the README.md file.

cd956a3

Updated the tables in the README.md file.

a6cce36

Updated the tables in the README.md file.

05ed05c

Updated the tables in the README.md file.

90a0624

bdgregg requested review from alex-wreschnig, chryslovelace, ctgraham, ojas-uls-dev and rzhang152 February 26, 2026 15:59

bdgregg self-assigned this Feb 26, 2026

bdgregg linked an issue Feb 26, 2026 that may be closed by this pull request

Rewrite of scan-batch-dir #1

Open

ctgraham requested changes Feb 26, 2026

View reviewed changes

bdgregg added 8 commits February 26, 2026 13:12

Fixed spacing in function call parameters.

ea1316c

Adjust function documentation to correctly describe the return value …

a7226d5

…as being the first value from the return column.

Added function documentation to process_file.

d2c1561

Adding some argument signatures to functions.

f04de97

Removed some unused functions.

4726124

Added the missing $ in the regex.

736b240

Remove function in preference for in-line code.

607681e

Removed unused function dump_df_columns.

91d8a30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1 rewrite of scan batch dir#2

1 rewrite of scan batch dir#2
bdgregg wants to merge 19 commits intomasterfrom
1-rewrite-of-scan-batch-dir

bdgregg commented Feb 26, 2026

Uh oh!

ctgraham left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ctgraham Feb 26, 2026

Uh oh!

ctgraham Feb 26, 2026

Uh oh!

bdgregg Feb 26, 2026

Uh oh!

Uh oh!

Uh oh!

ctgraham Feb 26, 2026

Uh oh!

bdgregg Feb 26, 2026

Uh oh!

ctgraham Feb 26, 2026

Uh oh!

bdgregg Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bdgregg commented Feb 26, 2026

Uh oh!

ctgraham left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ctgraham Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

ctgraham Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

bdgregg Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ctgraham Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

bdgregg Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

ctgraham Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

bdgregg Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants