General Update For Paragraph Splitting, Metadata Preservation, and MongoDB Prediction #9

Open
walkernr wants to merge 35 commits into lbnlp:main from walkernr:main

@walkernr

These changes implement a more robust paragraph-splitting approach that resolves many edge-case errors. Additionally, metadata containing the DOI and paragraph number is now passed through the dataloader and the prediction function. Several errors in the prediction function caused by incompatibility with the model trainer were also fixed, along with problems with MongoDB prediction.
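
For readers unfamiliar with the idea, here is a minimal sketch of paragraph splitting that keeps the DOI and paragraph number attached to each piece of text. It is not the code in this PR: the function name `split_paragraphs` and the record keys `doi`, `par`, and `text` are hypothetical, and splitting on blank lines is only an assumed rule, not necessarily the approach the PR takes.

```python
import re
from typing import Dict, List, Union


def split_paragraphs(text: str, doi: str) -> List[Dict[str, Union[str, int]]]:
    """Split text on blank lines and tag each paragraph with its DOI and index.

    Hypothetical helper for illustration; names and keys are assumptions.
    """
    # Normalize Windows and old Mac line endings so one pattern covers all inputs.
    normalized = text.replace("\r\n", "\n").replace("\r", "\n")
    # A paragraph break is a newline, optional whitespace, then another newline.
    chunks = re.split(r"\n\s*\n", normalized)
    records: List[Dict[str, Union[str, int]]] = []
    for chunk in chunks:
        # Collapse internal line wraps and repeated spaces into single spaces.
        paragraph = " ".join(chunk.split())
        if not paragraph:
            # Skip chunks that contained only whitespace.
            continue
        # The paragraph number counts only non-empty paragraphs, starting at 0.
        records.append({"doi": doi, "par": len(records), "text": paragraph})
    return records
```

Because each record carries its own `doi` and `par`, a downstream prediction step can return results keyed to the source paragraph without relying on list order.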
