StructSense Pipeline Upgrade: Specialized Tools, Robust Chunking, and BioPortal Integration#62
Open
tekrajchhetri wants to merge 143 commits intomainfrom
Open
StructSense Pipeline Upgrade: Specialized Tools, Robust Chunking, and BioPortal Integration#62tekrajchhetri wants to merge 143 commits intomainfrom
tekrajchhetri wants to merge 143 commits intomainfrom
Conversation
Note: to disable traces run "crewai traces disable"
…han just spacy NER
sample output
{
"text": "photopic spectral sensitivity curves",
"label": "CONCEPT",
"occurrences": [
{
"start": 51,
"end": 87,
"global_start": 1127,
"global_end": 1163,
"sentence": "Recently, electroretinogram (ERG) responses of the photopic spectral sensitivity curves of photoreceptors of rats and mice were measured throughout the UV-visible spectrum (300-700 nm) (Rocha et al., 2016)."
}
]
},
{
"text": "photoreceptors",
"label": "ANATOMICAL-CONCEPT",
"occurrences": [
{
"start": 91,
"end": 105,
"global_start": 1167,
"global_end": 1181,
"sentence": "Recently, electroretinogram (ERG) responses of the photopic spectral sensitivity curves of photoreceptors of rats and mice were measured throughout the UV-visible spectrum (300-700 nm) (Rocha et al., 2016)."
}
]
},
{
"text": "rats and mice",
"label": "ORGANISM",
"occurrences": [
{
"start": 109,
"end": 122,
"global_start": 1185,
"global_end": 1198,
"sentence": "Recently, electroretinogram (ERG) responses of the photopic spectral sensitivity curves of photoreceptors of rats and mice were measured throughout the UV-visible spectrum (300-700 nm) (Rocha et al., 2016)."
}
]
},
…loading of models.
Collaborator
Author
|
WIP: search and reranking for ontology alignment. |
Move usage documentation (CLI, Python API, known issues) from tutorial/readme.md into the root README.md and delete the tutorial copy. Closes task 1 of #69.
removed link to brainkb documentation and moved `tutorials` section further down
Separate symptom and resolution into distinct sections for the pip resolution-too-deep and Python version known issues.
The 'Python version' issue section is unclear because the resolution is listed as if it were part of the symptom. To make it easier for users to understand and solve the problem, I recommend separating the symptom and resolution and formatting the error message as a code block for better readability. Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
This reverts commit 4b320c6.
…ssed properly; removing src/tests from gitignore
Added link to original dataset in the readme.
Added a link to the original dataset for reference.
…runs these tests only when openrouter is available
NER Benchmarking Dataset
…ate directory in readme
Merge tutorial README into root README
adding source checks and source_text option
adding option to set task_type + tests
Add NER tests
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This pull request introduces major improvements to StructSense, including the addition of specialized tools, fixes to the chunking mechanism, and integration with BioPortal as the ontology database.
What’s Included
Issues this PR addresses