Skip to content

StructSense Pipeline Upgrade: Specialized Tools, Robust Chunking, and BioPortal Integration#62

Open
tekrajchhetri wants to merge 143 commits intomainfrom
improvement
Open

StructSense Pipeline Upgrade: Specialized Tools, Robust Chunking, and BioPortal Integration#62
tekrajchhetri wants to merge 143 commits intomainfrom
improvement

Conversation

@tekrajchhetri
Copy link
Copy Markdown
Collaborator

@tekrajchhetri tekrajchhetri commented Feb 10, 2026

This pull request introduces major improvements to StructSense, including the addition of specialized tools, fixes to the chunking mechanism, and integration with BioPortal as the ontology database.

What’s Included

  • Detailed list of changes: see the Change Log
  • Usage tutorials: see the Tutorial for instructions on running StructSense
  • Developer documentation: see the Developer Guide
  • Updated configuration templates: available in the config_template directory

Issues this PR addresses

Note: to disable traces run "crewai traces disable"
…han just spacy NER

sample output

{
      "text": "photopic spectral sensitivity curves",
      "label": "CONCEPT",
      "occurrences": [
        {
          "start": 51,
          "end": 87,
          "global_start": 1127,
          "global_end": 1163,
          "sentence": "Recently, electroretinogram (ERG) responses of the photopic spectral sensitivity curves of photoreceptors of rats and mice were measured throughout the UV-visible spectrum (300-700 nm) (Rocha et al., 2016)."
        }
      ]
    },
    {
      "text": "photoreceptors",
      "label": "ANATOMICAL-CONCEPT",
      "occurrences": [
        {
          "start": 91,
          "end": 105,
          "global_start": 1167,
          "global_end": 1181,
          "sentence": "Recently, electroretinogram (ERG) responses of the photopic spectral sensitivity curves of photoreceptors of rats and mice were measured throughout the UV-visible spectrum (300-700 nm) (Rocha et al., 2016)."
        }
      ]
    },
    {
      "text": "rats and mice",
      "label": "ORGANISM",
      "occurrences": [
        {
          "start": 109,
          "end": 122,
          "global_start": 1185,
          "global_end": 1198,
          "sentence": "Recently, electroretinogram (ERG) responses of the photopic spectral sensitivity curves of photoreceptors of rats and mice were measured throughout the UV-visible spectrum (300-700 nm) (Rocha et al., 2016)."
        }
      ]
    },
@tekrajchhetri
Copy link
Copy Markdown
Collaborator Author

WIP: search and reranking for ontology alignment.

puja-trivedi and others added 29 commits February 25, 2026 10:51
Move usage documentation (CLI, Python API, known issues) from
tutorial/readme.md into the root README.md and delete the tutorial
copy. Closes task 1 of #69.
removed link to brainkb documentation and moved `tutorials` section further down
Separate symptom and resolution into distinct sections for the pip
resolution-too-deep and Python version known issues.
The 'Python version' issue section is unclear because the resolution is listed as if it were part of the symptom. To make it easier for users to understand and solve the problem, I recommend separating the symptom and resolution and formatting the error message as a code block for better readability.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
This reverts commit 4b320c6.
…ssed properly; removing src/tests from gitignore
Added link to original dataset in the readme.
Added a link to the original dataset for reference.
…runs these tests only when openrouter is available
adding source checks and source_text option
adding option to set task_type + tests
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants