Skip to content

YAML document delimiters are sometimes contained in eval results #25

@valerie-autumn-skye

Description

@valerie-autumn-skye

Some responses from the chat agent contain a series of three hyphens --- which are also used as YAML document delimiters. In the eval results, these are always indented with the text, so perhaps these are not a problem. Just getting this documented here in case an issue arises in the future.

Partial eval results from literature_mcp_eval:

- model: claude-4-sonnet
  coder: goose
  case_name: PMC4831113_Full_Text
  metric_name: CorrectnessMetric
  score: 0.9
  passed: true
  reason: The actual output fully includes the expected output, providing the exact
    text without contradiction. It goes beyond by offering additional details about
    the new species' characteristics. The only minor issue is the inclusion of a 'Description'
    section, which wasn't explicitly requested, but this aligns with evaluation step
    2 by providing more detail rather than omitting information.
  actual_output: 'I''ll help you get the Conclusions section of PMC4831113. Let me
    retrieve the full text of this paper first.

    ## Conclusions Section of PMC4831113


    Here is the **Conclusions** section from the paper "High-quality draft genome
    sequence and description of Haemophilus massiliensis sp. nov.":


    ---


    ### Conclusions


    On the basis of phenotypic, phylogenetic and genomic analyses, we formally propose
    the creation of **Haemophilus massiliensis** sp. nov. that contains strain FF7T
    (CSUR P859 = DSM 28247) which is the type strain. The strain was isolated from
    a peritoneal fluid specimen from a 44-year-old Senegalese woman admitted to Hôpital
    Principal in Dakar, Senegal.

Metadata

Metadata

Labels

documentationImprovements or additions to documentation

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions