Skip to content

output VCF is not spec compliant - ACMG_DISEASE_NAME can contain commas #629

@ameynert

Description

@ameynert

The output VCF can contain commas in the ACMG_DISEASE_NAME field of the Exomiser INFO key. Could these be replaced with semi-colons or underscores?

[Exomiser/exomiser-core/src/main/java/org/monarchinitiative/exomiser/core/writers/VcfResultsWriter.java](

fields.add('"' + assignment.map(acmgAssignment -> acmgAssignment.disease().diseaseName().replace(" ", "_")).orElse("") + '"');

This breaks spec compliance which says "commas are permitted only as delimiters for lists of values".

Because the Exomiser INFO field is set as `Number=.' in the VCF, pysam interprets these commas as breaking between multiple instances of the field, which breaks parsing the field into its subfields.

related to #620

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions