feat: expose speaker embeddings and subsegments in DiarizeResult by smm-h · Pull Request #4 · FoxNoseTech/diarize

smm-h · 2026-03-25T09:40:10Z

Summary

The diarize() function already computes speaker embeddings and subsegments via extract_embeddings(), but these are discarded before building the DiarizeResult. This change simply preserves them on the result object by adding two new optional fields.

Changes

utils.py: Added embeddings: Any = None and subsegments: list[SubSegment] | None = None fields to DiarizeResult. Added model_config = ConfigDict(arbitrary_types_allowed=True) to support numpy arrays in the Pydantic model.
__init__.py: Pass embeddings and subsegments to the DiarizeResult constructor in diarize().

Motivation

Use case: cross-recording speaker clustering and identification. When processing multiple audio files, having access to the raw speaker embeddings allows users to cluster or match speakers across recordings -- something that is not possible with just the segment labels.

Notes

Both fields default to None, so the change is fully backward-compatible.
No performance impact -- this just stores a reference to already-computed data instead of discarding it.

feat: expose speaker embeddings and subsegments in DiarizeResult

5ea110b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: expose speaker embeddings and subsegments in DiarizeResult#4

feat: expose speaker embeddings and subsegments in DiarizeResult#4
smm-h wants to merge 1 commit intoFoxNoseTech:mainfrom
smm-h:main

smm-h commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

smm-h commented Mar 25, 2026

Summary

Changes

Motivation

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant