Skip to content

Assign users#1158

Open
cameledev wants to merge 9 commits intomainfrom
assign-users
Open

Assign users#1158
cameledev wants to merge 9 commits intomainfrom
assign-users

Conversation

@cameledev
Copy link
Copy Markdown
Collaborator

@cameledev cameledev commented Mar 17, 2026

Purpose

Users

Proposal

Build upon: #741

  • Use newly implemented VAD in metadata to associated user names to speech intervals. Use intersect these intervals with WhisperX output and compute score (overlap / speaker_duration). Assign name for largest score (if score > threshold).
  • This method allows handling of multiple people for a same user (multilple people on same device).
  • User the declared user name in transcription. If multiple people are detected for a user, also add the assigned user. (e.g: charlie (SPEAKER_2))
  • Add tests for user_assign
  • Some code restructuring was necessary

Example Output

Two speakers, single user/microphone (named Cameledev)

{
  "title": "Meeting \"qgd-fnzu-enl\" on 2026-04-22 at 10:15",
  "content": "\n*[Download your recording (external link)](http://localhost:3000/recording/dbd5ce3e-1905-48b0-8eb1-50ea0b20a369)*\n\n\n **Cameledev (SPEAKER_01)**:  Je parle avant que la trans... Je parle après que la transcription ait commencé. Cette fois-ci, je parle plus longtemps.\n\n **Cameledev (SPEAKER_00)**: Je parle avec une deuxième voix différente qui n'est pas la même que la première voix.\n\n **Cameledev (SPEAKER_01)**: Je reparle avec la première voix. Stop.",
  "email": "testmail@test.com",
  "sub": "042bdc02-9a87-436b-a3a3-9960c43ac425"
}

Comment thread .github/workflows/meet.yml Outdated
@cameledev cameledev force-pushed the add-tests-and-ci-testing branch from 7856de6 to efa0d48 Compare March 17, 2026 16:42
@cameledev cameledev changed the base branch from add-tests-and-ci-testing to metadata-agent-leo March 25, 2026 14:03
@cameledev cameledev force-pushed the assign-users branch 2 times, most recently from ce51ac6 to b438c4b Compare March 31, 2026 16:35
Comment on lines +201 to +203
# TODO: change how we get metadata_filename
output_folder = os.getenv("AWS_S3_OUTPUT_FOLDER", "metadata")
metadata_filename = f"{output_folder}/{recording.id}-metadata.json"
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What should we do with this ?

@cameledev cameledev force-pushed the metadata-agent-leo branch from bf54fea to 80cca39 Compare April 7, 2026 10:16
@lebaudantoine lebaudantoine force-pushed the metadata-agent-leo branch 3 times, most recently from b90bee2 to 8e6e17d Compare April 17, 2026 08:59
Base automatically changed from metadata-agent-leo to main April 17, 2026 10:17
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment on lines +171 to +172
recording_datetime,
owner_timezone,
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Put back types. Can't recording_datetime be None ?

Comment on lines +227 to +228
worker_id: Optional[str],
owner_timezone: Optional[str],
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

use | None for consistency

Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
Comment thread src/summary/summary/core/celery_worker.py Fixed
command:
- "python"
- "multi_user_transcriber.py"
# - "metadata_collector.py" # TODO: FIX THIS
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Didn't look into this

@cameledev cameledev marked this pull request as ready for review April 22, 2026 09:18
@sonarqubecloud
Copy link
Copy Markdown

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant