Skip to content

Popular repositories Loading

  1. TaDiCodec TaDiCodec Public

    This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…

    Python 76 3

  2. SpeechJudge SpeechJudge Public

    SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)

    Python 70 4

  3. SD-Eval SD-Eval Public

    [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

    Python 56 2

  4. FlexiCodec FlexiCodec Public

    [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

    Python 45 5

  5. AnyAccomp AnyAccomp Public

    AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck.

    Python 36 2

  6. SA-Eval SA-Eval Public

    8

Repositories

Showing 9 of 9 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…