-
Fudan University
- Shanghai
-
02:04
(UTC -12:00) - https://gyt1145028706.github.io/
Highlights
- Pro
Pinned Loading
-
XY-Tokenizer
XY-Tokenizer PublicThis is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs
-
OpenMOSS/MOSS-Audio-Tokenizer
OpenMOSS/MOSS-Audio-Tokenizer PublicMOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …
-
OpenMOSS/MOSS-TTS
OpenMOSS/MOSS-TTS PublicMOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…
-
OpenMOSS/MOSS-TTSD
OpenMOSS/MOSS-TTSD PublicMOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…
-
SpeechGPT-2.0-preview
SpeechGPT-2.0-preview PublicForked from OpenMOSS/SpeechGPT-2.0-preview
GPT-4o-level, real-time spoken dialogue system.
Python
-
OpenMOSS/MOSS-TTS-Nano
OpenMOSS/MOSS-TTS-Nano PublicMOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…
If the problem persists, check the GitHub status page or contact support.

