gyt1145028706

Yitian Gong gyt1145028706

My name is Yitian Gong. I am a master student majoring in Computer Science at Fudan University in China.

Achievements

XY-Tokenizer XY-Tokenizer Public

This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs

Python 92 5
OpenMOSS/MOSS-Audio-Tokenizer OpenMOSS/MOSS-Audio-Tokenizer Public

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …

Python 198 13
OpenMOSS/MOSS-TTS OpenMOSS/MOSS-TTS Public

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python 1.7k 159
OpenMOSS/MOSS-TTSD OpenMOSS/MOSS-TTSD Public

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

Python 1.3k 126
SpeechGPT-2.0-preview SpeechGPT-2.0-preview Public

Forked from OpenMOSS/SpeechGPT-2.0-preview

GPT-4o-level, real-time spoken dialogue system.

Python
OpenMOSS/MOSS-TTS-Nano OpenMOSS/MOSS-TTS-Nano Public

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

Python 2.3k 311