Add DiffSingerThaiPhonemizer #2002

Open
printto wants to merge 1 commit into stakira:master from printto:diffs_th

printto commented Mar 12, 2026

New pull request based on #1274 and #1435, but with clean commits and the new phonemizer API. Tested locally.

  • Trained ThaiG2p on Thai symbols, WER 0.0396: 99.96% of the ~31K test words were predicted correctly, and any errors were off by at most 1 or 2 phonemes.
  • This G2p uses slightly different hyperparameters and a very different optimizer for training, but in my experiments the results are dramatically more accurate than with the default parameters. The G2p.onnx model file is roughly 14 MB. I have not noticed it causing any slowdown in OpenUTAU.
  • Added the G2p to the Lyric Helper and Phonetic Assistant.
  • Included a DiffSinger G2p phonemizer for Thai. It reads "dsdict-th.yaml" and directly supports Thai characters.
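
For readers unfamiliar with the metric, the following is a minimal sketch of how a word error rate for a G2p model is commonly computed: a word counts as wrong if its predicted phoneme sequence differs from the reference in any way. It is not the evaluation script used for this pull request. The function names and the phoneme symbols in the example are hypothetical placeholders, not PrintMov's phoneme set, and the result is expressed as a percentage on the assumption that the 0.0396 figure above is a percentage (which is what the 99.96% gloss implies).

```python
from typing import Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def word_error_rate(refs: Sequence[Sequence[str]],
                    hyps: Sequence[Sequence[str]]) -> float:
    """Percentage of words whose predicted phonemes differ from the reference."""
    if len(refs) != len(hyps):
        raise ValueError("refs and hyps must have the same number of words")
    if not refs:
        raise ValueError("at least one word is required")
    wrong = sum(1 for r, h in zip(refs, hyps) if list(r) != list(h))
    return 100.0 * wrong / len(refs)


# Hypothetical placeholder phonemes, for illustration only.
refs = [["k", "a"], ["m", "aa", "n"], ["s", "i"], ["t", "o", "ng"]]
hyps = [["k", "a"], ["m", "a", "n"], ["s", "i"], ["t", "o", "ng"]]

wer = word_error_rate(refs, hyps)          # one of four words is wrong
accuracy = 100.0 - wer                     # word-level accuracy in percent
off_by = edit_distance(refs[1], hyps[1])   # how far off the wrong word is
```

Under the same percentage assumption, a WER of 0.0396 corresponds to a word accuracy of 100 - 0.0396 = 99.9604%, and `edit_distance` is one way to check a claim such as "errors are off by at most 1 or 2 phonemes".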

The phonemes used for this G2p and phonemizer follow PrintMov's phoneme system, which you can see here. A public dataset is available for training with this phoneme system, so there is precedent for including it in OpenUTAU.

See the example using Printto Magicbeat's Thai DiffSinger DB: [image]

  • Add ThaiG2p, trained on custom hparams for a very good Thai G2p (99% accurate!!)
  • Add DIFFS TH Phonemizer.

printto marked this pull request as ready for review on March 12, 2026, 04:41