STTS | WikispeechWikimedia's official information about the project: www.mediawiki.org/wiki/Wikispeech
Below is a list of components to install for Wikispeech. Go to each git repo (linked below), and follow installation instructions in the README file.
| Symbolset | Symbolset/mapping server | Tested commit ID: 3d7fb68 | ||
| Pronlex | Lexicon server | Tested commit ID: 3130d31 | ||
| Matcha TTS | Adapter for Matcha TTS | Tested commit ID: 3e234e6 | ||
| Piper TTS | Adapter for Piper TTS | Tested commit ID: 3e234e6 | ||
| Wikispeech-server | The main Wikispeech-server | Tested commit ID: b9cd3f4 |
$HOME/.local/share/deep_phonemizer:
dp_single_char_swe_sv.pt
mkdir -p $HOME/.local/share/matcha_tts
cd $HOME/.local/share/matcha_tts
curl -L https://github.com/shivammehta25/Matcha-TTS-checkpoints/releases/download/v1.0/generator_v1 -o hifigan_T2_v1
curl -L https://github.com/shivammehta25/Matcha-TTS-checkpoints/releases/download/v1.0/g_02500000 -o hifigan_univ_v1
curl -L https://github.com/shivammehta25/Matcha-TTS-checkpoints/releases/download/v1.0/matcha_ljspeech.ckpt -o matcha_ljspeech.ckpt
curl -L https://github.com/shivammehta25/Matcha-TTS-checkpoints/releases/download/v1.0/matcha_vctk.ckpt -o matcha_vctk.ckpt
From STTS' Google Drive folder, download the following files to $HOME/.local/share/matcha_tts:
martin_singlechar_ipa.{ckpt,json}marianne_singlechar_ipa_20251119.{ckpt,json}
mkdir -p $HOME/.local/share/piper_tts
cd $HOME/.local/share/piper_tts
wget https://huggingface.co/rhasspy/piper-voices/resolve/v1.0.0/ar/ar_JO/kareem/medium/ar_JO-kareem-medium.onnx
wget https://huggingface.co/rhasspy/piper-voices/resolve/v1.0.0/ar/ar_JO/kareem/medium/ar_JO-kareem-medium.onnx.json
wget https://huggingface.co/rhasspy/piper-voices/resolve/v1.0.0/en/en_US/bryce/medium/en_US-bryce-medium.onnx
wget https://huggingface.co/rhasspy/piper-voices/resolve/v1.0.0/en/en_US/bryce/medium/en_US-bryce-medium.onnx.json
Alignment patching for Arabic and US English:
cd <WIKISPEECH-TTS-WRAPPERS>/piper_server
source .venv/bin/activate
uv pip install onnx
python3 -m piper.patch_voice_with_alignment $HOME/.local/share/piper_tts/en_US-bryce-medium.onnx
python3 -m piper.patch_voice_with_alignment $HOME/.local/share/piper_tts/ar_JO-kareem-medium.onnx
From STTS' Google Drive folder, download the following files to $HOME/.local/share/piper_tts:
martin_phonemes_20251017_epoch_alignments.{onnx,onnx.json}marianne_singlechar_ipa_20251119.{onnx,onnx.json}start_wikispeech.sh script for clues. Please note that the sub-servers must be up and running before the main Wikispeech server can be started.
Main API/Wikispeech server: http://localhost:10000
Sub-servers:
At the moment, there is no comprehensive documentation available on how to build a new voice or a new language for Wikispeech.
If you want to contribute to Wikispeech, please consult with the Wikimedia team who will coordinate community based improvements. You can read more about how to contribute here: www.mediawiki.org/wiki/Wikispeech. You are also welcome to contact us at stts.se/contact if you want./admin/lex_import_page, or from the command line using importLex.
Textprocessor:
{"name":"TEXTPROCESSOR_NAME", "lang":"MYLANGAUGE",
"components":[
{
"module":"adapters.marytts_adapter",
"call":"marytts_preproc",
"mapper": {
"from":"LEXICON_SYMBOLSET",
"to":"MARYTTS_SYMBOLSET"
},
},
{
"module":"adapters.lexicon_client",
"call":"lexLookup",
"lexicon":"MY_LEXICON_NAME"
}
]
}
Voice:
{
"lang":"MYLANGUAGE",
"name":"MARYTTS_VOICE_NAME",
"engine":"marytts",
"adapter":"adapters.marytts_adapter",
"mapper": {
"from":"LEXICON_SYMBOL_SET",
"to":"MARYTTS_SYMBOL_SET"
}
}
An UML diagram over the Wikispeech server: github.com/stts-se/wikispeech-server/wiki/uml/basic.png
A list of all STTS' public repositories on github: stts-se.github.io/repos
This work was supported by the Swedish Post and Telecom Authority (PTS) through the grant "Wikispeech – en användargenererad talsyntes på Wikipedia" (2016–2017), and by Swedish Inheritance Fund ("Allmänna Arvsfonden") through the grant "Wikispeech Talsyntes och taldatainsamlare" (2024–2026).
© STTS Södermalms talteknologiservice 2018-2026 | stts.se