click any column header to sort

TTS Bench — Samples — mac-default

Rig: mac-m4 — Apple M4 (10C) · Apple M4 GPU (MPS) · 16 GB RAM · Darwin 25.5.0

Label: default voice

5 prompt(s) · one section per prompt · all models ranked by warm TTFA (fastest first) within each

Each prompt section shows every model's audio output, ordered by warm TTFA (fastest first). Click any audio player to hear that model's rendering.

Prompt 1

[en]"Open the browser and read my email."

Rank	Model	Device	TTFA warm	Audio
1	Voxtral 4B TTS	mps	2.29s

Prompt 2

[en]"I'll start a new git branch, push the changes, and open a pull request when the tests pass."

Rank	Model	Device	TTFA warm	Audio
1	Voxtral 4B TTS	mps	4.58s

Prompt 3

[en]"The Parakeet TDT zero point six billion parameter model achieves one point six nine percent word error rate on LibriSpeech test-clean, beating Whisper Large V3 at two point seven percent while running at over two thousand times realtime on a single GPU."

Rank	Model	Device	TTFA warm	Audio
1	Voxtral 4B TTS	mps	14.58s

Prompt 4

[en]"Run pytest tests slash test underscore voice dot py with verbose flag and capture flag set to no."

Rank	Model	Device	TTFA warm	Audio
1	Voxtral 4B TTS	mps	6.79s

Prompt 5

[fr]"Bonjour, je m'appelle Cicero et je vais vous aider avec votre code aujourd'hui."

Rank	Model	Device	TTFA warm	Audio
1	Voxtral 4B TTS	mps	3.59s