Supertonic TTS service. #122

o-alexandre-felipe · 2025-12-08T11:24:40Z

feat: Add Supertonic TTS speech service

This PR adds a new speech service for Supertonic TTS, a fast, high-quality offline text-to-speech engine using ONNX models.

Features:

Zero configuration: Models auto-download from Hugging Face Hub on first use (~250MB)
Fully offline: After initial download, no internet connection required
Fast inference: Up to 167x faster than real-time on modern hardware
Multiple voices: 4 voice styles available (M1, M2 - male, F1, F2 - female)
Configurable quality: Adjustable denoising steps (speed vs quality tradeoff)

Installation:

    pip install "manim-voiceover[supertonic]"

Usage:

    from manim_voiceover.services.supertonic import SupertonicService

    service = SupertonicService()

    service = SupertonicService(
        voice_style="F1",
        total_step=5,
        speed=1.0,
    )

Dependencies added:

onnxruntime - ONNX model inference
soundfile - Audio file I/O
huggingface-hub - Model downloading

Testing:
Verified in clean Docker container. All dependencies install correctly and TTS generates audio successfully.

Supertonic TTS service.

e720e7b

o-alexandre-felipe requested a review from osolmaz as a code owner December 8, 2025 11:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Supertonic TTS service. #122

Supertonic TTS service. #122

Uh oh!

o-alexandre-felipe commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Supertonic TTS service. #122

Are you sure you want to change the base?

Supertonic TTS service. #122

Uh oh!

Conversation

o-alexandre-felipe commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant