MOSS-TTS Family

MOSS-TTS Family: an open-source family of speech and sound generation models

This is an exact mirror of the MOSS-TTS Family project, hosted at https://github.com/OpenMOSS/MOSS-TTS. SourceForge is not affiliated with MOSS-TTS Family.

Downloads: 2 This Week

Last Update: 4 days ago

Windows Mac Linux BSD ChromeOS

MOSS-TTS is an open-source speech and sound generation model family built for high-fidelity, expressive, and production-oriented audio workflows. It covers long-form speech, voice cloning, multi-speaker dialogue, voice design, environmental sound effects, and real-time streaming TTS. The project is designed for complex real-world use cases where a single speech model may not be enough. Its flagship model focuses on stable long speech generation, multilingual and code-switched synthesis, pronunciation control, and zero-shot voice cloning. The broader family also includes dialogue generation, prompt-based voice creation, streaming voice-agent support, and a unified audio tokenizer. It is especially useful for developers building dubbing, podcasts, audiobooks, voice assistants, character voices, and creative audio tools.

Features

High-fidelity text-to-speech generation
Zero-shot voice cloning
Long-form speech synthesis
Multi-speaker dialogue generation
Real-time streaming TTS
Sound effect and voice design support

Categories

AI Models, Text-to-Speech (TTS) Models

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models, Python Text-to-Speech (TTS) Models

Registered

2026-05-28
