Pinokio

MOSS-TTSD

https://github.com/OpenMOSS/MOSS-TTSD
Updated 2/17/2026, 10:25:40 AM; indexed 2/22/2026, 5:11:49 AM

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enabling zero-shot voice cloning from short audio references.
