LongCat AudioDiT
InstallableDiffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.
Community
Loading...
Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.