LongCat AudioDiT
v5.0Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.
Sort
Loading…
Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.