@sup3rmass1ve

bitcoin, 1N942jHr6vVuR2KAe2JEf3nN59eR21tpKv X, https://x.com/SUP3RMASS1VE Github, https://github.com/SUP3RMASS1VE Discord, https://discord.gg/mvDcrA57AQ

@sup3rmass1ve

Creations by @sup3rmass1ve

59 total

Vibevoice Realtime Pinokio

Janus Pro 7B is a powerful multimodal AI model designed for advanced image understanding and text-to-image generation.

Maya1-TTS

Updated 8mo ago

Generate realistic and expressive speech with natural language voice design.

Step-Audio-Edit-LOWVRAM

Updated 8mo ago

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics,

SongBloom-Windows

Updated 8mo ago

SongBloom, a novel framework for full-length song generation

hallo2

Updated 8mo ago

(WINDOWS)NVIDIA, Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

NeuTTS-Air

Updated 9mo ago

NeuTTS Air is the world’s first super-realistic, on-device, TTS speech language model with instant voice cloning. Built off a 0.5B

ZipVoice

Updated 9mo ago

Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching

Chatterbox-Multilingual

Updated 9mo ago

Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching Multilingual

index-tts-2

Updated 9mo ago

Diffusers-Image-Outpainting

Updated 9mo ago

chatterbox-SUP3R

Updated 10mo ago

SoTA open-source TTS

Re-Size-Image-Outpaint-app

Updated 10mo ago

A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.

DreamO

Updated 10mo ago

DreamO: A Unified Framework for Image Customization

Realtime-Transcription

Updated 11mo ago

Real Time Speech Transcription

Ovis2-8B

Updated 11mo ago

interacting with the Ovis2-8B model. The script allows users to load the model, process image and video inputs, and generate text-based responses using a conversational chatbot.

MegaTTS app

Higgs Audio Text-to-Speech Playground

GPUStack

Updated 1y ago

Simple, scalable AI model deployment on GPU clusters