Pinokio

@sup3rmass1ve

4 posts11 checkpointsJoined 1/25/2026, 8:54:10 PM
bitcoin, 1N942jHr6vVuR2KAe2JEf3nN59eR21tpKv X, https://x.com/SUP3RMASS1VE Github, https://github.com/SUP3RMASS1VE Discord, https://discord.gg/mvDcrA57AQ
Creations by @sup3rmass1ve
52 total
Deepseek-ai-JanusUpdated 5 months ago
https://github.com/SUP3RMASS1VE/Deepseek-ai-Janus-Pro-7B
Janus Pro 7B is a powerful multimodal AI model designed for advanced image understanding and text-to-image generation.
Maya1-TTSUpdated 5 months ago
https://github.com/SUP3RMASS1VE/Maya1-TTS-Pinokio
Generate realistic and expressive speech with natural language voice design.
Step-Audio-Edit-LOWVRAMUpdated 5 months ago
https://github.com/SUP3RMASS1VE/Step-Audio-Edit-LOWVRAM
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics,
SongBloom-WindowsUpdated 6 months ago
https://github.com/SUP3RMASS1VE/SongBloom-Windows-Pinokio
SongBloom, a novel framework for full-length song generation
hallo2Updated 6 months ago
https://github.com/SUP3RMASS1VE/Hallo2
(WINDOWS)NVIDIA, Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
NeuTTS-AirUpdated 6 months ago
https://github.com/SUP3RMASS1VE/NeuTTS-Air-Pinokio
NeuTTS Air is the world’s first super-realistic, on-device, TTS speech language model with instant voice cloning. Built off a 0.5B
ZipVoiceUpdated 6 months ago
https://github.com/SUP3RMASS1VE/ZipVoice-Pinokio
Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching
Chatterbox-MultilingualUpdated 6 months ago
https://github.com/SUP3RMASS1VE/ChatterBox-Multilingual
Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching Multilingual
index-tts-2Updated 6 months ago
https://github.com/SUP3RMASS1VE/Index-TTS-2-Pinokio
Diffusers-Image-OutpaintingUpdated 7 months ago
https://github.com/SUP3RMASS1VE/Diffusers-Image-Outpainting
Re-Size-Image-Outpaint-appUpdated 8 months ago
https://github.com/SUP3RMASS1VE/Re-Size-Image-Outpaint
A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.
DreamOUpdated 8 months ago
https://github.com/SUP3RMASS1VE/DreamO
DreamO: A Unified Framework for Image Customization
Realtime-TranscriptionUpdated 8 months ago
https://github.com/SUP3RMASS1VE/Realtime-Transcription
Real Time Speech Transcription
Ovis2-8BUpdated 9 months ago
https://github.com/SUP3RMASS1VE/Ovis2-8B-
interacting with the Ovis2-8B model. The script allows users to load the model, process image and video inputs, and generate text-based responses using a conversational chatbot.
MeggaTTSUpdated 9 months ago
https://github.com/SUP3RMASS1VE/MegaTTS-3-Pinokio
MegaTTS app
Higgs Audio TTSUpdated 9 months ago
https://github.com/SUP3RMASS1VE/Higgs-Audio-Text-to-Speech-Pinokio
Higgs Audio Text-to-Speech Playground
GPUStackUpdated 9 months ago
https://github.com/SUP3RMASS1VE/GPUStack
Simple, scalable AI model deployment on GPU clusters
Spark-TTSUpdated 9 months ago
https://github.com/SUP3RMASS1VE/Spark-TTS
Bagel-DFloat11Updated 10 months ago
https://github.com/SUP3RMASS1VE/Bagel-DFloat11
[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM
fish-speech-SUP3RUpdated 11 months ago
https://github.com/SUP3RMASS1VE/fish-speech-SUP3R
SOTA Open Source TTS