Pinokio

@sup3rmass1ve

3 posts4 checkpointsJoined 1/25/2026, 8:54:10 PM
Creations by @sup3rmass1ve
52 total
Deepseek-ai-JanusUpdated 4 months ago
Janus Pro 7B is a powerful multimodal AI model designed for advanced image understanding and text-to-image generation.
Maya1-TTSUpdated 4 months ago
Generate realistic and expressive speech with natural language voice design.
Step-Audio-Edit-LOWVRAMUpdated 4 months ago
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics,
SongBloom-WindowsUpdated 4 months ago
SongBloom, a novel framework for full-length song generation
hallo2Updated 4 months ago
(WINDOWS)NVIDIA, Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
NeuTTS-AirUpdated 5 months ago
NeuTTS Air is the world’s first super-realistic, on-device, TTS speech language model with instant voice cloning. Built off a 0.5B
ZipVoiceUpdated 5 months ago
Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching
Chatterbox-MultilingualUpdated 5 months ago
Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching Multilingual
Re-Size-Image-Outpaint-appUpdated 6 months ago
A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.
DreamOUpdated 6 months ago
DreamO: A Unified Framework for Image Customization
Ovis2-8BUpdated 7 months ago
interacting with the Ovis2-8B model. The script allows users to load the model, process image and video inputs, and generate text-based responses using a conversational chatbot.
Higgs Audio TTSUpdated 8 months ago
Higgs Audio Text-to-Speech Playground
GPUStackUpdated 8 months ago
Simple, scalable AI model deployment on GPU clusters
Bagel-DFloat11Updated 9 months ago
[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM