Project updates

KenjieDec/RemBG-Pinokio | v1.0 | updated 4mo ago
Pinokio WebUI for danielgatis' RemBG, a tool for removing image backgrounds.
0 check-ins | NVIDIA, AMD, Apple
neviah/Fara-Pinokio | v3.7 | updated 4mo ago
Microsoft's 7B parameter computer use agent with Gradio interface
@ramshi | 0 check-ins | NVIDIA, AMD, Apple
SUP3RMASS1VE/MiraTTS-Pinokio | v4.0 | updated 4mo ago
@sup3rmass1ve | 0 check-ins | NVIDIA, AMD, Apple
Paxurux/chatterbox-old-supermasive-vr | v3.7 | updated 4mo ago
SoTA open-source TTS
0 check-ins | NVIDIA, AMD, Apple
6Morpheus6/IndexTTS2 | v3.7 | updated 4mo ago
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application
@morpheus | 1 check-in | NVIDIA, AMD, Apple
V-Sekai-fire/pinokio-image-to-3d | v1.0.0 | updated 5mo ago
ComfyUI with TRELLIS2, GeometryPack, and UniRig custom nodes for image-to-3D generation
1 check-in | NVIDIA, AMD, Apple
heiredjio-beep/e2-f5-tts | v3.7 | updated 5mo ago
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
0 check-ins | NVIDIA, AMD, Apple
6Morpheus6/photomaker2 | v3.7 | updated 5mo ago
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
@morpheus | 1 check-in | NVIDIA, AMD, Apple
gotoolkits/ClearerVoice-Studio | v2.0 | updated 5mo ago
0 check-ins | NVIDIA, AMD, Apple
linus74rn/UmoPinokio | v1.0 | updated 5mo ago
Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO
0 check-ins | NVIDIA, AMD, Apple
sealad886/pinokio-resemble-enhance | v2.0 | updated 5mo ago
AI-powered speech denoising and enhancement (Gradio web demo and CLI).
0 check-ins | NVIDIA, AMD, Apple
DenisJunio/Z-Image-Fusion | v3.7 | updated 5mo ago
Fast, high-quality image generation using ComfyUI via a Gradio UI
0 check-ins | NVIDIA, AMD, Apple
Paxurux/Voxcpm | v3.7 | updated 5mo ago
Voice synthesis platform with smart chunking, batch processing, and voice cloning.
0 check-ins | NVIDIA, AMD, Apple
jesseloewen/Ollama_Web | updated 5mo ago
A web interface for managing and interacting with Ollama models
0 check-ins | NVIDIA, AMD, Apple
JL-Bones/Ollama_Web | updated 5mo ago
A web interface for managing and interacting with Ollama models
0 check-ins | NVIDIA, AMD, Apple
6Morpheus6/zonos | v3.7 | updated 5mo ago
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
@morpheus | 0 check-ins | NVIDIA, AMD, Apple
6Morpheus6/mvc | v3.7 | updated 5mo ago
Automatically create music videos with cuts synchronized to the music's beat.
@morpheus | 2 check-ins | NVIDIA, AMD, Apple
pinokiofactory/ACE-Step | v3.7 | updated 5mo ago
A Step Towards Music Generation Foundation Model
0 check-ins | NVIDIA, AMD, Apple