Store
Explore tags
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
A complete Pinokio installation package for running Mistral AI's Voxtral locally with a beautiful Gradio web interface.
Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.
ComfyUI Chatterbox TTS & Voice Conversion Node
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
A complete Pinokio installation package for running FLUX.1-Krea-dev locally with a beautiful Gradio web interface for advanced text-to-image generation.
🔥 [ICCV 2025 Highlight] Official open-source repo for LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition
Contribute to ZeusDevs666/ZeusGPT development by creating an account on GitHub.
🎬 Professional Video Dubbing Pipeline with Parakeet-TDT-0.6b-v2, Gemini AI, and Edge TTS. Complete solution for automated video dubbing with step-by-step processing and batch video creation from multiple audio files.
Compatible with all CUDA cards. Windows and linux
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
[NVIDIA ONLY] End-to-end multimodal SVG generator capable of generating complex and detailed SVGs, from simple icons to intricate anime characters. (Minimum Requirements 12GB VRAM / 32GB RAM, Recommended Requirements 24GB VRAM / 24GB RAM)
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
SkyReels-V2: Infinite-length Film Generative model
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Installe Ollama (si pas déjà) et ajoute DeepSeek-Coder au sein de Pinokio.
A self-organizing file system with llama 3
https://hf.co/hexgrad/Kokoro-82M

Runs inference using HuggingFace models