Launcher updates

PierrunoYT/SmolLM3-3B-Pinokio · v5.0 · updated 8h ago
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
@pierrunoyt · 2 check-ins · NVIDIA, AMD, Apple
PierrunoYT/LFM2.5-350M-Pinokio · v5.0 · updated 8h ago
Paste long text, clean it into readable sections, summarize each section, and ask questions in-browser with WebGPU.
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/Higgs-Audio-V2-Pinokio · v5.0 · updated 9h ago
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/TranslateGemma-Pinokio · v5.0 · updated 9h ago
🌍 TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/Photoroom-PRX-Pinokio · v5.0 · updated 9h ago
Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model
@pierrunoyt · 2 check-ins · NVIDIA, AMD, Apple
PierrunoYT/soprano-tts-pinokio · v5.0 · updated 9h ago
Instant, Ultra-Realistic Text-to-Speech
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/KittenTTS-Pinokio · v7.0 · updated 9h ago
Ultra-lightweight text-to-speech (15M-80M params) — CPU optimized, 8 voices, ONNX-powered
@pierrunoyt · 2 check-ins · NVIDIA, AMD, Apple
PierrunoYT/VoxCPM-2-Pinokio · v5.0 · updated 10h ago
Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).
@pierrunoyt · 2 check-ins · NVIDIA, AMD, Apple
PierrunoYT/Z-Image-Pinokio · v5.0 · updated 10h ago
⚡️ Efficient 6B parameter image generation model with sub-second inference. Generate high-quality, photorealistic images with only 8 inference steps. Features bilingual text rendering (Chinese & English) and Single-Stream Diffusion Transformer architecture.
@pierrunoyt · 2 check-ins · NVIDIA, AMD, Apple
PierrunoYT/OmniVoice-Pinokio · v5.0 · updated 10h ago
Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/cohere-transcribe-pinokio · v5.0 · updated 11h ago
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/MossTTS-Pinokio · v5.0 · updated 11h ago
All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.
@pierrunoyt · 3 check-ins · NVIDIA, AMD, Apple
PierrunoYT/OrpheusTTS-Pinokio · v7.0 · updated 11h ago
Standalone Text-to-Speech using Orpheus TTS with a Gradio UI
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/LuxTTS-Pinokio · v7.0 · updated 11h ago
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/Supertonic-3-Pinokio · v5.0 · updated 12h ago
Lightning-Fast, On-Device, Multilingual TTS — Gradio, ONNX, 44.1kHz
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/OmniVoice-Studio-Pinokio · v7.0 · updated 12h ago
The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple
PierrunoYT/PersonaPlex-Pinokio · v5.0 · updated 12h ago
🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.
@pierrunoyt · 3 check-ins · NVIDIA, AMD, Apple
PierrunoYT/Sana-Pinokio · v5.0 · updated 12h ago
Fast Image Generation with Sana Diffusion Model
@pierrunoyt · 2 check-ins · NVIDIA, AMD, Apple
PierrunoYT/pocket-tts-pinokio · v5.0 · updated 12h ago
Lightweight CPU text-to-speech with preset voices and optional Hugging Face-authenticated voice cloning.
@pierrunoyt · 1 check-in · NVIDIA, AMD, Apple