Store
Explore tags

One-click install & launcher for MeiGen-AI/InfiniteTalk
Higgs Audio Text-to-Speech Playground (Requires Python 3.10+)
One-click launcher for Stable Diffusion web UI (AUTOMATIC1111/stable-diffusion-webui)

Text+Image → Video with Allegro-TI2V (Rhymes AI), local one-click via Pinokio
A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.
Gradio UI for YuE music generation model

Pinokio app to install and run sdbds/YuE-for-windows, tuned defaults for a single RTX 4060 Ti 16GB GPU. Uses Torch 2.5.1+cu124 and requirements-uv.txt.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface https://github.com/comfyanonymous/ComfyUI
DreamO: A Unified Framework for Image Customization
Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.
Dough is a open source tool for steering AI animations with precision
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)

Lip-sync vidéo avec Wav2Lip en CPU sur macOS (Intel)
Real Time Speech Transcription
Port of Facebook's LLaMA model in C/C++
Professional fee letter generation and email automation for CC Growth EIS Fund. Automatically generates and sends professional fee letters via Microsoft Graph API with Excel data integration.
🎬 Professional Video Dubbing Pipeline with Parakeet-TDT-0.6b-v2, Gemini AI, and Edge TTS. Complete solution for automated video dubbing with step-by-step processing and batch video creation from multiple audio files.