Wanted
1,650 projects
Non-launcher projects without a Pinokio launcher yet.
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
ComfyUI HiTem3D Integration - Generate 3D models from images using HiTem3D API
Real-time Vision Language Model interaction via webcam - WebRTC-based web interface
Convert your videos to densepose and use it on MagicAnimate
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
An autonomous agent that takes work, does work, gets paid, and gets better at it.
A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
The most powerful local music generation model that outperforms most commercial alternatives
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
PyTorch implementation of Real-ESRGAN model
Unleash AMD GPU Performance: Fix PyTorch ROCm detection for 4x AI/ML speedup on RX 6000/7000 series for Pinokio and developers / custom setups
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
A Gradio-based web application for performing image editing tasks using the FireRed-Image-Edit-1.0 model with accelerated 4-step inference. Supports single and multi-image editing through natural language prompts.
Deforum extension for AUTOMATIC1111's Stable Diffusion webui
Gradio-based WebUI with SAM (segment-anything)
[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution - An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
Swift client for the fal.ai model APIs
ComfyUI node for background removal, implementing InSPyreNet, the best method to date
PyTorch implementation of Real-ESRGAN model
