Project updates

SUP3RMASS1VE/LiquidAI-LFM2-Audio-1.5B · v3.7 · updated 3mo ago
LFM2-Audio-1.5B is Liquid AI's first end-to-end audio foundation model. It is designed with low latency and real-time conversation in mind.
@sup3rmass1ve · 0 check-ins · NVIDIA, AMD, Apple
Arnold2006/Joy_Caption_Alpha-Two_GUI · v3.7 · updated 3mo ago
Pinokio script for https://huggingface.co/Ole1/Joy_Caption_Batch-GUI
0 check-ins · NVIDIA, AMD, Apple
TheAwaken1/LuxTTS-Studio · v2.0 · updated 3mo ago
Gradio-based web interface for the LuxTTS voice cloning and text-to-speech model. Users generate customized speech from text using uploaded or recorded audio references, with adjustable parameters such as speed, guidance scale, and inference steps.
@theawakenone · 2 check-ins · NVIDIA, AMD, Apple
Alchemist-Production/alexandria-audiobook · v5.0 · updated 3mo ago
A tool that takes a text document containing a book or novel, ingests it with an LLM to produce an annotated script, uses a TTS API to generate the voice lines, and finally stitches them together into an audiobook in MP3 format.
0 check-ins · NVIDIA, AMD, Apple
Arnold2006/OneTrainerPinokio · v3.7 · updated 3mo ago
OneTrainer for Pinokio, vato loco
0 check-ins · NVIDIA, AMD, Apple
LoneWolfVPS/IC-Light-Pinokio · v1.0 · updated 3mo ago
Imposing Consistent Light: control the lighting of images
0 check-ins · NVIDIA, AMD, Apple
matrokweb/Wan2GP.pinokio · updated 3mo ago
Fast AI video generation for the GPU poor (Wan2.1, Hunyuan, LTV). Gradio UI at http://127.0.0.1:7860
56 check-ins · NVIDIA, AMD, Apple
supersonic13/pinokio-reforge · v1.2 · updated 3mo ago
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference. https://github.com/Panchovix/stable-diffusion-webui-reForge
0 check-ins · NVIDIA, AMD, Apple
6Morpheus6/lobe · v2.0 · updated 3mo ago
An open-source, modern-design UI and framework for ChatGPT and other LLMs. Supports speech synthesis, multimodal input, and an extensible (function-call) plugin system. https://github.com/lobehub/lobe-chat
@morpheus · 0 check-ins · NVIDIA, AMD, Apple
hobbyquaker/DreamID-V-Pinokio · v5.0 · updated 3mo ago
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
0 check-ins · NVIDIA, AMD, Apple
abauerenator/TRELLIS.2-Pinokio-Backup · v5.0 · updated 3mo ago
One-click installer for Microsoft TRELLIS.2: High-quality 3D asset generation from images with PBR textures.
0 check-ins · NVIDIA, AMD, Apple
Ethanyoyo0917/gemini-cli-pinokio · v5.0 · updated 3mo ago
Google's official AI agent for your terminal. Access Gemini 2.5 Pro with a 1M-token context window directly from the command line.
4 check-ins · NVIDIA, AMD, Apple
Deathdadev/chatterbox · v3.7 · updated 3mo ago
@death · 0 check-ins · NVIDIA, AMD, Apple
benewendebrandstudios-design/BBsPinokioAgent · updated 3mo ago
My personal AI portal
0 check-ins · NVIDIA, AMD, Apple
peanutcocktail/N8N-Pinokio · v3.7 · updated 3mo ago
Secure Workflow Automation for Technical Teams
0 check-ins · NVIDIA, AMD, Apple
SCAR6001/RELIGHT-TEST · v1.0.0 · updated 3mo ago
Relight any image using AI (SwitchLight-inspired)
0 check-ins · NVIDIA, AMD, Apple
organized-thot/pinokio-facefusion · v1.6 · updated 3mo ago
Industry-leading face manipulation platform
7 check-ins · NVIDIA, AMD, Apple
manat0912/TalkingMuse · v3.7 · updated 3mo ago
MuseTalk is a cutting-edge video-to-video (V2V) lip-sync solution engineered to deliver highly accurate and natural mouth movements synchronized to audio input. Precision LipSync: Realistic and seamless synchronization of speech audio to facial movements. Efficiently designed to run on 8–12 GB VRAM,
@manatheturipa · 1 check-in · NVIDIA, AMD, Apple
6Morpheus6/Ollama-Studio · v3.7 · updated 3mo ago
🦙 Let two models debate a topic you pick. Create custom Ollama models with your own system prompts and parameters, then use them to debate or publish them on ollama.com. Easy-to-use Gradio interface for building personalized AI models with temperature control and custom instructions.
@morpheus · 1 check-in · NVIDIA, AMD, Apple