Wanted
1,650 projects
Non-launcher projects without a Pinokio launcher yet.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Perfect Green Screen Keys made EZ!
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
Wan2.1 for Mac.
The official JavaScript (Node) library for the ElevenLabs API.
🚀🪐🌕🌑☄️🛸 Opensource equivalent of Google's Antigravity
A deep-learning-assisted comic translation tool with one-click machine translation and simple image/text editing | Yet another computer-aided comic/manga translation tool powered by deep learning
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Use Gemini to decide which Pexels videos to download for your project
Let's make loop video diffusion practical!
A cross-platform desktop application for running AI models from [WaveSpeedAI](https://wavespeed.ai), as well as many free local AI models including Z-Image.
Contribute to WaveSpeedAI/wavespeed-comfyui development by creating an account on GitHub.
An AI Hedge Fund Team
Local-first AI video intelligence platform. Index your video library with multi-modal analysis (YOLO, DeepFace, Whisper) and search it semantically via natural language. Docker-ready.
VietTTS: An Open-Source Vietnamese Text to Speech
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
https://dl.acm.org/doi/10.1145/3576915.3623209
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Kling AI Python SDK - Production-ready, type-safe Python client for Kling AI's cutting-edge video generation and media processing APIs. Supports async/await, Pydantic models, and comprehensive error h
