Wanted
1,649 projects
Non-launcher projects without a Pinokio launcher yet.
Demo of voice cloning with Vox CPM and a Vietnamese checkpoint
High-quality video and image super-resolution powered by Real-ESRGAN. Upscale your media with advanced AI.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
An extended and improved implementation of Money printer turbo
A full-stack demo showcasing a local RAG (Retrieval Augmented Generation) pipeline to chat with your PDFs.
📨 The ultimate social media scheduling tool, with a bunch of AI 🤖

📦 The official Nextcloud installation method. Provides easy deployment and maintenance with most features included in this one Nextcloud instance.
The first industrial-grade, end-to-end AI film and video production platform. Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows.
Contribute to silvertakana/worldwideview development by creating an account on GitHub.
The best Gradio web UI for AI transcription, translation, and TTS. Automatic subtitle creation using faster whisper. Easy one-click installation. Fully portable.
Sound Open Firmware
Contribute to bigai-nlco/IMTalker development by creating an account on GitHub.
A multi-voice TTS system trained with an emphasis on quality
A simple aesthetic scorer, pruner, and website you can run to view the scoring results
Bring portraits to life!
The open source coding agent.
This ComfyUI node lets you browse the Civitai gallery directly within the interface, featuring infinite scroll, advanced filters (including NSFW), and favorites management. It also allows you to retrieve prompts, metadata, and images/videos to seamlessly reuse them in your workflows.
Contribute to andrewyng/translation-agent development by creating an account on GitHub.
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation