Store
Explore tags
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. (On Windows supported by 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo)
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
The AI that actually does things https://openclaw.ai
WanGP v10.61 RTX 50XX Pinokio Upgrade - Python 3.11, PyTorch 2.10, CUDA 13.0, NVFP4 kernels. Copy files to wan.git folder, Reset + Install + Update
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
This application allows you to generate a cloned voice from a given text and speaker audio, and create a lip-synced video by combining the generated audio with a video or image. You need to provide...
Upload a video or image and an audio file to create a lip-synced video. Choose a checkpoint and adjust padding and resizing options to get the best results.
Free tool to create viral videos from YouTube, generating clips optimized for TikTok and Instagram with automatic transcription and 9:16 editing.
Seedance 2.0 is a revolutionary multi-modal video generation model that bridges the gap between AI and professional filmmaking. This repository provides the official Python client for interacting with the Seedance API.
Upload a reference audio and a transform audio to change the tone of the transform audio to match the reference audio. You'll receive the modified audio as a result.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.
