Store
Generate engaging 1- to 5-minute short stories with LLMs and convert them to audio with Coqui TTS. Supports voice cloning, built-in speakers, and multiple languages.
Plug Whisper audio transcription into a local Ollama server and output TTS audio responses.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Let's make video diffusion practical! Contribute to lllyasviel/FramePack development by creating an account on GitHub.
MagicAnimate Mini (Featured)
[NVIDIA GPU Only] An optimized version of MagicAnimate: https://github.com/sdbds/magic-animate-for-windows
Contribute to xh9998/DiffVSR development by creating an account on GitHub.
Pixel art animation and drawing web app powered by React - jvalen/pixel-art-react
Ovi is a Veo 3-like video+audio generation model that generates video and audio content simultaneously from text or text+image inputs.
Pause & disable Windows updates for any duration. Remove Recall / Copilot. Debloat / privacy / telemetry tools. Supports Windows 10 and 11.
[NVIDIA ONLY] High-Quality and Efficient 3D Mesh Generation from a Single Image (minimum requirements: 12 GB VRAM / 24 GB RAM)
An open-source OpusClip alternative
Next-generation face-swapping and enhancement (Codeberg fork of Roop). Easy GUI for images & videos.
Image Dataset Tagger for Stable Diffusion / LoRA / DreamBooth Training: https://github.com/mikeknapp/candy-machine
MAGNeT (Featured)
MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions: https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md
Generates Minecraft skins from a text prompt using the Hugging Face "monadical-labs/minecraft-skin-generator" model.
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
A Fully Self-Hosted Solution for Full-Duplex Voice Interaction - FireRedTeam/FireRedChat
