Store
MeTube
Web GUI for yt-dlp with playlist support. Download videos from YouTube and dozens of other sites.
meeting-minutes
Privacy-first AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization, built on Rust. 100% local processing, no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 self-hosted, open-source AI meeting note taker for macOS & Windows.
VoxCPM 1.5 - NVIDIA
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
GitHub - pinokiocomputer/gepeto
Contribute to pinokiocomputer/gepeto development by creating an account on GitHub.
InfernoSaber---BeatSaber-Automapper
Automapper for Beat Saber songs, using Python, Machine Learning and hundreds of user-created maps from the past years.
Hunyuan3D-2-LowVRAM
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP
Ollama Model Creator
Create custom Ollama models with your own system prompts and parameters. Easy-to-use Gradio interface for building personalized AI models with temperature control and custom instructions.
GitHub - ChasonJiang/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
VyvoTTS LFM2
High-quality Text-to-Speech powered by VyvoTTS LFM2 model with easy-to-use web interface
Moondream3 Gradio UI
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
