Explore tags
🔊 Generate high-quality, realistic speech with LuxTTS, a lightweight text-to-speech model for fast voice cloning and clear audio output.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Local-first audiobook production with cloned voices, chapter-based editing, segment regeneration, narrator + character casting, and final export in one web app. XTTS runs fully local by default. Voxtral is available as an optional cloud voice path with your own Mistral API key if you want another engine for specific voices. Voice profiles can use different engines in the same project, so you can mix narrator and character workflows without leaving Audiobook Studio. Built for iterative production: preview voices, regenerate only changed sections, queue chapter or segment work, and assemble the finished audiobook when everything is ready. Note: enabling Voxtral sends synthesis text and selected reference audio to Mistral. Learn more: https://senigami.github.io/audiobook-studio/
Check-ins3 check-ins
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, with AMD GPU support via ROCm. Windows and Linux.
Open Claude Is Open-source coding-agent CLI for OpenAI, Gemini, DeepSeek, Ollama, Codex, GitHub Models, and 200+ models via OpenAI-compatible APIs.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
ClawX is a desktop app that provides a graphical interface for OpenClaw AI agents. It turns CLI-based AI orchestration into a desktop experience without using the terminal. China website is https://clawx.com.cn.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
AI Voice Assistant — voice conversations, animated face, canvas, music generation, and more.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Pinokio launcher for Comfy LTX Desktop with GGUF and INT8 support.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
A fork of Git containing Windows-specific patches.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
One-click ComfyUI + Torch + Python installer by Inteliweb AI. https://github.com/Comfy-Org
Owner@maoper
Check-ins13 check-ins
Platforms
FunGen can be augmented with a Device Controller and a Streamer to connect to XBVR, Stash, local files. See the discord and ko-fi links for more :)
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex Integration
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple