
@rakshitsharma18
Apps @rakshitsharma18 follows
22 totalIndexTTS-21/27/2026, 12:32:35 PM
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application
Invoke1/27/2026, 12:32:30 PM
The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI
InstantStyle1/27/2026, 12:32:21 PM
Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/In...
Kokoro-TTS-Multilingual.git1/27/2026, 12:32:14 PM
Super fast Multilingual TTS supporting 54 voices across 8 languages.
IOPaint1/27/2026, 12:32:10 PM
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or even people from your pictures, and replace (powered by stable diffus...
Kokoro-FastAPI1/27/2026, 12:32:05 PM
A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other API-driven AI applications.
SongGeneration Studio1/27/2026, 11:49:10 AM
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo...
Wan2GP1/27/2026, 11:49:07 AM
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://git...
Wan2GP1/27/2026, 11:49:03 AM
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://git...
aura-sr-upscaler1/27/2026, 11:49:00 AM
AuraSR-v2 - An open reproduction of the GigaGAN Upscaler from fal.ai https://huggingface.co/spaces/gokaygokay/AuraSR-v2
MagicQuill1/27/2026, 11:48:52 AM
An intelligent, interactive Image Editing System. Easily erase and add objects on a user-friendly interface.
Comfyui1/27/2026, 11:48:48 AM
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Forge1/27/2026, 11:48:36 AM
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.co...
OpenAudio1/27/2026, 11:48:29 AM
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaud...
Hunyuan3D-2-LowVRAM1/27/2026, 11:48:26 AM
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.co...
e2-f5-tts1/27/2026, 11:48:20 AM
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
VibeVoice Realtime1/27/2026, 11:48:17 AM
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
Qwen3-TTS1/27/2026, 11:48:09 AM
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
Orpheus-TTS-FastAPI1/27/2026, 11:48:05 AM
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech s...