@rakshitsharma18

0 posts · 0 checkpoints · Joined 1/27/2026, 11:46:23 AM

Activity 0 Posts 0 Checkpoints 0 Apps 22 Creations 0 Following 3 Followers 0

Apps @rakshitsharma18 follows

22 total

IndexTTS-2 1/27/2026, 12:32:35 PM

https://github.com/6Morpheus6/IndexTTS2 v3.7 updated 12/22/2025, 2:17:20 AM

Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application

Invoke 1/27/2026, 12:32:30 PM

https://github.com/6Morpheus6/invoke v3.7 updated 12/4/2025, 5:43:02 AM

The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI

InstantStyle 1/27/2026, 12:32:21 PM

https://github.com/6Morpheus6/instantstyle v3.7 updated 11/18/2025, 7:01:35 PM

Upload an image and generate new images in its style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/In...

Kokoro-TTS-Multilingual.git 1/27/2026, 12:32:14 PM

https://github.com/6Morpheus6/Kokoro-TTS-Multilingual v3.7 updated 12/4/2025, 2:47:12 AM

Super fast Multilingual TTS supporting 54 voices across 8 languages.

IOPaint 1/27/2026, 12:32:10 PM

https://github.com/6Morpheus6/iopaint-pinokio v3.7 updated 6/20/2025, 10:07:07 PM

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or even people from your pictures, and replace (powered by stable diffus...

Kokoro-FastAPI 1/27/2026, 12:32:05 PM

https://github.com/6Morpheus6/Kokoro-FastAPI v3.7 updated 11/19/2025, 11:08:53 PM

A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other API-driven AI applications.

SongGeneration Studio 1/27/2026, 11:49:10 AM

https://github.com/BazedFrog/SongGeneration-Studio v3.7 updated 1/27/2026, 8:47:53 PM

AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo...

Wan2GP 1/27/2026, 11:49:07 AM

https://github.com/6Morpheus6/wan2gp v3.7 updated 1/25/2026, 6:29:25 PM

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://git...

Wan2GP 1/27/2026, 11:49:03 AM

https://github.com/pinokiofactory/wan v3.7 updated 1/28/2026, 9:41:31 AM

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://git...

aura-sr-upscaler 1/27/2026, 11:49:00 AM

https://github.com/pinokiofactory/aura-sr-upscaler v3.7 updated 1/13/2026, 4:59:11 PM

AuraSR-v2 - An open reproduction of the GigaGAN Upscaler from fal.ai https://huggingface.co/spaces/gokaygokay/AuraSR-v2

MagicQuill 1/27/2026, 11:48:52 AM

https://github.com/pinokiofactory/MagicQuill v3.7 updated 1/11/2026, 8:07:39 PM

An intelligent, interactive Image Editing System. Easily erase and add objects on a user-friendly interface.

Comfyui 1/27/2026, 11:48:48 AM

https://github.com/pinokiofactory/comfy v3.7 updated 1/14/2026, 11:37:40 AM

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

Whisper-WebUI 1/27/2026, 11:48:41 AM

https://github.com/pinokiofactory/whisper-webui v3.7 updated 1/20/2026, 11:36:49 PM

A Web UI for easy subtitle generation using the Whisper model.

Forge 1/27/2026, 11:48:36 AM

https://github.com/pinokiofactory/stable-diffusion-webui-forge v2.0 updated 1/7/2026, 1:28:44 AM

[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.co...

OpenAudio 1/27/2026, 11:48:29 AM

https://github.com/pinokiofactory/openaudio v3.7 updated 1/3/2026, 1:47:18 PM

Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaud...

Hunyuan3D-2-LowVRAM 1/27/2026, 11:48:26 AM

https://github.com/pinokiofactory/Hunyuan3d-2-lowvram v3.7 updated 12/27/2025, 8:44:51 PM

Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.co...

e2-f5-tts 1/27/2026, 11:48:20 AM

https://github.com/pinokiofactory/e2-f5-tts v3.7 updated 1/23/2026, 9:14:27 PM

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

VibeVoice Realtime 1/27/2026, 11:48:17 AM

https://github.com/pinokiofactory/vibevoice-realtime v5.0 updated 12/22/2025, 10:00:08 PM

Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B

Qwen3-TTS 1/27/2026, 11:48:09 AM

https://github.com/SUP3RMASS1VE/Qwen3-TTS-Pinokio v5.0 updated 1/27/2026, 5:41:21 PM

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team

Orpheus-TTS-FastAPI 1/27/2026, 11:48:05 AM

https://github.com/pinokiofactory/Orpheus-TTS-FastAPI v3.7 updated 1/24/2026, 11:02:12 PM

Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech s...