My Projects

Building innovative AI solutions that push the boundaries of what's possible

DIS-Vector

Open-source voice intelligence framework that extracts speaker embeddings, emotional features, and prosodic patterns from a few seconds of audio.

PyTorch, Whisper, XTTS, +2 more

Real-time Conversational AI Agent

End-to-end conversational AI with streaming ASR, LLM reasoning, and neural TTS for real-time natural conversations.

Python, FastAPI, WebSockets, +3 more

RAG-Powered Knowledge Assistant

Advanced RAG system with hybrid search (dense + sparse) and agentic reasoning for accurate document retrieval.

LangChain, LlamaIndex, Qdrant, +2 more
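
One common way to combine dense and sparse results in a hybrid search is reciprocal rank fusion (RRF). The sketch below is only an illustration of that idea, not necessarily how this project merges its results: the function name, the constant `k=60`, and the document IDs are all assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document earns 1 / (k + rank) from every list it appears in,
    so documents ranked well by both retrievers rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a dense (embedding) and a sparse (keyword) retriever.
dense_hits = ["doc_a", "doc_b", "doc_c"]
sparse_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
print(fused)
```

RRF uses only rank positions, so it avoids having to normalize dense similarity scores against sparse keyword scores, which live on different scales.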

Multi-modal LLM Interface

Unified interface for text, image, and audio LLM interactions with real-time streaming.

Next.js, PyTorch, Stable Diffusion, +2 more

Voice Cloning Studio

Professional voice cloning and synthesis platform supporting 20+ languages and emotions.

React, FastAPI, XTTS, +2 more

ML Optimization Suite

Advanced optimization algorithms, including gradient descent (GD) variants, with convergence analysis.

Python, NumPy, Matplotlib, +1 more
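
As a rough illustration of what comparing GD variants by their convergence can look like, the sketch below runs plain gradient descent and a momentum variant on a small quadratic and records the loss at every step. The test function, step size, momentum coefficient, and all names are assumptions chosen for the example, not code taken from the suite.

```python
def loss(p):
    """Ill-conditioned quadratic bowl: 0.5 * (x^2 + 10 * y^2)."""
    x, y = p
    return 0.5 * (x * x + 10.0 * y * y)


def grad(p):
    """Analytic gradient of the quadratic above."""
    x, y = p
    return (x, 10.0 * y)


def descend(start, lr=0.1, beta=0.0, tol=1e-6, max_iters=500):
    """Gradient descent with optional heavy-ball momentum.

    beta=0.0 gives plain gradient descent. Returns the loss after every
    step so the convergence behaviour can be inspected or plotted.
    """
    p = list(start)
    v = [0.0, 0.0]  # velocity (running sum of decayed gradients)
    history = [loss(p)]
    for _ in range(max_iters):
        g = grad(p)
        for i in range(2):
            v[i] = beta * v[i] + g[i]
            p[i] -= lr * v[i]
        history.append(loss(p))
        if history[-1] < tol:
            break  # converged to within the tolerance
    return history


gd_losses = descend((1.0, 1.0))
momentum_losses = descend((1.0, 1.0), beta=0.9)

print("plain GD steps:", len(gd_losses) - 1)
print("momentum steps:", len(momentum_losses) - 1)
```

The recorded loss histories can be plotted on a log scale to compare how quickly each variant approaches the minimum under a given step size.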