My Projects
Building innovative AI solutions that push the boundaries of what's possible
DIS-Vector
Open-source voice intelligence framework for speaker embeddings, emotional features, and prosodic patterns from few seconds of audio.
PyTorchWhisperXTTS+2
Real-time Conversational AI Agent
End-to-end conversational AI with streaming ASR, LLM reasoning, and neural TTS for real-time natural conversations.
PythonFastAPIWebSockets+3
RAG-Powered Knowledge Assistant
Advanced RAG system with hybrid search (dense + sparse) and agentic reasoning for accurate document retrieval.
LangChainLlamaIndexQdrant+2
Multi-modal LLM Interface
Unified interface for text, image, and audio LLM interactions with real-time streaming.
Next.jsPyTorchStable Diffusion+2
Voice Cloning Studio
Professional voice cloning and synthesis platform supporting 20+ languages and emotions.
ReactFastAPIXTTS+2
ML Optimization Suite
Advanced optimization algorithms including GD variants with convergence analysis.
PythonNumPyMatplotlib+1