Publications
Research papers, technical reports, and open-source contributions
DIS-Vector: A Framework for Voice Intelligence and Speaker Embedding Extraction
Open-source Project 2024Citations: 5
A comprehensive framework for extracting speaker embeddings, emotional features, and prosodic patterns from limited audio samples, enabling zero-shot voice conversion and few-shot speaker adaptation.
Real-time Conversational AI: Integrating Streaming ASR, LLM Reasoning, and Neural TTS
Technical Report 2024Citations: 3
Architecture and implementation of end-to-end conversational AI systems with multilingual support, context persistence, and emotion-aware response generation.
Advanced RAG Systems: Hybrid Search and Agentic Reasoning
Technical Report 2024Citations: 2
Novel approaches to retrieval-augmented generation combining dense and sparse retrieval with multi-document reasoning for enhanced accuracy.