Research Fellow at Trinity College Dublin, Ireland
November, 2024 - Present
1. Conduct cutting-edge research and development in the field of multimodal large language models to develop tools for gesture recognition, audio-visual modalities, hyper/hypo speech etc.
2. Develop and implement state-of-the-art techniques in audio-visual speech recognition, enabling seamless integration of speech and visual inputs for robust performance across diverse environments.
3. Collaborate with cross-functional teams to design, train, and optimize large-scale multimodal language models, leveraging diverse datasets and advanced machine learning algorithms.