Sanjay Suryanarayanan
Research Associate @ AI4Bhārat, IIT Madras (Dept. of Data Science & AI)
AI4Bhārat Lab
Indian Institute of Technology, Madras
Chennai, Tamil Nadu, India
Hello · नमस्ते · வணக்கம் · ನಮಸ್ಕಾರ !
I’m Sanjay, a Research Associate at the AI4Bhārat Lab at IIT Madras, where I am advised by Dr. Mitesh Khapra, Dr. Raj Dabre, and Dr. Anoop Kunchukuttan.
My current research focuses on two major domains: (1) Machine Translation, and (2) Multilingual and Multimodal Language Modeling. Additionally, I am exploring Mechanistic Interpretability and Reinforcement Learning as complementary directions within these domains. Previously, I was a Research Intern at AI4Bhārat, where I worked on building large-scale data infrastructure to create, curate, and clean Indic language data for training LLMs. I am fluent in English, Hindi, Tamil, and Kannada; being a polyglot naturally complements my work in multilingual NLP!
Beyond research, I’m a curious soul with wide-ranging interests (jack of all trades, master of some). I’m passionate about Sports, Cars, Cinema, and Music, and I love studying Mathematics, Computer Science, Physics, Economics, Spirituality, Philosophy, Psychology, and, yes, a decillion other things!
I am actively seeking work and research opportunities where I can apply my skills to solve complex real-world problems. I am also eager to collaborate with researchers and research groups. If you share similar interests or have ideas for collaboration, please feel free to reach out to me via Email, LinkedIn, or X!
news
| Date | Update |
|---|---|
| Dec 12, 2025 | I will be attending AACL 2025 at IIT Bombay, India, from Dec 20 to 24 and will be presenting our poster on Dec 21 from 4:00 PM to 5:30 PM at VMCC (Session MC-PP4: Multilingual NLP and Machine Translation). Feel free to drop by to chat and learn more about our work. Resources are available here. |
| Oct 25, 2025 | Happy to share that our work “Pralekha: Cross-Lingual Document Alignment for Indic Languages” has been accepted to the main conference of AACL 2025! |
| Sep 01, 2025 | Pralekha is featured in the English–Indic Language Document Translation Task at WAT 2025. |
| Nov 27, 2024 | Our work “Pralekha: Cross-Lingual Document Alignment for Indic Languages” is out on arXiv. |
| Oct 24, 2024 | Honoured to host Dr. Yann LeCun at AI4Bhārat, IIT Madras. His visit was truly inspiring, with insightful discussions on our open-source efforts to advance AI for India. |