Makarand Tapaswi

Hi! I am a Principal Machine Learning Scientist at Wadhwani AI, a non-profit on using AI for Social Good, and an Assistant Professor at the Computer Vision group at IIIT Hyderabad, India.
At Wadhwani AI, we are developing AI solutions that create social impact. In particular, I work on several projects in education and MNCH (maternal, newborn, and child health).
At IIIT, I continue to work on projects at the intersection of video and language understanding, especially related to analyzing stories.
news [archives]
Jan 2025 | Are LLMs good at resolving coreference between people in complicated stories? Our benchmark paper, IdentifyMe, accepted to NAACL 2025 indicates not! arXiv |
---|---|
Jan 2025 | Our paper on building generalizable models to detect pathologies in Chest X-rays is accepted to ISBI 2025! The extended paper is on arXiv. |
Dec 2024 | Our paper on studying the properties of a container simply by listening to sounds of pouring water is accepted to ICASSP 2025! Check out the project page or the extended paper on arXiv. |
Dec 2024 | Congratulations to Darshan for successfully defending his thesis and completing the MS by Research program! Through difficult times, Darshan has persevered and produced some of the best work of our group. |
Dec 2024 | Our paper on fine-grained image captioning is accepted to TMLR! Important work that reveals a lot about image captioning systems with a lot of interesting findings. Check out the project page, arXiv, or Manu’s twitter thread. |
Oct 2024 | Our paper on predicting a video’s memorability and exploration of where humans and models look while predicting memorability of a video is accepted to WACV 2025! arXiv This is our first collaboration with Dr. Vishnu Sreekumar’s group that does interesting work on memory! |
Sep 2024 | Our paper on Major Entity Identification is accepted to EMNLP 2024! arXiv. |
Sep 2024 | Happy to be serving as Area Chair for CVPR 2025! |
Jun 2024 | Visiting Seattle for CVPR. Giving a talk at the workshop on What is Next in Video Understanding?, slides here. Also visiting Allen AI and happy to talk about our lab’s work at Apple and Amazon Prime Video. |
Jun 2024 | Proud advisor moment. Haran was featured in RSIP vision’s CVPR Daily Friday edition magazine as an undergrad presenting a paper at CVPR! Congratulations Haran!! We have great students at IIIT Hyderabad and it is a pleasure to walk them through their first research experience! |