Makarand Tapaswi

Wadhwani AI, IIIT Hyderabad

Hi! I am an ML scientist at Wadhwani AI and an Assistant Professor in the Computer Vision group at IIIT Hyderabad, India.

At Wadhwani AI, we are developing AI solutions that create social impact. In particular, I am working on estimating the weight of newborns from a simple video, with the goal of empowering primary healthcare workers and facilities to improve the lives of at-risk, low-birth-weight babies.

At IIIT, I plan to continue working on challenging projects related to analyzing stories and human behavior, especially at the intersection of video and language understanding.

news [archives]

Jul 2021 One paper accepted to ICCV 2021! We propose in-domain, self-supervised pretraining using Airbnb listings to improve Vision-and-Language Navigation models. ArXiv coming soon …
Jul 2021 Launched new website based on the al-folio theme. Time to say goodbye to my old self-made Jinja+Python website and embrace Liquid+Jekyll!
Jul 2021 Excited to join IIIT Hyderabad as an Assistant Professor!
Jun 2021 Analyzing longer videos helps improve spatio-temporal action detection. Read more about it in our CVIU article in the Special Issue on Recent Advances in Modeling, Methodology and Applications of Action Recognition and Detection.
May 2021 Outstanding reviewer award for CVPR 2021.
May 2021 Visual Weighing Machine wins the Best World Changing Idea - APAC at Fast Company’s competition.
Dec 2020 Outstanding reviewer award for ACCV 2020.
Oct 2020 My first work on robotics accepted to CoRL 2020! We try to teach robots simple object manipulations by learning to translate videos into a 3D state space, Real2Sim.
Aug 2020 Outstanding reviewer award for ECCV 2020.
Feb 2020 One paper accepted to CVPR 2020! We show that jointly modeling interactions and relationships between movie characters improves performance on both tasks in a weakly supervised setting.