Ayush Gupta

I am a Ph.D. student in the Department of Computer Science at Johns Hopkins University, where I am a member of the AIEM lab. I am advised by Prof. Rama Chellappa and work on problems in computer vision and machine learning. My research has two focus areas: general-purpose vision-language models, where I work on multimodal LLMs across a wide variety of tasks, and fine-grained computer vision problems, where I work on person re-identification and gait recognition. I am supported by BRIAR, an IARPA grant.

Previously, I obtained a B.E. in Computer Science from Birla Institute of Technology and Science (BITS), Pilani. At BITS Pilani, I worked under the guidance of Prof. Poonam Goyal on generating natural language descriptions of videos, and collaborated with Dr. Yogesh S Rawat from UCF on gait recognition.

Email / CV / Google Scholar / Twitter / GitHub

Publications
TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision

Ayush Gupta, Anirban Roy, Rama Chellappa, Nathaniel D. Bastian, Alvaro Velasquez, Susmit Jha
under submission

We use multimodal LLMs for temporal grounding of question-answer pairs in unconstrained videos.

You Can Run but not Hide: Improving Gait Recognition with Intrinsic Occlusion Type Awareness

Ayush Gupta, Rama Chellappa
WACV 2024 (Oral)

We use an auxiliary occlusion detector to address the occlusion problem in long-range gait recognition.

Project Website / arXiv
MimicGait: A Model-Agnostic Approach for Occluded Gait Recognition using Correlational Knowledge Distillation

Ayush Gupta, Rama Chellappa
WACV 2025

We tackle the occlusion problem in gait recognition using correlational knowledge distillation.

Project Website / arXiv
GaitContour: Efficient Gait Recognition based on a Contour-Pose Representation

Yuxiang Guo, Anshul Shah, Jiang Liu, Ayush Gupta, Cheng Peng, Rama Chellappa
WACV 2025

We develop a novel, efficient contour-based representation for gait recognition.

arXiv
Transfer Learning for Frailty Classification in Older Adults

Laura McDaniel, Ayush Gupta*, Ime Essien, Ryan Roemmich, Peter Abadir, Rama Chellappa
under submission

Using computer vision techniques to diagnose frailty among older adults.

EchoSAM: Predicting Ejection Fraction using Segmentation Guided Vision Transformers

Basudha Pal, Ayush Gupta, Vishal Patel

Predicting the ejection fraction from ultrasound images of the heart using the Segment Anything Model.

Tackling Domain Shifts in Person Re-Identification: A Survey and Analysis

Vuong Nguyen, Samiha Mirza, Abdollah Zakeri, Ayush Gupta, Rahma Aloui, Khadija Khaldi, Pranav Mantini, Shishir Shah, Fatima Merchant
CVPR 2024 Continual Learning Workshop

A survey on domain shift in Person Re-ID.

Paper
GaitZero: Temporal Self-similarity for Unsupervised Gait Recognition

Ayush Gupta, Alexander Matasa, Shruti Vyas, Yogesh S Rawat

Developing a novel technique for unsupervised gait recognition using temporal self-similarity.

Visually Guided Knowledge selection for Video Captioning

Ayush Gupta*, Ashrya Agrawal*, Poonam Goyal, Navneet Goyal

An approach for generating natural language captions of videos using external knowledge bases.

Paper


Template credits: Jon Barron