Ayush Gupta
I am a Ph.D. student in the Department of Computer Science at Johns Hopkins University, where I work in the AIEM lab. I am advised by Prof. Rama Chellappa and work on problems in computer vision and machine learning.
My research has two focus areas: general-purpose vision-language models, where I work on multimodal LLMs across a wide variety of tasks, and fine-grained computer vision problems, where I work on person re-identification and gait recognition.
I am supported by an IARPA grant, BRIAR.
Previously, I obtained a B.E. in Computer Science from Birla Institute of Technology and Science (BITS), Pilani. At BITS Pilani, I worked under the guidance of Prof. Poonam Goyal on generating natural language descriptions of videos and collaborated with Dr. Yogesh S Rawat from UCF on gait recognition.
Email / CV / Google Scholar / Twitter / GitHub
TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision
Ayush Gupta, Anirban Roy, Rama Chellappa, Nathaniel D. Bastian, Alvaro Velasquez, Susmit Jha
under submission
We use multimodal LLMs for temporal grounding of question-answer pairs in unconstrained videos.
You Can Run but not Hide: Improving Gait Recognition with Intrinsic Occlusion Type Awareness
Ayush Gupta, Rama Chellappa
WACV 2024 (Oral)
We use an auxiliary occlusion detector to address the occlusion problem in long-range gait recognition.
Project Website /
arXiv
MimicGait: A Model-Agnostic Approach for Occluded Gait Recognition using Correlational Knowledge Distillation
Ayush Gupta, Rama Chellappa
WACV 2025
We tackle the occlusion problem in gait recognition using correlational knowledge distillation.
Project Website /
arXiv
GaitContour: Efficient Gait Recognition based on a Contour-Pose Representation
Yuxiang Guo, Anshul Shah, Jiang Liu, Ayush Gupta, Cheng Peng, Rama Chellappa
WACV 2025
We develop a novel, efficient contour-based representation for gait recognition.
arXiv
Transfer Learning for Frailty Classification in Older Adults
Laura McDaniel, Ayush Gupta*, Ime Essien, Ryan Roemmich, Peter Abadir, Rama Chellappa
under submission
We use computer vision techniques to diagnose frailty among older adults.
EchoSAM: Predicting Ejection Fraction using Segmentation Guided Vision Transformers
Basudha Pal, Ayush Gupta, Vishal Patel
We predict the ejection fraction from ultrasound images of the heart using the Segment Anything Model.
Tackling Domain Shifts in Person Re-Identification: A Survey and Analysis
Vuong Nguyen, Samiha Mirza, Abdollah Zakeri, Ayush Gupta, Rahma Aloui, Khadija Khaldi, Pranav Mantini, Shishir Shah, Fatima Merchant
CVPR 2024 Continual Learning Workshop
A survey of domain shift in person re-identification.
Paper
GaitZero: Temporal Self-similarity for Unsupervised Gait Recognition
Ayush Gupta, Alexander Matasa, Shruti Vyas, Yogesh S Rawat
We develop a novel technique for unsupervised gait recognition using temporal self-similarity.
Visually Guided Knowledge selection for Video Captioning
Ayush Gupta*, Ashrya Agrawal*, Poonam Goyal, Navneet Goyal
An approach for generating natural language captions of videos using external knowledge bases.
Paper