Sharut Gupta

I am a third year Ph.D student at MIT CSAIL advised by Prof. Phillip Isola and Prof. Stefanie Jegelka. Prior to this, I earned my Bachelor's and Master's degree (Dual) from the Indian Institute of Technology, Delhi (IIT Delhi) studying Mathematics and Computer Science. During this time, I had the privilege of being mentored by Prof. Yoshua Bengio for my thesis. I have also been fortunate to spend time at FAIR (Meta AI), Google DeepMind and Microsoft.

My research interests broadly lie in Machine Learning, specifically around multi-modal representation learning, robustness and out-of-distribution generalization. I am always happy to discuss new research directions and am open to both collaborating/advising students (not restricted to MIT). So if you're interested to work on these topics, feel free to reach out to me!

What's New

[02/2025] Excited to join MIT’s CSAIL Alliances podcast to discuss my recent work—listen here
[01/2025] Our work on Learning Disentangled Multimodal Representations got accepted at ICLR 2025!
[12/2024] Our NeurIPS'24 work, In-Context Symmetries was featured by MIT News!
[09/2024] Our work, In-Context Symmetries got accepted at NeurIPS 2024! Short version will appear as Oral at NeurIPS'24 SSL Workshop.
[09/2024] Our recent paper, Understanding the Role of Equivariance in Self-supervised Learning got accepted at NeurIPS 2024!
[08/2024] Third time's a charm! Co-brewing the ML Tea Seminar once more. Curious to learn about latest work in ML, sign up here!
[07/2024] Serving on the executive committee for the MIT Graduate Application Assistance Program (GAAP). Sign up here!
[06/2024] Thrilled to be interning at in the Gemini Team at Google DeepMind this summer with Dilip Krishnan!
[04/2024] Organizing the WiDS Cambridge Datathon 2024, happening on April 27th 2024. Sign up here if interested.
[01/2024] Our recent paper, Context is Environment got accepted at ICLR 2024!
[01/2024] Our paper, Structuring Representation Geometry with Rotationally Equivariant Contrastive Learning got accepted at ICLR 2024!
[01/2024] Our work on Removing Biases from Molecular Representations via Information Maximization got accepted at ICLR 2024!
[01/2024] Back at it! Co-brewing the ML Tea Seminar once again. If you’re ready to present your work or catch the latest ideas, sign up here!
[08/2023] Co-organizing ML Tea Seminar. Want to share your work or keen to hear latest ideas, join us by signing up here!
[06/2023] Find me in Paris, interning at Meta AI with David Lopez-Paz and Kartik Ahuja.
[03/2023] Organizing the WiDS Cambridge Datathon. Video coverage can be found here.
[08/2022] I have officialy started my PhD at MIT!

Publications

Google Scholar

Gemini at Google DeepMind
Sharut Gupta, Jarred Barber, Dilip Krishnan

U.S. Patent

Developed an alternative approach to the traditional chain-of-thought-based reasoning, enhancing the reasoning capabilities of large language models by around 18% over real-world reasoning benchmarks

In-Context Symmetries: Self-Supervised Learning through Contextual World Models
Sharut Gupta*, Chenyu Wang*, Yifei Wang*, Tommi Jaakkola, Stefanie Jegelka

NeurIPS 2024 Oral @SSL MIT News MIT Podcast

Oral Presentation (top 4) at NeurIPS'24 SSL

An Information Criterion for Controlled Disentanglement of Multimodal Data
Chenyu Wang*, Sharut Gupta*, Xinyi Zhang, Sana Tonekaboni, Stefanie Jegelka, Tommi Jaakkola, Caroline Uhler

ICLR 2025 Oral @UniReps Honorable Mention

Oral Presentation (top 4) and the Honorable Mention Award at NeurIPS'24 UniReps

Understanding the Role of Equivariance in Self-supervised Learning
Yifei Wang, Kaiwen Hu, Sharut Gupta, Ziyu Ye, Yisen Wang, Stefanie Jegelka

NeurIPS 2024

Also at ICML'24 TF2M

Context is Environment
Sharut Gupta, Stefanie Jegelka, David Lopez-Paz, Kartik Ahuja

ICLR 2024

Also at NeurIPS'23 DistShift, NeurIPS'23 R0-FoMo

Structuring Representation Geometry with Rotationally Equivariant Contrastive Learning
Sharut Gupta*, Joshua Robinson*, Derek Lim, Soledad Villar, Stefanie Jegelka

ICLR 2024

Also at ICML'23 TAG-ML, NeurIPS'23 SSL

Removing Biases from Molecular Representations via Information Maximization
Chenyu Wang, Sharut Gupta, Caroline Uhler, Tommi Jaakkola

ICLR 2024

Also at NeurIPS'23 New Frontiers of AI for Drug Discovery and Development

Near-Optimal Algorithms for Group Distributionally Robust Optimization and Beyond
Tasuku Soma, Khashayar Gatmiry, Sharut Gupta, Stefanie Jegelka
arXiv

Collaborative privacy-preserving approaches for distributed deep learning using multi-institutional data
Sharut Gupta, Sourav Kumar, Ken Chang, Charles Lu, Praveer Singh, Jayashree Kalpathy-Cramer

RSNA RadioGraphics 2023

Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation
Farshid Varno, Laya Rafiee, Sharut Gupta, Stan Matwin, Mohammad Havaei

ECCV 2022

FL Games: A federated learning framework for distribution shifts
Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

NeurIPS FL 2022 SPOTLIGHT

Addressing catastrophic forgetting for medical domain expansion
Sharut Gupta, Praveer Singh, Ken Chang, Liangqiong Qu et al.

NeurIPS ML4H 2022 SPOTLIGHT

More Publications

Work Experience

Student Researcher, Google DeepMind (Gemini)
Manager: Dilip Krishnan
May 2024 - Sept 2024

Research Intern, Meta AI (FAIR Labs)
Managers: David Lopez-Paz, Kartik Ahuja
May 2023 - Aug 2023

Research Intern, Mila - Quebec AI Institute
Host: Prof. Yoshua Bengio
Sept 2021 - May 2022

Software Eng. Intern, Google Research
Manager: Sriram Lakshminarasimhan
May 2021 - July 2021

Research Intern, Microsoft Research and Development
Manager: Mithun Das Gupta
May 2020 - July 2020

Research Intern, Harvard University (QTIM)
Host: Prof. Jayashree Kalpathy-Cramer
Jan 2020 - May 2021

Research Intern, INRIA
Hosts: Prof. Paul Muhlethaler, Prof. Soumya Banerjee
May 2019 - July 2019

Research Outreach and Leadership

Talks

MIT CSAIL Embodied Intelligence Seminar

2024

NeurIPS Workshop on Federated Learning: Recent Advances and New Challenges

2022

TAG-DS, The Pacific Northwest Seminar on Topology, Algebra, and Geometry in Data Science

2024

The Quantitative Translational Imaging in Medicine (QTIM)

2024

MIT Machine Learning (ML) Tea Seminar series

2024

MIT LIDS and STATS Tea Talks

2024

Research Outreach

WiDS Cambridge Datathon | co-organizer of the WiDS Cambridge Datathon as a part of the global WiDS Conference Datathon

2023-present

TILOS AI Institute | co-organizer of the TILOS Social at NeurIPS'23 and NeurIPS'24

2023-present

ML Tea, MIT | co-organizer of ML Tea, a weekly seminar series from members of the machine learning community around MIT

2023-present

The Gradient | editor for biweekly newsletter covering recent AI news and research at the Gradient substack.

2023-present

NeurIPS | recieved the volunteer award to help organize the NeurIPS conference.

2022

Teaching

Graduate Teaching Assistant, Deep Learning at MIT

Course information can be found here

2024

Instructor, Mysteries of the Hilbert's Hotel, Splash MIT

Taught a class on the Mysteries of Hilbert's Infinite Hotel ("Room" for thought!) to a class of 100 high school students. The slides for the class are available here

2023

Undergraduate Teaching Assistant, Differential Equations at IIT Delhi

Systems of differential equations, Existence and uniqueness theorems for initial value problems of semilinear and nonlinear ODEs, continuous dependence and well-posed ness; Comparison theorems of Sturms, Sturm-Liouville eigenvalue problems; Phase-plane analysis, Linear and Non-linear stability, Liapunov functions and applications;First order Partial differential equations, Method of characteristics, local and global solutions, envelop of solutions, complete and general solutions; Second order equations: Heat and Wave equation, fundamental solutions, method of eigenfunctions, Duhamel’s principle. Maximum priciples for Heat and Laplace equation,Greens functions.

2021

Undergraduate Teaching Assistant, Analysis and Design of algorithms at IIT Delhi

Models of computation: RAM and Turing Machines; Algorithm Analysis techniques; Basic techniques for designing algorithms: dynamic programming, divide-and-conquer and Greedy; DFS , BFS and their applications; Some Basic Graph Algorithms; linear time sorting algorithms; NP-Completeness and Approximation Algorithms.

2021

Undergraduate Teaching Assistant, Probability and Stochastic Processes at IIT Delhi

Axioms of probability, Probability space, Conditional probability, Independence, Bayes’ rule, Random variable, Some common discrete and continuous distributions, Distribution of Functions of Random Variable, Moments, Generating functions, Two and higher dimensional distributions, Functions of random variables, Order statistics, Conditional distributions, Covariance, Correlation coefficient, conditional expectation, Modes of convergences, Laws of large numbers, Central limit theorem, Definition of Stochastic process, Classification and properties of stochastic processes, Simple Markovian stochastic processes, Gaussian processes, Stationary processes, Discrete and continuous time Markov chains, Classification of states, Limiting distribution, Birth and death process, Poisson process, Steady state and transient distributions, Simple Markovian queuing models (M/M/1, M/M/1/N, M/M/c/N, M/M/N/N, M/M/∞).

2020

Reviewing Service

ICLR | 3 papers

2025

NeurIPS | 6 papers

2024

Machine Learning for Health (ML4H) Symposium | 4 papers

2024

Symmetry and Geometry in Neural Representations (NeurReps), NeurIPS | 3 papers

2024

Geometry-grounded Representation Learning and Generative Modeling (GRaM), ICML | 3 papers

2024

Workshop on In-Context Learning (ICL), ICML | 2 papers

2024

Theoretical Foundations of Foundation Models (TF2M), ICML | 2 papers

2024

Robustness of Few-shot and Zero-shot Learning in Large Foundation Models (R0-FoMo), NeurIPS | 2 papers

2023

Social Engagements & Leadership

Executive Team, MIT EECS Graduate Application Assistance Program (GAAP)

The Graduate Application Assistance Program (GAAP) is a student-run initiative offered by PhD students in the MIT EECS department. It pairs applicants with current student volunteers, who mentor them 1:1 through the graduate application process, meeting periodically with applicants all the way up to the deadline. More information here.

2024

Session Chair, ACM SIGKDD

Chaired the session on Responsible AI as a part of 'Data Science in India', an ACM SIGKDD India Chapter event. More information about this event is available here.

2022

Deputy General Secretary Mentorship, Board for Student Welfare (BSW), IIT Delhi

Initialised an auxiliary program to tackle crucial issues of substance abuse, intellectual plagiarism and language issues
Co-established the Office of Accessible Education (OAE) providing special assistance for the disabled community
Founded research mentorship and journal club at IIT Delhi, which is dedicated towards fostering student research

More information about the board is available here.

2020-2021

Student Representative, IIT Delhi Strategy and Vision Document 2030 Implementation Committee

The IIT Delhi Endowment Fund was launched by the Honourable President of India in 2019, backed by an initial commitment of INR 250 crore by alumni, with a stated goal of raising USD 1 billion over a period of time. Subsequently, IIT Delhi initiated the development of its vision and direction for 2030, with a focus to build and use the Endowment Fund towards achieving these goals. More details can be found here.

2020-2021

Core Team Member, Initiative for Gender Equity and Sensitisation (IGES), IIT Delhi

Initiative for Gender Equity and Sensitisation (IGES), under Indian Institute of Technology, Delhi, aims to create a safe and violence-free educational atmosphere for all, irrespective of diversities in identities of gender, sex, caste, class, ethnicity, language, race, disability and sexual orientation. IGES also advocates a zero tolerance policy against sexual harassment. More information can be found here.

2020-2021

Captain, National Baseball Championship

2016

Creative Coding

Disclaimer: I'm a novice to generative art, and I'm still finding my feet. But I think I’m learning and having fun. In case you find something cool and interesting, or are looking forward to collaborating, please feel free to get in touch with me! It would really mean a lot :)
Here are a few of my attempts at generative art using p5js.

Previous Next
Craters' On The Moon

The artwork represents the terrain of moon. Each particle on this terrain has a variable life, post which it fades off and dies. The motion of each particle is constructed using 2D Perlin Noise.
A Scenic Paradise

The artwork represents an evening at the Newport Beach a.k.a Easton's Beach in Newport, Rhode Island. The indivisible smallest unit used for constructing this resembles a pinecone geometry.
Previous Next
Tesseract

The artwork depicts the beauty of lines bent across various angles to create masterpieces of kaleidoscopes. A continous outward spiral can also be created from a bunch of straight lines.
Loss Landscape

A representation of Loss Landscape of Neural Networks. It is constructed using a Triangle element, populated across a terrain generated using 2D Perlin Noise. Color of each element is based on height of the terrain.
Previous Next
Alone in Crowd: A Silent Pandemic

The artwork depicts the power of tiling. It represents the loneliness associated with urban living owing to acute dependency on gadgets, materialistic desires and instant gratification etc.
Previous Next
Cubic Beauty

The artwork shows squares. The constructing unit of this artwork is a 2D projection of a cube across its longest diagonal. This unit is translated using a cosine function and the distance from left end of the window.
Algal Bloom

The depiction of the growth and accumulation in the population of algae. Each algal unit is represented by a filled bezier curve. The motion is generated using 2D Perlin Noise and basic shapes.
Previous Next
Dark Rooms

The artwork depicts a house full of infinitely long dark rooms. The building blocks of this work are basic squares which are translated with varying gaps as we move inside a room.
Random Walk

The artwork depicts random walk whereby a particle takes a random step generated by incrementing both x and y by a random number in -1 to 1. Neighbouring colors are chosen to ensure closeness in RGB space.

Projects

3D Rendered Ping-Pong Game

IIT Delhi - Computer Vision

Project Partner: Harkirat Singh Dhanoa

Using multiple views of chessboard, estimated the camera caliberation matrix
Rendered a 3D augmented reality object (ball and animals) over the chessboard
Used video input from webcam, two visual markers as paddles reflecting the ball off the plane using laws of reflection

October 2019 - November 2019

CovidNet: Segmenting COVID-19 abnormalities

QTIM Lab - Deep Learning

Developed a CT segmentation algorithm that estimates the extent of abnormality in chest CTs from COVID-19 patients
Achieved a dice score of 0.71 on the test set with Intra-Class Correlation and Spearman coefficient as 0.99 and 0.98

March 2020 - March 2020

My Exam Scribe

International Women’s Hackathon - Software Development

Project Partner: Sakshi Taparia

Project Presentation:

Built a mobile application on top of the Google Assistant using Dialogflow, Webhook and Firebase Cloud Database
Enabled visually impaired to write exams without the use of human scribes by reading questions and storing answers

March 2019 - May 2019

Sharut Gupta

What's New

Publications

Google Scholar

Work Experience

Research Outreach and Leadership

Talks

Research Outreach

Teaching

Reviewing Service

Social Engagements & Leadership

Creative Coding

Craters' On The Moon

A Scenic Paradise

Tesseract

Loss Landscape

Alone in Crowd: A Silent Pandemic

Cubic Beauty

Algal Bloom

Dark Rooms

Random Walk

Projects

3D Rendered Ping-Pong Game

CovidNet: Segmenting COVID-19 abnormalities

My Exam Scribe