profile photo

Zhehui Huang

I am a final-year Computer Science Ph.D. student at the University of Southern California (USC), advised by Prof. Gaurav Sukhatme.

I received my master's degree in Computer Science from USC and my bachelor's degree in Computer Science from Harbin Institute of Technology.


News

[Jan 2026] LAE got accepted to ICRA 2026.
[Sept 2025] LAN2CB got accepted to MRS 2025.
[Aug 2025] Started working as a Student Researcher at Google.
[July 2025] DSBench was selected as an evaluation benchmark for OpenAI's most advanced LLM model o3 and their first agent ChatGPT Agent to evaluate their reasoning and coding abilities.
[June 2025] Successfully co-organized Resource Constrained Robotics Workshop at RSS 2025.
[Jan 2025] DSBench got accepted to ICLR 2025.
[Dec 2024] HRT-ML got accepted to WMAC @ AAAI 2025.
[Dec 2024] MonTA got accepted to LM4Plan @ AAAI 2025.
[Sept 2024] LLMs for Robot Routing got accepted to ISRR 2024.
[May 2024] Started internship at Tencent America.
[Mar 2024] Received AI Research Grant from Cohere.
[Jan 2024] Two papers got accepted to ICRA 2024. [Paper #1] and [Paper #2]
[Nov 2023] Gave a talk at USC Robotics Seminar (URoS).
[Apr 2023] QuadSwarm got accepted to ICRA 2023 Workshop: The Role of Robotics Simulators for Unmanned Aerial Vehicles.
[Mar 2023] Passed qualifying exam.
[Dec 2022] Received $70,000 AWS cloud credit for research.
[May 2022] Started internship at NVIDIA.
[Sept 2021] Decentralized Control of Quadrotor Swarms got accepted to CoRL 2021.
[Sept 2021] Received $43,000 AWS cloud credit for research.
[Aug 2021] Started Ph.D. at USC.

Research

My research aims to develop intelligent agents that can robustly and safely perform complex tasks in unstructured environments by autonomously adapting to new situations through unsupervised and continual learning. To achieve this, my research spans three interconnected areas: (1) reinforcement learning, (2) robot learning, and (3) foundation models.

Reinforcement Learning (RL):

Robot Learning: Foundation Models:

Publications

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

TL;DR: Comprehensive benchmark for evaluating data science agents with realistic tasks, bridging the gap between simplified settings and real-world data science applications.

πŸ† Selected as evaluation benchmark for OpenAI's o3 model and ChatGPT Agent. Source: OpenAI Blog

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

TL;DR: Open-source modular simulator enabling realistic quadrotor swarm experimentation with direct thrust control for deep RL research.