Longtao Zheng 郑龙韬

Email: longtao001 [at] e.ntu.edu.sg
GitHub: https://github.com/ltzheng

Longtao Zheng is a PhD student at the College of Computing and Data Science, Nanyang Technological University (NTU), Singapore, advised by Prof. Bo An. Previously, he obtained his Bachelor's degree in computer science from the University of Science and Technology of China (USTC) in 2022. Motivated by the goal of building open-ended agents in open-ended worlds, his research focuses on: (i) training agents to achieve long-term goals over extended periods of time, and (ii) training foundation world models through action-conditioned video generation.

Publications

Topics: Agents / Video Generation. Past topics: Multi-Agent Reinforcement Learning. (* and † indicate equal contribution.)

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Zhenghai Xue, Longtao Zheng, Qian Liu, Yingru Li, Zejun Ma, Bo An

Preprint Blog / Code

Simple trajectory filtering stabilizes multi-turn RL and lets diverse reasoning emerge

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Lang Feng, Weihao Tan, Zhiyi Lyu, Longtao Zheng, Haiyang Xu, Ming Yan, Fei Huang, Bo An

ICML 2025

Fine-tuning VLM agents with online RL

Cradle: Empowering Foundation Agents Towards General Computer Control

Cradle Team (Longtao Zheng as core contributor)

ICML 2025 Paper / Project Page / Code

An agent that can play AAA video games

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Longtao Zheng, Yifan Zhang, Hanzhong Guo, Jiachun Pan, Zhenxiong Tan, Jiahao Lu, Chuanxin Tang, Bo An, Shuicheng Yan

Preprint Paper / Project Page / Code / Model

A state-of-the-art, open-weight model for audio-driven talking video generation

AgentStudio: A Toolkit for Building General Virtual Agents

Longtao Zheng, Zhiyuan Huang, Zhenghai Xue, Xinrun Wang, Bo An, Shuicheng Yan

ICLR 2025 Paper / Project Page / Code / Data

A trinity of environments, tools, and benchmarks for general virtual agents

A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist

Wentao Zhang, Lingxuan Zhao, Haochong Xia, Shuo Sun, Jiaze Sun, Molei Qin, Xinyi Li, Yuqing Zhao, Yilei Zhao, Xinyu Cai, Longtao Zheng, Xinrun Wang, Bo An

KDD 2024 Paper

The first multimodal agent for financial trading

Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control

Longtao Zheng, Rundong Wang, Xinrun Wang, Bo An

ICLR 2024 Paper / Project Page / Code

A computer agent with state abstraction, trajectory prompting, and memory

True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning

Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An

ICLR 2024 Paper / Code

Fine-tuning LLM agents with online RL

Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification

Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan

ICML 2023 Paper

A causality-based solution to type confounding in ad hoc teamwork

Multi-Agent Multi-Game Entity Transformer: Towards Generalist Models in MARL

Rundong Wang, Weixuan Wang, Xianhan Zeng, Liang Wang, Zhengjie Lian, Yiming Gao, Feiyu Liu, Siqin Li, Xianliang Wang, Qiang Fu, Wei Yang, Lanxiao Huang, Longtao Zheng, Zinovi Rabinovich, Bo An

DAI 2024 (Best Paper Award) Paper

A generalist transformer for Honor of Kings, StarCraft II, and Neural MMO

Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning

Rundong Wang, Longtao Zheng, Wei Qiu, Bowei He, Bo An, Zinovi Rabinovich, Yujing Hu, Yingfeng Chen, Tangjie Lv, Changjie Fan

Preprint Paper / Code

Autocurricula for MARL in complex sparse-reward environments like Google Football