I am Kaizhi Zheng (郑恺之), a fourth-year CSE Ph.D. candidate at the University of California, Santa Cruz, working with Prof. Xin (Eric) Wang. Previously, I was an M.S. student at the University of Michigan, Ann Arbor, working with Prof. Chad Jenkins.

Research Interests

My research focuses on multimodal understanding, multimodal generation, and embodied AI. My goal is to build intelligent agents that can understand and interact with their environment. Feel free to email me if you share these interests. I would be happy to talk with you!

News

  • [2024.06] I will join Adobe Research as a full-time research intern in summer 2024.
  • [2024.02] I have joined Microsoft as a part-time research intern!
  • [2023.10] Our paper R2H has been accepted by EMNLP 2023!
  • [2023.06] I will join Samsung Research America (SRA) as a research intern in summer 2023.
  • [2023.06] Our SlugJARVIS team placed in the top three in the first Alexa Prize SimBot Challenge! [Media Coverage]
  • [2023.04] Our paper ESC has been accepted by ICML 2023!
  • [2023.03] Our Sage team has been selected to compete in the Alexa Prize Taskbot Challenge 2!
  • [2022.09] Our paper VLMbench has been accepted by NeurIPS 2022!
  • [2022.05] Our SlugJARVIS team won the Alexa Prize SimBot Public Benchmark Challenge! [Media Coverage]

Publications & Preprints

  • EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
    Kaizhi Zheng, Xiaotong Chen, Xuehai He, Jing Gu, Linjie Li, Zhengyuan Yang, Kevin Lin, Jianfeng Wang, Lijuan Wang, Xin Eric Wang
    Preprint

  • MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
    Xuehai He, Weixi Feng, Kaizhi Zheng, Yujie Lu, Wanrong Zhu, Jiachen Li, Yue Fan, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang
    Preprint

  • Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation
    Yufan Zhou, Ruiyi Zhang, Kaizhi Zheng, Nanxuan Zhao, Jiuxiang Gu, Zichao Wang, Xin Eric Wang, Tong Sun
    Preprint

  • MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens
    Kaizhi Zheng*, Xuehai He*, Xin Eric Wang
    Preprint

  • R2H: Building Multimodal Navigation Helpers that Respond to Help
    Yue Fan, Jing Gu, Kaizhi Zheng, Xin Eric Wang
    EMNLP 2023

  • ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
    Kaiwen Zhou, Kaizhi Zheng, Connor Pryor, Yilin Shen, Hongxia Jin, Lise Getoor, Xin Eric Wang
    ICML 2023

  • JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
    Kaizhi Zheng*, Kaiwen Zhou*, Jing Gu*, Yue Fan*, Jialu Wang*, Zonglin Di, Xuehai He, Xin Eric Wang
    Preprint

  • VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
    Kaizhi Zheng, Xiaotong Chen, Odest Chadwicke Jenkins, Xin Eric Wang
    NeurIPS 2022 (Datasets and Benchmarks Track)

  • Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames
    Xiaotong Chen, Kaizhi Zheng, Zhen Zeng, Shreshtha Basu, James Cooney, Jana Pavlasek, Odest Chadwicke Jenkins
    Humanoids 2022
    YouTube

  • Composable Causality in Semantic Robot Programming
    Emily Sheetz, Xiaotong Chen, Zhen Zeng, Kaizhi Zheng, Qiuyu Shi, Odest Chadwicke Jenkins
    ICRA 2022

Education

  • B.S. in EE, Huazhong University of Science and Technology, 2015-2019
  • M.S. in ECE, University of Michigan, Ann Arbor, 2019-2021
  • Ph.D. in CSE, University of California, Santa Cruz, 2021-Present