publication

*: Equal contribution, #: Corresponding author

2025

  1. Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing IROS'25 Oral
    Jun Zhu, Zihao Du, Haotian Xu, Fengbo Lan, Zilong Zheng, Bo Ma, Shengjie Wang, and Tao Zhang, in IROS, 2025.
  2. In-situ Value-aligned Human-Robot Interactions with Physical Constraints IROS'25 Oral
    Hongtao Li, Ziyuan Jiao, Xiaofeng Liu#, Hangxin Liu#, and Zilong Zheng#, in IROS, 2025.
  3. Xuekai Zhu, Daixuan Cheng, Hengli Li, Kaiyan Zhang, Ermo Hua, Xingtai Lv, Ning Ding, Zhouhan Lin#, Zilong Zheng#, and Bowen Zhou#, in ICML, 2025.
  4. Xinyue Zheng*, Haowei Lin*, Kaichen He, Zihao Wang, Zilong Zheng#, and Yitao Liang#, in ICML, 2025.
  5. Tong Wu, Junzhe Shen, Zixia Jia, Yuxuan Wang, and Zilong Zheng#, in ICML, 2025.
    GitHub Repo stars
  6. Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs ICLR'25
    Zhaowei Zhang, Fengshuo Bai, Qizhi Chen, Chengdong Ma, Mingzhi Wang, Haoran Sun, Zilong Zheng#, and Yaodong Yang#, in ICLR, 2025.
  7. In-Context Editing: Learning Knowledge from Self-Induced Distributions ICLR'25
    Siyuan Qi#, Bangcheng Yang, Kailin Jiang, Xiaobo Wang, Jiaqi Li, Yifan Zhong, Yaodong Yang, and Zilong Zheng#, in ICLR, 2025.
  8. MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge ICLR'25
    Yuntao Du*, Kailin Jiang*, Zhi Gao, Chenrui Shi, Zilong Zheng#, Siyuan Qi, and Qing Li#, in ICLR, 2025.
  9. Probing and Inducing Combinational Creativity in Vision-Language Models CogSci'25 Oral
    Yongqian Peng*, Yuxi Ma*, Mengmeng Wang, Yuxuan Wang, Yizhou Wang, Chi Zhang, Yixin Zhu#, and Zilong Zheng#, in CogSci, 2025.
  10. Yuxuan Wang, Yueqian Wang, Bo Chen, Tong Wu, Dongyan Zhao, and Zilong Zheng#, in CVPR, 2025.
  11. Look Both Ways and No Sink: Converting LLMs into Text Encoders without Training ACL'25
    Ziyong Lin*, Haoyi Wu*, Shu Wang, Kewei Tu#, Zilong Zheng#, and Zixia Jia#, in ACL, 2025.
  12. ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection ACL'25
    Jiaqi Li, Xinyi Dong, Yang Liu, Zhizhuo Yang, Quansen Wang, Xiaobo Wang, SongChun Zhu, Zixia Jia#, and Zilong Zheng#, in ACL Findings, 2025.
  13. Are the Values of LLMs Structurally Aligned with Humans? A Causal Perspective ACL'25
    Yipeng Kang, Junqi Wang, Yexin Li, Mengmeng Wang, Wenming Tu, Quansen Wang, Hengli Li, Tingjun Wu, Xue Feng, Fangwei Zhong, and Zilong Zheng#, in ACL Findings, 2025.
  14. DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints AAAI'25 Oral
    Andrew Zhao, Quentin Xu, Matthieu Liu, Shenzhi Wang, Yong-jin Liu, Zilong Zheng#, and Gao Huang#, in AAAI, 2025.
  15. Andrew Zhao, Yiran Wu, Yang Yue, Tong Wu, Quentin Xu, Yang Yue, Matthieu Lin, Shenzhi Wang, Qingyun Wu, Zilong Zheng#, and Gao Huang#, Preprint, 2025.
    GitHub Repo stars
  16. Yang Liu, Jiaqi Li, and Zilong Zheng#, Preprint, 2025.
  17. Hengli Li, Chenxi Li, Tong Wu, Xuekai Zhu, Yuxuan Wang, Zhaoxin Yu, Eric Hanchen Jiang, Song-Chun Zhu, Zixia Jia, Ying Nian Wu#, and Zilong Zheng#, Preprint, 2025.

2024

  1. JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models TPAMI'24
    Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma#, and Yitao Liang#, TPAMI, 2024.
  2. MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation SIGDIAL'24 Oral
    Shuwen Qiu, Mingdian Liu, Hengli Li, Song-Chun Zhu, and Zilong Zheng#, in SIGDIAL, 2024.
  3. Tong Wu, Yanpeng Zhao, and Zilong Zheng#, in NeurIPS, 2024.
  4. Mars: Situated Inductive Reasoning in an Open-World Environment NeurIPS'24
    Xiaojuan Tang, Jiaqi Li, Yitao Liang, Muhan Zhang#, and Zilong Zheng#, in NeurIPS D&B Track, 2024.
  5. Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge for Long Video Understanding EMNLP'24
    Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, and Zilong Zheng#, in EMNLP, 2024.
  6. Varying Sentence Representations via Condition-Specified Routers EMNLP'24
    Ziyong Lin, Quansen Wang, Zixia Jia#, and Zilong Zheng#, in EMNLP, 2024.
  7. ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning CoLM'24
    Yuxuan Wang, Alan Yuille, Zhuowan Li#, and Zilong Zheng#, in CoLM, 2024.
  8. Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling ACL'24
    Shenzhi Wang, Chang Liu, Zilong Zheng#, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Shaofei Wang, Shiji Song, and Gao Huang#, in ACL Findings, 2024.
  9. LooGLE: Can Long-Context Language Models Understand Long Contexts? ACL'24
    Jiaqi Li, Mengmeng Wang, Zilong Zheng#, and Muhan Zhang#, in ACL, 2024.
  10. LangSuit⋅E: Controlling, Planning, and Interacting with Large Language Models in Embodied Text Environments ACL'24
    Zixia Jia, Mengmeng Wang, Baichen Tong, Song-Chun Zhu, and Zilong Zheng#, in ACL Findings, 2024.
  11. Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels ACL'24
    Zixia Jia, Junpeng Li, Shichuan Zhang, and Zilong Zheng#, in ACL, 2024.
  12. MindAgent: Emergent Gaming Interaction
    Ran Gong, Qiuyuan Huang, Xiaojian Ma, Hoi Vo, Zane Durante, Yusuke Noda, Zilong Zheng, Demetri Terzopoulos, Fei-Fei Li, and Jianfeng Gao, in NAACL Findings, 2024.

2023

  1. ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab NeurIPS'23
    Jieming Cui*, Ziren Gong*, Baoxiong Jia*, Siyuan Huang, Zilong Zheng#, Jianzhu Ma#, and Yixin Zhu#, in NeurIPS D&B Track, 2023.
  2. DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning NeurIPS'23
    Hengli Li, Song-Chun Zhu, and Zilong Zheng#, in NeurIPS D&B Track, 2023.
  3. SQA3D: Situated Question Answering in 3D Scenes ICLR'23
    Xiaojian Ma*, Silong Yong*, Zilong Zheng#, Qing Li, Yitao Liang, Song-Chun Zhu, and Siyuan Huang#, in ICLR, 2023.
  4. Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models EMNLP'23
    Junpeng Li*, Zixia Jia*, and Zilong Zheng#, in EMNLP, 2023.
  5. VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions ACL'23
    Yuxuan Wang, Zilong Zheng#, Xueliang Zhao, Jinpeng Li, Yueqian Wang, and Dongyan Zhao#, in ACL, 2023.
  6. Modeling Instance Interactions for Joint Information Extraction with Neural High-Order Conditional Random Field ACL'23
    Zixia Jia, Zhaohui Yan, Wenjuan Han, Zilong Zheng#, and Kewei Tu#, in ACL, 2023.
  7. Shuō Wén Jiě Zì: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training ACL'23
    Yuxuan Wang, Jianghui Wang, Dongyan Zhao#, and Zilong Zheng#, in ACL Findings, 2023.

2022

  1. SHARP: Search-Based Adversarial Attack for Structured Prediction NAACL'22
    Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, and Kewei Tu#, in NAACL Findings, 2022.
  2. VGStore: A Multimodal Extension to SPARQL for Querying RDF Scene Graph ISWC'22
    Yanzeng Li, Zilong Zheng, Wenjuan Han, and Lei Zou, in ISWC Poster & Demo Track, 2022.
  3. Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling ICLR'22 Oral
    Bo Wan, Wenjuan Han, Zilong Zheng, and Tinne Tuytelaars, ICLR, 2022.
  4. Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships CVPR'22
    Chao Lou, Wenjuan Han#, Yuhuan Lin, and Zilong Zheng#, in CVPR, 2022.
  5. Energy-Based Generative Cooperative Saliency Prediction AAAI'22 Oral
    Jing Zhang, Jianwen Xie, Zilong Zheng, and Nick Barnes, AAAI, 2022.

2021

  1. Cooperative Training of Fast Thinking Initializer and Slow Thinking Solver for Multi-Modal Conditional Learning TPAMI
    Jianwen Xie*, Zilong Zheng*, Xiaolin Fang, Song-Chun Zhu, and Ying Nian Wu, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.
  2. Patchwise Generative ConvNet: Training Energy-Based Models from a Single Natural Image for Internal Learning CVPR'21 Oral
    Zilong Zheng, Jianwen Xie, and Ping Li, in CVPR, 2021.
  3. Learning Triadic Belief Dynamics in Nonverbal Communication from Videos CVPR'21 Oral
    Lifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, and Yixin Zhu, in CVPR, 2021.
  4. Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification CVPR'21
    Jianwen Xie, Yifei Xu, Zilong Zheng, Song-Chun Zhu, and Ying Nian Wu, in CVPR, 2021.
  5. GRICE: A Grammar-based Dataset for Recovering Implicature and Conversational rEasoning ACL'21
    Zilong Zheng, Shuwen Qiu, Lifeng Fan, Yixin Zhu, and Song-Chun Zhu, in ACL Findings, 2021.
  6. Learning Energy-Based Model with Variational Auto-Encoder as Amortized Sampler AAAI'21
    Jianwen Xie, Zilong Zheng, and Ping Li, in AAAI, 2021.
  7. Learning Cycle-Consistent Cooperative Networks via Alternating MCMC Teaching for Unsupervised Cross-Domain Translation AAAI'21
    Jianwen Xie*, Zilong Zheng*, Xiaolin Fang, Song-Chun Zhu, and Ying Nian Wu, AAAI, 2021.

2020

  1. Generative VoxelNet: Learning Energy-Based Models for 3D Shape Synthesis and Analysis TPAMI
    Jianwen Xie*, Zilong Zheng*, Ruiqi Gao, Wenguan Wang, Song-Chun Zhu, and Ying Nian Wu, in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.
  2. Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs ICRA'20
    Tao Yuan, Hangxin Liu, Lifeng Fan, Zilong Zheng, Tao Gao, Yixin Zhu, and Song-Chun Zhu, in ICRA, 2020.
  3. Motion-Based Generator Model: Unsupervised Disentanglement of Appearance, Trackable and Intrackable Motions in Dynamic Patterns AAAI'20 Oral
    Jianwen Xie*, Ruiqi Gao*, Zilong Zheng, Song-Chun Zhu, and Ying Nian Wu, in AAAI, 2020.

2019

  1. Reasoning Visual Dialogs with Structural and Partial Observations CVPR'19 Oral
    Zilong Zheng*, Wenguan Wang*, Siyuan Qi*, and Song-Chun Zhu, in CVPR, 2019.
  2. Learning Dynamic Generator Model by Alternating Back-Propagation Through Time AAAI'19 Spotlight
    Jianwen Xie*, Ruiqi Gao*, Zilong Zheng, Song-Chun Zhu, and Ying Nian Wu, in AAAI, 2019.

2018

  1. Learning Descriptor Networks for 3D Shape Synthesis and Analysis CVPR'18 Oral
    Jianwen Xie*, Zilong Zheng*, Ruiqi Gao, Wenguan Wang, Song-Chun Zhu, and Ying Nian Wu, in CVPR, 2018.