I received my Ph.D. from the Department of Computer Science at the University of California, Los Angeles (UCLA). Before that, I obtained dual bachelor's degrees in Computer Science from the University of Minnesota and the University of Electronic Science and Technology of China (UESTC).
My research interests lie at the intersection of statistical machine learning, natural language processing, and cognition. Current research themes include:
- General Multimodal Perception: General multimodal understanding, parsing and explainable modeling.
- Natural Language Reasoning: Neural-symbolic reasoning and planning with language models.
- Generative Modeling: Statistical generative modeling (e.g. EBMs, diffusions) on high-dimensional data.
- Human-Robot Cooperation: Building dialogue and planning models that are capable of interacting with humans in realistic environments.
I am always looking for self-motivated interns and long-term collaborators. Please contact me if you have an excellent background or share similar research interests.
|2023/10||One short paper on DocumentRE was accepted to EMNLP’23.|
|2023/08||Two benchmarks were accepted to the NeurIPS’23 D&B Track. Congratulations to Hengli and Jieming.|
|2023/05||Three papers were accepted to ACL’23. Congratulations to Yuxuan and Zixia.|
|2023/02||One paper was accepted to ICLR’23. Congratulations to Xiaojian and Silong.|
|2022/12||I served as an organizer of the first UM-IoS workshop at EMNLP’22. I also gave a keynote talk on Vision-Language Joint Parsing.|
: Equal contribution,
: Corresponding author
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning. NeurIPS'23. In The Thirty-Seventh Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS D&B Track), 2023
MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation. ICML'23. In Workshop on Theory-of-Mind at the Fortieth International Conference on Machine Learning (ICML), 2023
SQA3D: Situated Question Answering in 3D Scenes. ICLR'23. The Eleventh International Conference on Learning Representations (ICLR), 2023
Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models. EMNLP'23. In The Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions. ACL'23. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Modeling Instance Interactions for Joint Information Extraction with Neural High-Order Conditional Random Field. ACL'23. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
In situ bidirectional human-robot value alignment. Science Robotics, 2022
Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling. Oral, ICLR'22. The Tenth International Conference on Learning Representations (ICLR), 2022
Patchwise Generative ConvNet: Training Energy-Based Models from a Single Natural Image for Internal Learning. Oral, CVPR'21. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Reasoning Visual Dialogs with Structural and Partial Observations. Oral, CVPR'19. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Learning Descriptor Networks for 3D Shape Synthesis and Analysis. Oral, CVPR'18. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
- Conference reviewer for ICML 2022-2023; ICLR 2022-2023; CVPR 2019-2022; NeurIPS 2020-2022; AAAI 2020-2022; ICCV 2019-2023; ECCV 2020-2022; BMVC 2020; WACV 2021
- Journal reviewer for International Journal of Computer Vision (IJCV), Pattern Recognition (PR), Neurocomputing