
Huazhe Xu*, Boyuan Chen*, Yang Gao, Trevor Darrell.
Zero-shot Policy Learning with Spatial Temporal RewardDecomposition on Contingency-aware Observation
International Conference on Robot Automation (ICRA), 2021.

Mike Lambeta, Huazhe Xu, Jingwei Xu, Po-Wei Chou, Shaoxiong Wang,Trevor Darrell, and Roberto Calandra
PyTouch: A Machine Learning Library for Touch Processing
International Conference on Robot Automation (ICRA), 2021.

Jiashun Wang, Huazhe Xu, Jingwei Xu, Sifei Liu, Xiaolong Wang.
Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes.
Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

Yunfei Li, Huazhe Xu, Yilin Wu, Xiaolong Wang, Yi Wu.
Solving Compositional Reinforcement Learning Problems via Task Reduction.
International Conference on Learning Representations (ICLR), 2021.

Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Shaolei Du, Yu Wang, Yi Wu.
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.
International Conference on Learning Representations (ICLR), 2021.


Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian.
BEBOLD: Exploration Beyond the Boundary of
Explored Regions
Arxiv preprints, 2020.

Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang.
Multi-Task Reinforcement Learning with Soft Modularization.
Conference on Neural Information Processing Systems (NeurIPS), 2020.

Jingwei Xu*, Huazhe Xu*, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell.
Hierarchical Style-based Networks for Motion Synthesis.
European Conference on Computer Vision (ECCV), 2020.

Jingwei Xu*, Huazhe Xu*, Bingbing Ni, Xiaokang Yang, Trevor Darrell.
Video Prediction via Demonstration Guidance
International Conference on Machine Learning (ICML), 2020.

Yuping Luo, Huazhe Xu, Tengyu Ma.
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling,
International Conference on Learning Representation (ICLR), 2020.

Jierui Lin*, Yifei Xing*, Huazhe Xu, Yang Gao.
Learning a Perception-Logic Network for Unsupervised Scene Conditioned Driving Behavior,
Robotics: Science and Systems IDA workshop (RSS workshop), 2020.


Hang Gao*, Huazhe Xu*, Qi-zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell.
Disentangling Propagationand Generation for Video Prediction,
International Conference on Computer Vision (ICCV), 2019.

Yang Gao*,Huazhe Xu*, Fisher Yu, Sergey Levine, Trevor Darrell.
Reinforcement Learning from Imperfect Demonstrations,
Neurips 2018 Deep RL Symposium (Neurips Symposium), 2018.

Haoran Tang*, Dennis Lee*, Jeffrey O Zhang, Huazhe Xu, Trevor Darrell, Pieter Abbeel.
Modular Architecture for StarCraft II with Deep Reinforcement Learning,
he 14th AAAI Conference on Artificial Intelligenceand Interactive Digital Entertainmen (AIIDE), 2018.
