Huazhe Xu

Ph.D. Candidate [Google Scholar] [CV]

Huazhe Xu*, Boyuan Chen*, Yang Gao, Trevor Darrell.
Zero-shot Policy Learning with Spatial Temporal Reward Decomposition on Contingency-aware Observation
International Conference on Robotics and Automation (ICRA), 2021.

[arXiv] [code coming soon] [project page]

Mike Lambeta, Huazhe Xu, Jingwei Xu, Po-Wei Chou, Shaoxiong Wang, Trevor Darrell, and Roberto Calandra.
PyTouch: A Machine Learning Library for Touch Processing
International Conference on Robotics and Automation (ICRA), 2021.

[arXiv coming soon] [code coming soon]

Jiashun Wang, Huazhe Xu, Jingwei Xu, Sifei Liu, Xiaolong Wang.
Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes.
Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

[arXiv] [project page]

Yunfei Li, Huazhe Xu, Yilin Wu, Xiaolong Wang, Yi Wu.
Solving Compositional Reinforcement Learning Problems via Task Reduction.
International Conference on Learning Representations (ICLR), 2021.

[openreview] [project page]

Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Shaolei Du, Yu Wang, Yi Wu.
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.
International Conference on Learning Representations (ICLR), 2021.

[openreview] [project page]

Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian.
Multi-Agent Collaboration via Reward Attribution Decomposition
arXiv preprint, 2020.

[arXiv] [Code]

Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian.
BEBOLD: Exploration Beyond the Boundary of Explored Regions
arXiv preprint, 2020.

[arXiv] [Code coming soon]

Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang.
Multi-Task Reinforcement Learning with Soft Modularization.
Conference on Neural Information Processing Systems (NeurIPS), 2020.

[pdf] [code] [project page] [Talk]

Jingwei Xu*, Huazhe Xu*, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell.
Hierarchical Style-based Networks for Motion Synthesis.
European Conference on Computer Vision (ECCV), 2020.

[arXiv] [project page] [BibTeX]

Jingwei Xu*, Huazhe Xu*, Bingbing Ni, Xiaokang Yang, Trevor Darrell.
Video Prediction via Demonstration Guidance
International Conference on Machine Learning (ICML), 2020.

[arXiv] [project page] [code]

Yuping Luo, Huazhe Xu, Tengyu Ma.
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling,
International Conference on Learning Representations (ICLR), 2020.

[arXiv]

Jierui Lin*, Yifei Xing*, Huazhe Xu, Yang Gao.
Learning a Perception-Logic Network for Unsupervised Scene Conditioned Driving Behavior,
Robotics: Science and Systems IDA workshop (RSS workshop), 2020.

[manuscript] [code coming soon] [talk] [demo]

Yuping Luo*, Huazhe Xu*, Yuanzhi Li, Yuandong Tian, Tengyu Ma.
Algorithmic Framework for Model-based Reinforcement Learning with Theoretical Guarantees,
International Conference on Learning Representations (ICLR), 2019.

[arXiv] [code]

Hang Gao*, Huazhe Xu*, Qi-zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell.
Disentangling Propagation and Generation for Video Prediction,
International Conference on Computer Vision (ICCV), 2019.

[arXiv] [code coming soon]

Yang Gao*, Huazhe Xu*, Fisher Yu, Sergey Levine, Trevor Darrell.
Reinforcement Learning from Imperfect Demonstrations,
NeurIPS 2018 Deep RL Symposium (NeurIPS Symposium), 2018.

[arXiv]

Haoran Tang*, Dennis Lee*, Jeffrey O Zhang, Huazhe Xu, Trevor Darrell, Pieter Abbeel.
Modular Architecture for StarCraft II with Deep Reinforcement Learning,
The 14th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE), 2018.

[arXiv]

Huazhe Xu*, Yang Gao*, Fisher Yu, Trevor Darrell.
End-to-end Learning of Driving Models from Large-scale Video Datasets,
Conference on Computer Vision and Pattern Recognition (CVPR), 2017. (oral)

[arXiv] [code]

Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell.
Natural Language Object Retrieval,
Conference on Computer Vision and Pattern Recognition (CVPR), 2016. (oral)

[arXiv] [code]