Lei Han (韩磊)

Principal Research Scientist

Tencent Robotics X
Tencent, Shenzhen, China

Office: Binhai Tower, Shenzhen, China.
Tencent Email: lxhan at tencent dot com
Gmail: leihan dot cs at gmail dot com

Previous email addresses: lei.han@msstate.edu; lhan@stat.rutgers.edu; leihan@comp.hkbu.edu.hk; hanlei@cis.pku.edu.cn


I am a principal research scientist at Tencent. I am directing the Center of Embodied Intelligence at Tencent Robotics X. I was an assistant research professor at the Department of Basic Science, Mississippi State University, USA. I received my Ph.D from Peking University (advised by Professor Kunqing Xie) and spent two years in Hong Kong Baptist University (advised by Professor Yu Zhang) and Rutgers University (advised by Professor Tong Zhang) as postdoctoral researcher. I was the winner of the Best Dissertation Award of Chinese Association for Artificial Intelligence (CAAI) (中国人工智能学会优博). My research interests mainly focus on machine learning, artifical intelligence, and applications in robotics and game.

Selected Publications

[Google Scholar]

Long-Term Projects

  • [4] Lei Han*, Qingxu Zhu*, Jiapeng Sheng*, Chong Zhang*, Tingguang Li*, Yizheng Zhang*, He Zhang*, Yuzhen Liu, Cheng Zhou, Rui Zhao, Jie Li, Yufeng Zhang, Rui Wang, Wanchao Chi, Xiong Li, Yonghui Zhu, Lingzhu Xiang, Xiao Teng, Zhengyou Zhang. (* Equal contribution)
    Lifelike Agility and Play in Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models. [project page][nature][arxiv][code & data]
    In: Nature Machine Intelligence, Vol. 6, No. 7, 2024. (Cover Article)

  • [3] Lei Han, Jiechao Xiong, Peng Sun, Xinghai Sun, Meng Fang, Qingwei Guo, Qiaobo Chen, Tengfei Shi, Zhengyou Zhang. (* Equal contribution. Correspondence to the first three authors.)
    TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game. [arxiv]
    arXiv preprint arXiv:2011.13729, 2020.
  • [2] Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang. (* Equal contribution. Correspondence to the first three authors.)
    TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning. [arxiv]
    arXiv preprint arXiv:2011.12895, 2020.
  • [1] Peng Sun*, Xinghai Sun*, Lei Han*, Jiechao Xiong*, Qing Wang, Bo Li, Yang Zheng, Ji Liu, Yongsheng Liu, Han Liu, Tong Zhang. (*Equal contribution)
    TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game. [arxiv]
    arXiv:1809.07193 [cs.AI], 2018.

2024

  • [61] Xinyu Xu, Yizheng Zhang, Yong-Lu Li, Lei Han, Cewu Lu.
    HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid. [arxiv]
    In: The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
  • [60] Ye Tian, Baolin Peng, Linfeng Song, Lifeng Jin, Dian Yu, Lei Han, Haitao Mi, Dong Yu.
    Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing. [arxiv]
    In: The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
  • [59] Pengyu Cheng, Tianhao Hu, Han Xu, Zhisong Zhang, Yong Dai, Lei Han, Nan Du, Xiaolong Li.
    Self-playing Adversarial Language Game Enhances LLM Reasoning. [arxiv]
    In: The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
  • [58] Lei Han*, Qingxu Zhu*, Jiapeng Sheng*, Chong Zhang*, Tingguang Li*, Yizheng Zhang*, He Zhang*, Yuzhen Liu, Cheng Zhou, Rui Zhao, Jie Li, Yufeng Zhang, Rui Wang, Wanchao Chi, Xiong Li, Yonghui Zhu, Lingzhu Xiang, Xiao Teng, Zhengyou Zhang. (* Equal contribution)
    Lifelike Agility and Play in Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models. [project page][nature][arxiv][code & data]
    In: Nature Machine Intelligence, Vol. 6, No. 7, 2024.
  • [57] Yingru Li, Jiawei Xu, Lei Han, Zhiquan Luo.
    Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent. [arxiv]
    In: International Conference on Machine Learning (ICML), 2024.
  • [56] Chong Zhang, Jiapeng Sheng, Tingguang Li, He Zhang, Cheng Zhou, Qingxu Zhu, Rui Zhao, Yizheng Zhang, Lei Han.
    Learning Highly Dynamic Behaviors for Quadrupedal Robots. [arxiv]
    In: IEEE International Conference on Robotics and Automation (ICRA), 2024.
  • [55] Haojie Shi, Tingguang Li, Qingxu Zhu, Jiapeng Sheng, Lei Han, Max Q.-H. Meng.
    An Efficient Model based Approach on Learning Agile Motor Skills without Reinforcement. [arxiv]
    In: IEEE International Conference on Robotics and Automation (ICRA), 2024.
  • [54] Jiyuan Shi, Chenjia Bai, Haoran He, Lei Han, Dong Wang, Bin Zhao, Mingguo Zhao, Xiu Li, Xuelong Li.
    Robust Quadrupedal Locomotion via Risk-Averse Policy Learning. [arxiv]
    In: IEEE International Conference on Robotics and Automation (ICRA), 2024.
  • [53] Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang.
    Towards Robust Offline Reinforcement Learning under Diverse Data Corruption. [arxiv]
    In: The Twelfth International Conference on Learning Representations (ICLR), 2024. (Spotlight)
  • [52] Yucheng Yang, Tianyi Zhou, Qiang He, Lei Han, Mykola Pechenizkiy, Meng Fang.
    Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning. [arxiv]
    In: The Twelfth International Conference on Learning Representations (ICLR), 2024. (Spotlight)
  • [51] Yucheng Yang, Tianyi Zhou, Lei Han, Meng Fang and Mykola Pechenizkiy.
    Automatic Curriculum for Unsupervised Reinforcement Learning. [arxiv]
    In: International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2024.
  • [50] Jiawei Xu, Cheng Zhou, Yizheng Zhang, Baoxiang Wang, Lei Han.
    Relative Policy-Transition Optimization for Fast Policy Transfer. [arxiv]
    In: The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024.

2023

  • [49] Huiqi Zhao, Yizheng Zhang, Lei Han, Weiqi Qian, Jiabin Wang, Heting Wu, Jingchen Li, Yuan Dai, Zhengyou Zhang, Chris R. Bowen, Ya Yang.
    Intelligent Recognition using Ultralight Multifunctional Nano-layered Carbon Aerogel Sensors with Human-like Tactile Perception. [PDF]
    In: Nano-Micro Letters (IF=23.655), 2023.
  • [48] Honghua Dong, Jiawei Xu, Yu Yang, Rui Zhao, Shiwen Wu, Chun Yuan, Xiu Li, Chris J. Maddison, Lei Han.
    MeGraph: Capturing Long-Range Interactions by Alternating Local and Hierarchical Aggregation on Multi-Scaled Graph Hierarchy. [PDF]
    In: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023.
  • [47] Qingxu Zhu*, He Zhang*, Mengting Lan, Lei Han. (*Equal Contribution)
    Neural Categorical Priors for Physics-Based Character Control. [project page][arxiv]
    In: ACM Transactions on Graphics (TOG) (Proc. ACM SIGGRAPH Asia 2023), 2023. (Best Paper Honourable Mention)
  • [46] Tingguang Li, Yizheng Zhang, Chong Zhang, Qingxu Zhu, Jiapeng sheng, Wanchao Chi, Cheng Zhou, Lei Han.
    Learning Terrain-Adaptive Locomotion with Agile Behaviors by Imitating Animals. [arxiv]
    In: International Conference on Intelligent Robots and Systems (IROS), 2023.
  • [45] Shenao Zhang, Li Shen, Lei Han, Li Shen.
    Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning. [arxiv]
    In: Conference on Lifelong Learning Agents (CoLLAs), 2023.
  • [44] Shuxing Li*, Jiawei Xu*, Honghua Dong, Yu Yang, Chun Yuan, Peng Sun, Lei Han. (*Equal Contribution)
    The Fittest Wins: a Multi-Stage Framework Achieving New SOTA in ViZDoom Competition. [link]
    In: IEEE Transactions on Games (TG), 2023.
  • [43] Jiawei Xu*, Shuxing Li*, Rui Yang, Chun Yuan, Lei Han. (*Equal Contribution)
    Efficient Multi-Goal Reinforcement Learning via Value Consistency Prioritization. [link]
    In: Journal of Artificial Intelligence Research (JAIR), 2023.
  • [42] Rui Zhao*, Xu Liu*, Yizheng Zhang*, Minghao Li, Cheng Zhou, Shuai Li and Lei Han. (*Equal Contribution)
    CraftEnv: A Flexible Collective Robotic Construction Environment for Multi-Agent Reinforcement Learning. [link]
    In: International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023.

2022

  • [41] Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou.
    Exploiting Reward Shifting in Value-Based Deep RL. [arxiv]
    In: The Conference on Neural Information Processing Systems (NeurIPS), 2022.
  • [40] Rui Yang*, Chenjia Bai*, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han. (*Equal Contribution)
    RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. [arxiv]
    In: The Conference on Neural Information Processing Systems (NeurIPS), 2022. (Spotlight)
  • [39] Qiwei Xu, Yizheng Zhang, Shenghao Zhang, Rui Zhao, Zhuoxing Wu, Dongsheng Zhang, Cheng Zhou, Xiong Li, Jiahong Chen, Zengjun Zhao, Luyang Tang, Zhengyou Zhang, Lei Han.
    RECCraft System: Towards Reliable and Efficient Collective Robotic Construction. [PDF]
    In: Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022.
  • [38] Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang.
    Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. [PDF]
    In: The Tenth International Conference on Learning Representations (ICLR), 2022.

2021

  • [37] Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang.
    Dynamic Bottleneck for Robust Self-Supervised Exploration. [PDF]
    In: The Conference on Neural Information Processing Systems (NeurIPS), 2021.
  • [36] Rui Yang, Meng Fang, Lei Han, Yali Du, Feng Luo, Xiu Li.
    MHER: Model-based Hindsight Experience Replay. [PDF]
    In: DeepRL Workshop at NeurIPS, 2021.
  • [35] Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang.
    Principled Exploration via Optimistic Bootstrapping and Backward Induction. [arxiv]
    In: International Conference on Machine Learning (ICML), 2021. (Spotlight)
  • [34] Chenjia Bai, Peng Liu, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao, Lei Han, Zhaoran Wang.
    Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. [PDF]
    In: IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.
  • [33] Lu Wang, Lei Han, Xinru Chen, Chengchang Li, Junzhou Huang, Weinan Zhang, Wei Zhang, Xiaofeng He, Dijun Luo.
    Hierarchical Multi-Agent Reinforcement Learning for Allocating Guaranteed Display Ads. [PDF]
    In: IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.
  • [32] Kaitlyn Waters, Cheng Gao, Matthew Ykema, Lei Han, Lynden Voth, Yizhi Tao, Xiu-Feng Wan.
    Triple reassortment increases compatibility among viral ribonucleoprotein genes of contemporary avian and human influenza A viruses. [Link]
    In: PLOS Pathogens, 2021.
  • [31] Kaitlyn Waters, Hamilton Wan, Lei Han, Jianli Xue, Matthew Ykema, Yizhi Tao, Xiu-Feng Henry Wan.
    Variations outside the conserved motifs of PB1 catalytic active site may affect replication efficiency of the RNP complex of influenza A virus. [Link]
    In: Virology, 2021.

2020

  • [30] Lei Han, Kean Ming Tan, Ting Yang and Tong Zhang.
    Local Uncertainty Sampling for Large-Scale Multi-Class Logistic Regression. [PDF] [arxiv]
    In: Annals of Statistics (AOS), 48(3): 1770-1788, 2020. arXiv:1604.08098, 2016.
  • [29] Lei Li, Deborah Chang, Lei Han, Xiaojian Zhang, Joseph Zaia, Xiu-Feng Wan.
    Multi-Task Learning Sparse Group Lasso: a Method for Quantifying Antigenicity of Influenza A(H1N1) Virus using Mutations and Variations in Glycosylation of Hemagglutinin. [PDF]
    In: BMC Bioinformatics, 2020.
  • [28] Yiheng Huang, Jinchuan Tian, Lei Han, Guangsen Wang, Xingchen Song, Dan Su, Dong Yu.
    A Random Gossip BMUF Process for Neural Language Modeling. [arxiv]
    In: International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.
  • [27] Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Shaohua Tan, Kuiyuan Yang.
    Gated Fully Fusion for Semantic Segmentation. [arxiv]
    In: The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI), 2020.

2019

  • [26] Yali Du*, Lei Han*, Meng Fang, Ji Liu, Tianhong Dai, Dacheng Tao. (* equal contribution)
    LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning. [PDF]
    In: The Thirty-third Annual Conference on Neural Information Processing Systems (NeurIPS), 2019.
  • [25] Meng Fang, Tianyi Zhou, Yali Du, Lei Han, Zhengyou Zhang.
    Curriculum-guided Hindsight Experience Replay. [PDF]
    In: The Thirty-third Annual Conference on Neural Information Processing Systems (NeurIPS), 2019.
  • [24] Lei Han*, Peng Sun*, Yali Du*, Jiechao Xiong, Qing Wang, Xinghai Sun, Han Liu, Tong Zhang. (* equal contribution)
    Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI. [PDF][Supplementary Material][demo video]
    In: The Thirty-sixth International Conference on Machine Learning (ICML), 2019.
  • [23] Yu Zhang and Lei Han.
    Learning (from) Deep Hierarchical Structure among Features. [PDF][Link]
    In: The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), 2019.

2018

  • [22] Qing Wang, Jiechao Xiong, Lei Han, Peng Sun, Han Liu and Tong Zhang.
    Exponentially Weighted Imitation Learning for Batched Historical Data. [PDF][Link]
    In: The Thirty-second Annual Conference on Neural Information Processing Systems (NeurIPS), 2018.
  • [21] Lei Han, Yiheng Huang and Tong Zhang.
    Candidates vs Noises Estimation for Large Multi-Class Classification Problem. [PDF][Link][arxiv]
    In: The 35th International Conference on Machine Learning (ICML), 2018. (Long Presentation)
  • [20] Lei Han, Lei Li, Feng Wen, Lei Zhong, Tong Zhang and Xiu-Feng Wan.
    Graph-Guided Multi-Task Sparse Learning Model: a Method for Identifying Antigenic Variants of Influenza A(H3N2) Virus. [PDF]
    In: Bioinformatics, 2018.
  • [19] Dong Dai, Lei Han, Ting Yang and Tong Zhang.
    Bayesian Model Averaging with Exponentiated Least Squares Loss. [Link][arxiv]
    In: IEEE Transactions on Information Theory (TIT), 2018.
  • [18] Sichen Du, Guojie Song, Lei Han and Haikun Hong.
    Temporal Causal Inference with Time Lag. [Link]
    In: Neural Computation 30 (1), 271-291, 2018.

2016

  • [17] Lei Han, Yu Zhang, Xiu-Feng Wan and Tong Zhang.
    Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data. [PDF][Link][Supplementary Material][Code]
    In: Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, USA, 2016. (Acceptance Rate = 18.1%. Full presentation with acceptance rate = 8.9%)
  • [16] Lei Han*, Yu Zhang* and Tong Zhang (*equal contribution)
    Fast Component Pursuit for Large-Scale Inverse Covariance Estimation. [PDF][Link]
    In: Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, USA, 2016. (Acceptance Rate = 18.1%)
  • [15] Lei Han and Yu Zhang. (Both authors contributed equally)
    Reduction Techniques for Graph-based Convex Clustering. [PDF][Link][Supplementary Material]
    In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI), Phoenix, Arizona, USA, 2016. (Acceptance Rate = 26%)
  • [14] Lei Han and Yu Zhang. (Both authors contributed equally)
    Multi-Stage Multi-Task Learning with Reduced Rank. [PDF][Link][Supplementary Material]
    In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI), Phoenix, Arizona, USA, 2016. (Acceptance Rate = 26%)
  • [13] Lei Li, Lei Han and Xiu-Feng Wan.
    Identification of glycosylation sites and mutations determining antigenic drift events for influenza A viruses using sparse group lasso regression. [Link]
    In: GLYCOBIOLOGY 26 (12), 1393-1394, 2016.
  • [12] Xiabing Zhou, Xingxing Xing, Lei Han, Haikun Hong, Kaigui Bian, Kunqing Xie.
    Structure feature learning method for incomplete data. [PDF][Link][Supplementary Material]
    In: International Journal of Pattern Recognition and Artificial Intelligence 30 (9): 1660007, 2016.

2015

  • [11] Lei Han and Yu Zhang. (Both authors contributed equally)
    Learning Tree Structure in Multi-Task Learning. [PDF][Link][Supplementary Material][Code]
    In: Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Sydney, 2015. (Acceptance Rate = 19%)
  • [10] Xiabing Zhou, Lei Han, Xingxing Xing, Haikun Hong, Wenhao Huang, Kaigui Bian and Kunqing Xie.
    Incorporating temporal smoothness and group structure in learning with incomplete data. [Link]
    In: Proceedings of 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), 2015.
  • [9] Ye Liu, Liqiang Nie, Lei Han, Luming Zhang, David Rosenblum.
    Action2Activity: Recognizing Complex Activities from Sensor Data. [Link]
    In: International Joint Conference on Artificial Intelligence (IJCAI), 2015. (Acceptance Rate = 28.8%)
  • [8] Guojie Song, Lei Han* and Kunqing Xie. (* The corresponding author; the first two authors contributed equally)
    Overlapping Decomposition for Gaussian Graphical Modeling. [PDF][Link]
    In: IEEE Transactions on Knowledge and Data Engineering (TKDE), 2015. (An improved version of the conference paper appeared in KDD2012)
  • [7] Lei Han and Yu Zhang. (Both authors contributed equally)
    Learning Multi-Level Task Groups in Multi-Task Learning. [PDF][Link][Supplementary Material][Code]
    In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI), Austin Texas, USA, 2015. (Acceptance Rate = 26.7%)
  • [6] Lei Han and Yu Zhang. (Both authors contributed equally)
    Discriminative Feature Grouping. [PDF][Link][Code]
    In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI), Austin Texas, USA, 2015. (Acceptance Rate = 26.7%)

Before 2014

  • [5] Lei Han, Yu Zhang, Guojie Song and Kunqing Xie.
    Encoding tree-sparsity in Multi-Task Learning: A Probabilistic Framework. [PDF][Link]
    In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI), Quebec City, Quebec, Canada, 2014. (Acceptance Rate = 28%)
  • [4] Lei Han, Guojie Song, Gao Cong and Kunqing Xie.
    Overlapping Decomposition for Causal Graphical Modeling. [PDF][Link]
    In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge Discovery and Data Mining (KDD), Beijing, China, 2012. (Acceptance Rate = 18%)
  • [3] Lei Han, Kunqing Xie and Guojie Song.
    Adaptive Fit Parameters Tuning with Data Density Changes in Locally Weighted Learning. [PDF]
    In: Proceedings of the 7th International Symposium on Neural Networks (ISNN), Shanghai, China, 2010.
  • [2] Lei Han, Jianying Wu, Ping Gu, Kunqing Xie, Guojie Song, Shiwei Tang, Dongqing Yang, Bingli Jiao and Feng Gao.
    Adaptive Knowledge Transfer based on Locally Weighted Learning. [PDF]
    In: Proceedings of the Conference on Technologies and Applications of Artificial Intelligence (TAAI), Hsinchu, Taiwan, 2010.
  • [1] Lei Han, Meng Shuai, Kunqing Xie, Guojie Song and Xiujun Ma.
    Locally Kernel Regression Adapting with Data Distribution in Prediction of Traffic Flow. [PDF]
    In: Proceedings of the 18th International Conference on Geoinformatics (Geoinformatics), Beijing, China, 2010.

Preprints & Technical Reports

  • Qing Wang, Jiechao Xiong, Lei Han, Meng Fang, Xinghai Sun, Zhuobin Zheng, Peng Sun, Zhengyou Zhang. (* equal contribution)
    Arena: a toolkit for Multi-Agent Reinforcement Learning. [arxiv]
    arXiv:1907.09467 [cs.LG], 2019.
  • Yiheng Huang, Liqiang He*, Guangsen Wang, Lei Han and Dan Su. (* equal contribution)
    Phrase-Level Class based Language Model for Mandarin Smart Speaker Query Recognition. [arxiv]
    arXiv:1909.00556 [cs.CL], 2019.
  • Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, Yang Zheng, Haobo Fu, Tong Zhang, Ji Liu, Han Liu
    Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space. [arxiv]
    arXiv:1810.06394 [cs.LG], 2018.

Ph.D. Thesis (In Chinese)

  • Lei Han. Multi-Task Learning Methods in Traffic Network Analysis. EECS, Peking University, July, 2014.

Activities

  • Journal Reviewer:
    IEEE Transactions on Knowledge and Data Engineering (TKDE)
    IEEE Transactions on Intelligent Transportation Systems (TITS)
    Journal of Machine Learning Research (JMLR)
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
    Neurocomputing

  • PC Member / Reviewer:
    AAAI: 2016-2020
    NeurIPS: 2016-2023
    ICML: 2018-2023
    ICLR: 2018-2023

  • Senior PC Member:
    IJCAI: 2020, 2021

Honors and Awards

  • ACM Transactions on Graphics (Proc. ACM SIGGRAPH Asia 2023), Best Paper Honourable Mention, 2023.

  • The Best Dissertation Award of Chinese Association for Artificial Intelligence (CAAI) (中国人工智能学会优博), 2016. [Link]

  • Outstanding Ph.D. Graduate Award in Beijing, 2014

  • Outstanding Ph.D. Graduate Award in Peking University, 2014

  • President Scholarship (the highest Scholarship in Peking University), 2013-2014

  • President Scholarship (the highest Scholarship in Peking University), 2012-2013

  • Chinese National Scholarship (selected from top Ph.D. students in China), 2011-2012

  • President Scholarship (the highest Scholarship in Peking University), 2011-2012

  • President Scholarship (the highest Scholarship in Peking University), 2010-2011

  • Merit Student Award (selected from top postgraduates in Peking University), 2010-2011

  • President Scholarship (the highest Scholarship in Peking University), 2009-2010

  • Merit Student Award (selected from top postgraduates in Peking University), 2009-2010