Lei Han (韩磊)

Principal Research Scientist

Tencent Robotics X Lab
Tencent AI Lab
Tencent, Shenzhen, China

Office: Binhai Tower, Shenzhen, China.
Tencent Email: lxhan at tencent dot com
Gmail: leihan dot cs at gmail dot com

Previous email addresses: lei.han@msstate.edu; lhan@stat.rutgers.edu; leihan@comp.hkbu.edu.hk; hanlei@cis.pku.edu.cn


I am a principal research scientist at Tencent Robotics X Lab, where I direct the Agent Learning Center. Previously, I was an assistant research professor at the Department of Basic Science, Mississippi State University, USA. I received my Ph.D. from Peking University (advised by Professor Kunqing Xie) and spent two years at Hong Kong Baptist University (advised by Professor Yu Zhang) and Rutgers University (advised by Professor Tong Zhang) as a postdoctoral researcher. I won the Best Dissertation Award of the Chinese Association for Artificial Intelligence (CAAI) (中国人工智能学会优博).

My research interests mainly focus on machine learning and artificial intelligence. I am especially interested in large-scale statistical machine learning, reinforcement learning, optimization, multi-task learning, and their applications in robotics, games, NLP, CV, and bioinformatics.

Selected Publications

[Google Scholar]

Preprints & Technical Reports

  • Lei Han, Jiechao Xiong, Peng Sun, Xinghai Sun, Meng Fang, Qingwei Guo, Qiaobo Chen, Tengfei Shi, Zhengyou Zhang.
    TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game. [arxiv]
    arXiv preprint arXiv:2011.13729, 2020 (* Equal contribution, correspondence to the first three authors)

  • Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang.
    TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning. [arxiv]
    arXiv preprint arXiv:2011.12895, 2020 (* Equal contribution, correspondence to the first three authors)

  • Qing Wang, Jiechao Xiong, Lei Han, Meng Fang, Xinghai Sun, Zhuobin Zheng, Peng Sun, Zhengyou Zhang. (* equal contribution)
    Arena: a toolkit for Multi-Agent Reinforcement Learning. [arxiv]
    arXiv:1907.09467 [cs.LG], 2019.

  • Yiheng Huang, Liqiang He*, Guangsen Wang, Lei Han and Dan Su. (* equal contribution)
    Phrase-Level Class based Language Model for Mandarin Smart Speaker Query Recognition. [arxiv]
    arXiv:1909.00556 [cs.CL], 2019.

  • Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, Yang Zheng, Haobo Fu, Tong Zhang, Ji Liu, Han Liu
    Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space. [arxiv]
    arXiv:1810.06394 [cs.LG], 2018.

  • Peng Sun*, Xinghai Sun*, Lei Han*, Jiechao Xiong*, Qing Wang, Bo Li, Yang Zheng, Ji Liu, Yongsheng Liu, Han Liu, Tong Zhang. (*equal contribution)
    TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game. [arxiv]
    arXiv:1809.07193 [cs.AI], 2018.

2022

  • [40] Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou.
    Exploiting Reward Shifting in Value-Based Deep RL. [arxiv]
    In: The Conference on Neural Information Processing Systems (NeurIPS), 2022.
  • [39] Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han.
    RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. [arxiv]
    In: The Conference on Neural Information Processing Systems (NeurIPS), 2022.
  • [38] Qiwei Xu, Yizheng Zhang, Shenghao Zhang, Rui Zhao, Zhuoxing Wu, Dongsheng Zhang, Cheng Zhou, Xiong Li, Jiahong Chen, Zengjun Zhao, Luyang Tang, Zhengyou Zhang, Lei Han.
    RECCraft System: Towards Reliable and Efficient Collective Robotic Construction. [PDF]
    In: Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022.
  • [37] Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang.
    Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. [PDF]
    In: The Tenth International Conference on Learning Representations (ICLR), 2022.

2021

  • [36] Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang.
    Dynamic Bottleneck for Robust Self-Supervised Exploration. [PDF]
    In: The Conference on Neural Information Processing Systems (NeurIPS), 2021.
  • [35] Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang.
    Principled Exploration via Optimistic Bootstrapping and Backward Induction. [arxiv]
    In: International Conference on Machine Learning (ICML), 2021.

  • [34] Chenjia Bai, Peng Liu, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao, Lei Han, Zhaoran Wang.
    Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. [PDF]
    In: IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.
  • [33] Lu Wang, Lei Han, Xinru Chen, Chengchang Li, Junzhou Huang, Weinan Zhang, Wei Zhang, Xiaofeng He, Dijun Luo.
    Hierarchical Multi-Agent Reinforcement Learning for Allocating Guaranteed Display Ads. [PDF]
    In: IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.

  • [32] Kaitlyn Waters, Cheng Gao, Matthew Ykema, Lei Han, Lynden Voth, Yizhi Tao, Xiu-Feng Wan.
    Triple reassortment increases compatibility among viral ribonucleoprotein genes of contemporary avian and human influenza A viruses. [Link]
    In: PLOS Pathogens, 2021.
  • [31] Kaitlyn Waters, Hamilton Wan, Lei Han, Jianli Xue, Matthew Ykema, Yizhi Tao, Xiu-Feng Henry Wan.
    Variations outside the conserved motifs of PB1 catalytic active site may affect replication efficiency of the RNP complex of influenza A virus. [Link]
    In: Virology, 2021.

2020

  • [30] Lei Han, Kean Ming Tan, Ting Yang and Tong Zhang.
    Local Uncertainty Sampling for Large-Scale Multi-Class Logistic Regression. [PDF] [arxiv]
    In: Annals of Statistics (AOS), 48(3): 1770-1788, 2020. arXiv:1604.08098, 2016.

  • [29] Lei Li, Deborah Chang, Lei Han, Xiaojian Zhang, Joseph Zaia, Xiu-Feng Wan.
    Multi-Task Learning Sparse Group Lasso: a Method for Quantifying Antigenicity of Influenza A(H1N1) Virus using Mutations and Variations in Glycosylation of Hemagglutinin. [PDF]
    In: BMC Bioinformatics, 2020.

  • [28] Yiheng Huang, Jinchuan Tian, Lei Han, Guangsen Wang, Xingchen Song, Dan Su, Dong Yu.
    A Random Gossip BMUF Process for Neural Language Modeling. [arxiv]
    In: International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.

  • [27] Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Shaohua Tan, Kuiyuan Yang.
    Gated Fully Fusion for Semantic Segmentation. [arxiv]
    In: The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI), 2020.

2019

  • [26] Yali Du*, Lei Han*, Meng Fang, Ji Liu, Tianhong Dai, Dacheng Tao. (* equal contribution)
    LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning. [PDF]
    In: The Thirty-third Annual Conference on Neural Information Processing Systems (NeurIPS), 2019.

  • [25] Meng Fang, Tianyi Zhou, Yali Du, Lei Han, Zhengyou Zhang.
    Curriculum-guided Hindsight Experience Replay. [PDF]
    In: The Thirty-third Annual Conference on Neural Information Processing Systems (NeurIPS), 2019.

  • [24] Lei Han*, Peng Sun*, Yali Du*, Jiechao Xiong, Qing Wang, Xinghai Sun, Han Liu, Tong Zhang. (* equal contribution)
    Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI. [PDF][Supplementary Material][demo video]
    In: The Thirty-sixth International Conference on Machine Learning (ICML), 2019.

  • [23] Yu Zhang and Lei Han.
    Learning (from) Deep Hierarchical Structure among Features. [PDF][Link]
    In: The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), 2019.

2018

  • [22] Qing Wang, Jiechao Xiong, Lei Han, Peng Sun, Han Liu and Tong Zhang.
    Exponentially Weighted Imitation Learning for Batched Historical Data. [PDF][Link]
    In: The Thirty-second Annual Conference on Neural Information Processing Systems (NeurIPS), 2018.

  • [21] Lei Han, Yiheng Huang and Tong Zhang.
    Candidates vs Noises Estimation for Large Multi-Class Classification Problem. [PDF][Link][arxiv]
    In: The 35th International Conference on Machine Learning (ICML), 2018. (Long Presentation)

  • [20] Lei Han, Lei Li, Feng Wen, Lei Zhong, Tong Zhang and Xiu-Feng Wan.
    Graph-Guided Multi-Task Sparse Learning Model: a Method for Identifying Antigenic Variants of Influenza A(H3N2) Virus. [PDF]
    In: Bioinformatics, 2018.

  • [19] Dong Dai, Lei Han, Ting Yang and Tong Zhang.
    Bayesian Model Averaging with Exponentiated Least Squares Loss. [Link][arxiv]
    In: IEEE Transactions on Information Theory (TIT), 2018.

  • [18] Sichen Du, Guojie Song, Lei Han and Haikun Hong.
    Temporal Causal Inference with Time Lag. [Link]
    In: Neural Computation 30 (1), 271-291, 2018.

2016

  • [17] Lei Han, Yu Zhang, Xiu-Feng Wan and Tong Zhang.
    Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data. [PDF][Link][Supplementary Material][Code]
    In: Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, USA, 2016. (Acceptance Rate = 18.1%. Full presentation with acceptance rate = 8.9%)

  • [16] Lei Han*, Yu Zhang* and Tong Zhang (*equal contribution)
    Fast Component Pursuit for Large-Scale Inverse Covariance Estimation. [PDF][Link]
    In: Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, USA, 2016. (Acceptance Rate = 18.1%)

  • [15] Lei Han and Yu Zhang. (Both authors contributed equally)
    Reduction Techniques for Graph-based Convex Clustering. [PDF][Link][Supplementary Material]
    In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI), Phoenix, Arizona, USA, 2016. (Acceptance Rate = 26%)

  • [14] Lei Han and Yu Zhang. (Both authors contributed equally)
    Multi-Stage Multi-Task Learning with Reduced Rank. [PDF][Link][Supplementary Material]
    In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI), Phoenix, Arizona, USA, 2016. (Acceptance Rate = 26%)

  • [13] Lei Li, Lei Han and Xiu-Feng Wan.
    Identification of glycosylation sites and mutations determining antigenic drift events for influenza A viruses using sparse group lasso regression. [Link]
    In: Glycobiology 26 (12), 1393-1394, 2016.

  • [12] Xiabing Zhou, Xingxing Xing, Lei Han, Haikun Hong, Kaigui Bian, Kunqing Xie.
    Structure feature learning method for incomplete data. [PDF][Link][Supplementary Material]
    In: International Journal of Pattern Recognition and Artificial Intelligence 30 (9): 1660007, 2016.

2015

  • [11] Lei Han and Yu Zhang. (Both authors contributed equally)
    Learning Tree Structure in Multi-Task Learning. [PDF][Link][Supplementary Material][Code]
    In: Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Sydney, 2015. (Acceptance Rate = 19%)

  • [10] Xiabing Zhou, Lei Han, Xingxing Xing, Haikun Hong, Wenhao Huang, Kaigui Bian and Kunqing Xie.
    Incorporating temporal smoothness and group structure in learning with incomplete data. [Link]
    In: Proceedings of 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), 2015.

  • [9] Ye Liu, Liqiang Nie, Lei Han, Luming Zhang, David Rosenblum.
    Action2Activity: Recognizing Complex Activities from Sensor Data. [Link]
    In: International Joint Conference on Artificial Intelligence (IJCAI), 2015. (Acceptance Rate = 28.8%)

  • [8] Guojie Song, Lei Han* and Kunqing Xie. (* The corresponding author; the first two authors contributed equally)
    Overlapping Decomposition for Gaussian Graphical Modeling. [PDF][Link]
    In: IEEE Transactions on Knowledge and Data Engineering (TKDE), 2015. (An improved version of the conference paper appeared in KDD2012)

  • [7] Lei Han and Yu Zhang. (Both authors contributed equally)
    Learning Multi-Level Task Groups in Multi-Task Learning. [PDF][Link][Supplementary Material][Code]
    In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI), Austin, Texas, USA, 2015. (Acceptance Rate = 26.7%)

  • [6] Lei Han and Yu Zhang. (Both authors contributed equally)
    Discriminative Feature Grouping. [PDF][Link][Code]
    In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI), Austin, Texas, USA, 2015. (Acceptance Rate = 26.7%)

2014 and Earlier

  • [5] Lei Han, Yu Zhang, Guojie Song and Kunqing Xie.
    Encoding tree-sparsity in Multi-Task Learning: A Probabilistic Framework. [PDF][Link]
    In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI), Quebec City, Quebec, Canada, 2014. (Acceptance Rate = 28%)

  • [4] Lei Han, Guojie Song, Gao Cong and Kunqing Xie.
    Overlapping Decomposition for Causal Graphical Modeling. [PDF][Link]
    In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge Discovery and Data Mining (KDD), Beijing, China, 2012. (Acceptance Rate = 18%)

  • [3] Lei Han, Kunqing Xie and Guojie Song.
    Adaptive Fit Parameters Tuning with Data Density Changes in Locally Weighted Learning. [PDF]
    In: Proceedings of the 7th International Symposium on Neural Networks (ISNN), Shanghai, China, 2010.

  • [2] Lei Han, Jianying Wu, Ping Gu, Kunqing Xie, Guojie Song, Shiwei Tang, Dongqing Yang, Bingli Jiao and Feng Gao.
    Adaptive Knowledge Transfer based on Locally Weighted Learning. [PDF]
    In: Proceedings of the Conference on Technologies and Applications of Artificial Intelligence (TAAI), Hsinchu, Taiwan, 2010.

  • [1] Lei Han, Meng Shuai, Kunqing Xie, Guojie Song and Xiujun Ma.
    Locally Kernel Regression Adapting with Data Distribution in Prediction of Traffic Flow. [PDF]
    In: Proceedings of the 18th International Conference on Geoinformatics (Geoinformatics), Beijing, China, 2010.

Ph.D. Thesis (In Chinese)

  • Lei Han. Traffic Network based Multi-Task Learning Method. EECS, Peking University, July, 2014.

Activities

  • Journal Reviewer:
    IEEE Transactions on Knowledge and Data Engineering (TKDE)
    IEEE Transactions on Intelligent Transportation Systems (TITS)
    Journal of Machine Learning Research (JMLR)
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
    Neurocomputing

  • PC Member / Reviewer:
    AAAI: 2016-2020
    NeurIPS: 2016-Now
    ICML: 2018-Now
    ICLR: 2018-Now

  • Senior PC Member:
    IJCAI: 2020, 2021

Students

  • Yali Du (Assistant Professor, King’s College London. Internship at Tencent AI Lab, 2018-2019)

  • Shuai Li (Assistant Professor, Shanghai Jiao Tong University. Internship at Tencent AI Lab, 2018-2019)

  • Chenjia Bai (Researcher, Shanghai AI Laboratory. Internship at Tencent Robotics X Lab, 2020)

  • Rui Yang (Ph.D., HKUST. Internship at Tencent Robotics X Lab, 2021-2022)

  • Honghua Dong (Ph.D., University of Toronto. Internship at Tencent Robotics X Lab, 2021-2022)

  • Jiawei Xu (Ph.D., CUHK. Internship at Tencent Robotics X Lab, 2021-2022)

  • Hao Sun (Ph.D., CUHK. Internship at Tencent Robotics X Lab, 2021)

  • Shuxing Li (Master, Tsinghua University. Internship at Tencent Robotics X Lab, 2021-2022)

  • Yingru Li (Ph.D., CUHK. Internship at Tencent Robotics X Lab, 2020-2021)

Honors and Awards

  • The Best Dissertation Award of the Chinese Association for Artificial Intelligence (CAAI) (中国人工智能学会优博), 2016. [Link]

  • Outstanding Ph.D. Graduate Award in Beijing, 2014

  • Outstanding Ph.D. Graduate Award in Peking University, 2014

  • President Scholarship (the highest Scholarship in Peking University), 2013-2014

  • President Scholarship (the highest Scholarship in Peking University), 2012-2013

  • Chinese National Scholarship (selected from top Ph.D. students in China), 2011-2012

  • President Scholarship (the highest Scholarship in Peking University), 2011-2012

  • President Scholarship (the highest Scholarship in Peking University), 2010-2011

  • Merit Student Award (selected from top postgraduates in Peking University), 2010-2011

  • President Scholarship (the highest Scholarship in Peking University), 2009-2010

  • Merit Student Award (selected from top postgraduates in Peking University), 2009-2010