Research

	Grid Cells and Recurrent Neural Networks Supervisor: Xue-Xin Wei, University of Texas at Austin Current position. Working on RNN models of grid cell formation. Studying the relationship between CANN and RNN models of grid representations. [Thesis-21]
	Evaluating Adversarial Robustness in Simulated Cerebellum Supervisor: Bo Li, Qifeng Chen, HKUST Built the Marr model of the cerebellum with modern deep learning practice. Evaluated its adversarial robustness on multiple aspects (network width, LTD and sparse connectivity). [NeurIPS-Preregister-20]
	RL in Sparse Reward Environments with Memory Graph Planning Supervisor: Changshui Zhang, Tsinghua University Built an external topological memory for goal-conditioned DQN. Solved the Montezuma's Revenge with high sample efficiency (2500+ points in less than 3M timesteps). [Thesis-19, Video]
	Visual Attention and Deep Reinforment Learning Supervisor: Dana H. Ballard, University of Texas at Austin Combined human visual selective attention model with A2C agent in Atari games. Visualized the deep RL agent behaviors in details. [arXiv-18]
	Reinforcement Learning: An introduction (2nd Edition, Chineses Version) Supervisor: Kai Yu, Shanghai Jiao Tong University Proofread the Chinese translation of chapter 5, 6, 7, 8, 12 and 13. [Book]
	Humanoid Kicking Trajectory Planning, RoboCup 2017 Nagoya Supervisor: Mingguo Zhao, Tsinghua University Planned kicking trajectories of an adult size humanoid (1.4m) with cubic spline and inverse dynamics. My work (high kick) helped our team win 2nd place in the Technical Challenge.