Grid Cells and Recurrent Neural Networks
Supervisor: Xue-Xin Wei, University of Texas at Austin

  • Current position. Working on RNN models of grid cell formation.
  • Studying the relationship between CANN and RNN models of grid representations.
[Thesis-21]

Evaluating Adversarial Robustness in Simulated Cerebellum
Supervisor: Bo Li, Qifeng Chen, HKUST

  • Built the Marr model of the cerebellum with modern deep learning practice.
  • Evaluated its adversarial robustness on multiple aspects (network width, LTD and sparse connectivity).
[NeurIPS-Preregister-20]

RL in Sparse Reward Environments with Memory Graph Planning
Supervisor: Changshui Zhang, Tsinghua University

  • Built an external topological memory for goal-conditioned DQN.
  • Solved the Montezuma's Revenge with high sample efficiency (2500+ points in less than 3M timesteps).
[Thesis-19, Video]

Visual Attention and Deep Reinforment Learning
Supervisor: Dana H. Ballard, University of Texas at Austin

  • Combined human visual selective attention model with A2C agent in Atari games.
  • Visualized the deep RL agent behaviors in details.
[arXiv-18]

Reinforcement Learning: An introduction (2nd Edition, Chineses Version)
Supervisor: Kai Yu, Shanghai Jiao Tong University

  • Proofread the Chinese translation of chapter 5, 6, 7, 8, 12 and 13.
[Book]

Humanoid Kicking Trajectory Planning, RoboCup 2017 Nagoya
Supervisor: Mingguo Zhao, Tsinghua University

  • Planned kicking trajectories of an adult size humanoid (1.4m) with cubic spline and inverse dynamics.
  • My work (high kick) helped our team win 2nd place in the Technical Challenge.