I received a master's degree in Computer Science from University of Southern California. I received a bachelor's degree in Computer Science from Harbin Institute of Technology.
Trained policy based on self-built simulator quad-swarm-rl , by using deep reinforcement learning and using sim-to-real transfer to deploy the policy on the real world quadrotors.
Sample Factory is the fastest open source single-machine RL implementations (see paper for details). If you plan to train RL agents on large amounts of experience, consider using it. Sample Factory can significantly speed up the experimentation or allow you to collect more samples in the same amount of time and achieve better performance.