Research Projects

Here are some of my past research projects in reinforcement learning, robotics, and AI applications.


Humanoid Robot Motion Imitation

Developed a control strategy for the humanoid robot G1 using imitation learning, where a well-designed reward function enables the robot to accurately replicate the dynamic motion capture data of professional dancers.

Watch the demo:


Bipedal Robot Locomotion using Reinforcement Learning

Implemented a PPO-based stable walking algorithm for the bipedal robot Hector in Isaac Gym, utilizing an asymmetric actor-critic approach.

Watch the demo:


Resource Allocation and Scheduling in IoT & Edge Computing

Designed a deep reinforcement learning algorithm for efficient microservice scheduling in edge computing environments, balancing latency and resource constraints.

System Architecture: IoT Resource Scheduling - System Architecture

Time Slot Design: IoT Resource Scheduling - Time Slot Optimization


YOLO-based Smoking Detection

Personal Project(for fun)
Developed a real-time smoking behavior detection system using YOLO object detection model. The system can accurately detect and classify smoking activities in various environments.

Watch the detection demo: