RL Optimization PPO Algorithm

Optimization of Airline Scheduling Using Reinforcement Learning with PPO Algorithm

Abstract: The Airline Scheduling Problem (ASP) has significant economic and operational value in air trans portation management. However, its complexity and dynamics make traditional mixed integer ...

GitHub

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

📢 If you also engaged in the research of MDLMs or RL, we welcome your suggestions. And feel free to create an issue, when you have any questions about the code. If you are interested in our work, ...

GitHub

RuntimeError: CUDA error: misaligned address

(WorkerDict pid=2862157) [rank3]:[E923 11:14:11.615370309 ProcessGroupNCCL.cpp:1895] [PG ID 0 PG GUID 0(default_pg) Rank 3] Process group watchdog thread terminated with exception: CUDA error: ...

IEEE

Intelligent Quadruped Robot Locomotion using PPO Agent

Abstract: This paper presents the application of the Proximal Policy Optimization (PPO) algorithm in Reinforcement Learning (RL) for autonomous locomotion in quadruped robots. These robots are ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results