Inspired by the success of reinforcement learning (RL) in refining large language models (LLMs), we propose AR-GRPO, an approach to integrate online RL training into autoregressive (AR) image ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results