May 26, 2026 Released Trust Region Q Adjoint Matching (TRQAM), a stable off-policy RL algorithm for pretrained flow policies. arXiv · blog · code May 01, 2026 Co-authored RLDX-1 Technical Report released. arXiv · code