AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

Written on February 18, 2024