ERL adds a reflection step to reinforcement learning: attempt, feedback, explanation, refined attempt. The result: faster learning, higher reward, same inference cost.
ERL adds a reflection step to reinforcement learning: attempt, feedback, explanation, refined attempt. The result: faster learning, higher reward, same inference cost.