Tag: ai-coding

All the articles with the tag "ai-coding".

Multistep Reasoning Agents (with GRPO & RLEF) - Project Euler Edition
Published:Mar 23, 2025 at 02:38 AM in 21 min read
Converting chat LLMs into reasoning agents through GRPO and reinforcement learning from execution feedback — achieving performance improvements on Project Euler's algorithmic challenges with multi-step reasoning, tool use, and code execution.

Multistep Reasoning Agents (with GRPO & RLEF) - Project Euler Edition