Tag: reasoning
All the articles with the tag "reasoning".
Multistep Reasoning Agents (with GRPO & RLEF) - Project Euler Edition
Published: at 02:38 AM in 21 min readConverting chat LLMs into reasoning agents through GRPO and reinforcement learning from execution feedback — achieving performance improvements on Project Euler's algorithmic challenges with multi-step reasoning, tool use, and code execution.
o3 and the Future of Science
Published: at 10:04 PM in 23 min reado3 and Human-AI scientific collaboration
o1 and Reasoning
Published: at 12:53 AM in 62 min readOrganizing research presumably related to OpenAI's o1 reasoning model.