Tag: reasoning

All the articles with the tag "reasoning".

Multistep Reasoning Agents (with GRPO & RLEF) - Project Euler Edition
Published: Mar 23, 2025 at 02:38 AM (21 min read)
Converting chat LLMs into reasoning agents through GRPO and reinforcement learning from execution feedback (RLEF), improving performance on Project Euler's algorithmic challenges with multi-step reasoning, tool use, and code execution.
o3 and the Future of Science
Published: Dec 23, 2024 at 10:04 PM (23 min read)
o3 and Human-AI scientific collaboration
o1 and Reasoning
Published: Nov 20, 2024 at 12:53 AM (62 min read)
Organizing research presumably related to OpenAI's o1 reasoning model.
