Welcome to Ando's Logs

👋 Hi - I'm Alex Andonian (aka Ando) and these are my personal "training" logs.

As a deep learning researcher, I spend a lot of time inspecting data and debugging training logs. Recently, I wondered why I don't do the same for my own neural network. So here we are!

Jokes aside, I'm hoping this will become a growing collection of articles, tutorials, and guides about deep learning, AI, programming, and technology. By documenting my own learning trajectories, I hope it can help others (human or otherwise) on their fine-tuning journeys.

Featured

Multistep Reasoning Agents (with GRPO & RLEF) - Project Euler Edition
Published:Mar 23, 2025 at 02:38 AM in 21 min read
Converting chat LLMs into reasoning agents through GRPO and reinforcement learning from execution feedback — achieving performance improvements on Project Euler's algorithmic challenges with multi-step reasoning, tool use, and code execution.
Bridging the Knowledge Gap in Multimodal AI with DeepResearch
Published:Mar 2, 2025 at 04:27 AM in 19 min read
How OpenAI's DeepResearch reveals what industry labs aren't sharing about next-gen AI assistants
o3 and the Future of Science
Published:Dec 23, 2024 at 10:04 PM in 23 min read
o3 and Human-AI scientific collaboration
Back to Backprop
Published:Dec 12, 2024 at 06:42 AM in 29 min read
A review of backpropagation, the workhorse of deep learning.
Can AI Write about AI Personality With Personality?
Published:Nov 23, 2024 at 07:33 AM in 31 min read
Should AI have "Personality" or will "Bland and Mid" suffice?
o1 and Reasoning
Published:Nov 20, 2024 at 12:53 AM in 62 min read
Organizing research presumably related to OpenAI's o1 reasoning model.
All you need (to know) about attention
Published:Nov 4, 2024 at 02:35 AM in 35 min read
A highly compressed guide to quickly refresh (mostly) all you need to know about attention mechanisms in deep learning.

Welcome to Ando's Logs

Featured

Multistep Reasoning Agents (with GRPO & RLEF) - Project Euler Edition

Bridging the Knowledge Gap in Multimodal AI with DeepResearch

o3 and the Future of Science

Back to Backprop

Can AI Write about AI Personality With Personality?

o1 and Reasoning

All you need (to know) about attention

Recent Posts

Swirling Thoughts on AI, LLMs, Novel Hypothesis Generation and Crypto?

Pandas Primer

Python and Pandas and P-values, Oh My! A Statistical Journey

A Straight Line Through Linear Algebra