Posts
All the articles I've posted.
Neural Scaling Laws (Then and Now)
Published: at 04:07 AM in 12 min readScaling laws have been a cornerstone of deep learning research for decades. In this post, we'll explore the history of scaling laws in deep learning, and how they've evolved over time.
Feeding the Beast - Data Loading Secrets for Hungry Neural Networks
Published: at 02:53 AM in 11 min readData loading is a critical part of training deep learning models. In this post, we'll explore the best practices for loading data into neural networks, with a focus on PyTorch.
Large-Scale Neural Network Training
Published: at 01:58 AM in 23 min readLarge-scale neural network training is a critical component of modern machine learning workflows. This post provides an overview of the challenges and solutions for training deep learning models at scale.
Can AI Write about AI Personality With Personality?
Published: at 07:33 AM in 31 min readShould AI have "Personality" or will "Bland and Mid" suffice?
o1 and Reasoning
Published: at 12:53 AM in 62 min readOrganizing research presumably related to OpenAI's o1 reasoning model.
All you need (to know) about attention
Published: at 02:35 AM in 34 min readA highly compressed quide to quickly refresh (mostly) all you need to know about attention mechanisms in deep learning.