Posts
All the articles I've posted.
Feeding the Beast - Data Loading Secrets for Hungry Neural Networks
Published: at 02:53 AM in 11 min read
Data loading is a critical part of training deep learning models. In this post, we'll explore the best practices for loading data into neural networks, with a focus on PyTorch.
Large-Scale Neural Network Training
Published: at 01:58 AM in 23 min read
Large-scale neural network training is a critical component of modern machine learning workflows. This post provides an overview of the challenges and solutions for training deep learning models at scale.
Can AI Write about AI Personality With Personality?
Published: at 07:33 AM in 31 min read
Should AI have "Personality" or will "Bland and Mid" suffice?
o1 and Reasoning
Published: at 12:53 AM in 62 min read
Organizing research presumably related to OpenAI's o1 reasoning model.
All you need (to know) about attention
Published: at 02:35 AM in 34 min read
A highly compressed guide to quickly refresh (mostly) all you need to know about attention mechanisms in deep learning.
Byte Pair Encoding Tokenizer
Published: at 01:37 AM in 6 min read
An in-depth exploration of the Byte Pair Encoding (BPE) tokenizer, its mechanics, and its applications in natural language processing.