Posts
All the articles I've posted.
Large-Scale Neural Network Training
Published: at 01:58 AM in 23 min read
Large-scale neural network training is a critical component of modern machine learning workflows. This post provides an overview of the challenges and solutions for training deep learning models at scale.
Can AI Write about AI Personality With Personality?
Published: at 07:33 AM in 30 min read
Should AI have "Personality" or will "Bland and Mid" suffice?
o1 and Reasoning
Published: at 12:53 AM in 62 min read
Organizing research presumably related to OpenAI's o1 reasoning model.
All you need (to know) about attention
Published: at 02:35 AM in 35 min read
A highly compressed guide to quickly refresh (mostly) all you need to know about attention mechanisms in deep learning.
Byte Pair Encoding Tokenizer
Published: at 01:37 AM in 6 min read
An in-depth exploration of the Byte Pair Encoding (BPE) tokenizer, its mechanics, and its applications in natural language processing.
Automating App Deployment on a VPS with GitHub Actions
Published: at 03:47 AM in 8 min read
Learn how to automate your VPS application deployment process using GitHub Actions for a seamless CI/CD experience.