So much working, reading, thinking, living to do! A lifetime is not long enough.
Sylvia Plath
A Deep Q-Network learns to play Enduro
02 November 2025
Implementing the first Deepmind papers on deep reinforcement learning and realizing that the Sutton-Barto textbook is a masterpiece in technical pedagogy.
330 September 2025
Why does BatchNorm work?
5Forward-mode and reverse-mode autodiff in python featuring dual numbers and graph traversals
11Sums of random variables and convolutions
24 April 2025
A note on how Gaussians are convolved to make the reparameterization trick work in the diffusion forward process.
1205 March 2025
Andrew Barto and Richard Sutton for pioneering Reinforcement Learning
20Some known problems in text to image generation
06 January 2025
Learning about where diffusion models break and collecting attempts at solutions
21Distilling Stockfish into a Transformer
29 October 2024
My thoughts on the "Grandmaster-Level Chess Without Search" paper from Google DeepMind
27DreamCoder and Neural Program Induction
21 October 2024
Notes and some ideas while reading the DreamCoder paper
29Dimensional Analysis on Loss Functions
20 February 2024
A half-baked idea I had while checking my equations
53Learning speech representations by combining CL and MLM
28 December 2023
Paper reading notes from w2v-BERT by Google Brain
62Segmentation model outputs as prompts to SAM
11 November 2023
Some ideas on improving semantic segmentation with SAM
65Seam Carving — Energy Function
01 September 2023
Live Coding Seam Carving Part 2
76Intermediate Activations in Llama 2.7B
13 August 2023
there is a country layer in the llama 2 transformer
7917 July 2023
Three implementations of self attention, one for visualizing the self attention between two tensors, one for language, and one in PyTorch
88Einstein Summation Notation Exercises
13 July 2023
Exercises in faster and more intuitive tensor notation for deep learning.
89Visualizing and Working with Audio Files
27 December 2022
Helper code that has different ways to visualize audio files, mostly for future reference.
102Root Finding with Newton's Method
18 December 2022
Testing whether Newton's method decays updates exponentially
104Seam Carving — Intuition and Ideas
30 October 2022
Live Coding Seam Carving Part 1
107Traffic event description using Unsupervised Learning Methods
10 December 2020
13th International (Online) Conference on Transportation Planning and Implementation Methodologies for Developing Countries
13806 June 2020
Official Work Product Submission for the 2020 edition of Google Summer of Code with TensorFlow.
14231 January 2020
Hardcore History, Kobe Bryant and new adventures.
150