Recent Reads
Papers
Not all AI-assisted programming is vibe coding (but vibe coding rocks).
A really cool article that aligns closely with my views on the current misuse of AI tools in software development. It shows how misusing LLMs such as ChatGPT, Claude, and Gemini for coding can hinder a project's development. If anything, I believe the article overlooks another aspect that matters to young SWEs who, like me, are just starting their careers: the learning aspect.
MapReduce - Simplified Data Processing on Large Clusters
MapReduce is Google's approach to big data processing, shaped by Google's own technological context, where fault tolerance on commodity machines was a priority. That makes MapReduce a great fit for data processing over a distributed cluster. A great read for a real-life application of distributed systems concepts such as fault tolerance, task scheduling, and data locality.
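To make the programming model concrete, here is a minimal single-process sketch of the classic word-count example. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are my own and purely illustrative; nothing here is distributed or fault-tolerant, which is exactly the part the real system handles for you.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group intermediate values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: sum all the counts emitted for one word."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce sequentially in a single process."""
    intermediate = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle(intermediate)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

In a real cluster, the map and reduce calls would run on many machines, and a failed task would simply be re-executed elsewhere.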
The Google File System
A look at one of the more complex topics in distributed systems: distributed file systems (DFS). The paper discusses Google's approach to building a DFS that hundreds of clients can access at the same time. Rather than a separate lock service such as Chubby (which shows up later, in the Bigtable paper), GFS as described here relies on a single master that manages metadata and grants chunk leases. A great paper for better understanding how a DFS functions.
Bigtable - A Distributed Storage System for Structured Data
Bigtable is Google's take on a distributed storage system that is highly reliable, scalable, and available, with a distinctive data model: a sparse, sorted map indexed by row key, column key, and timestamp. Google builds it on its own infrastructure, including GFS and Chubby. A great read for getting into distributed systems.
Autoregressive Image Generation using Residual Quantization
A paper that builds on the common approach of compressing high-dimensional data, such as embeddings, into codebook indices. The authors use a technique called residual quantization, which keeps the codebook at a fixed size and recursively quantizes the remaining residual of the feature map in a coarse-to-fine manner.
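The coarse-to-fine idea can be sketched in a few lines for a single vector. This is my own toy version, not the paper's implementation: it assumes one small hand-written codebook shared across all depths, uses squared Euclidean distance, and the names `nearest_code` and `residual_quantize` are hypothetical.

```python
def nearest_code(vector, codebook):
    """Return the index of the codebook entry closest to `vector`."""
    def squared_distance(code):
        return sum((v - c) ** 2 for v, c in zip(vector, code))

    return min(range(len(codebook)), key=lambda i: squared_distance(codebook[i]))


def residual_quantize(vector, codebook, depth):
    """Quantize `vector` as a sum of `depth` codes, coarse to fine.

    Each step picks the code nearest to what is still unexplained
    (the residual), then subtracts that code from the residual.
    """
    residual = list(vector)
    approximation = [0.0] * len(vector)
    codes = []
    for _ in range(depth):
        index = nearest_code(residual, codebook)
        code = codebook[index]
        codes.append(index)
        approximation = [a + c for a, c in zip(approximation, code)]
        residual = [r - c for r, c in zip(residual, code)]
    return codes, approximation
```

The codebook size never grows; extra precision comes from stacking more indices per vector rather than from adding more codebook entries.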
AudioPaLM - A Large Language Model That Can Speak and Listen
A cool paper I came across at random, and a very cool application of NLP to build an audio-to-audio model. From what I understand, it is a foundational model for LLMs that can understand and generate speech. In terms of contemporary research it is a bit old (2023), but foundational nonetheless!
Accelerating Particle and Fluid Simulations with Differentiable Graph Networks for Solving Forward and Inverse Problems
A very interesting paper on applications of automatic differentiation outside the domain of machine learning. It is a bit of a hard read, especially for someone a little out of touch with the CFD community. Nevertheless, it is a great read!
Automatic Differentiation in Machine Learning - a Survey
The paper explores automatic (or algorithmic) differentiation (AD), an essential technology in contemporary machine learning models. It surveys a variety of ways to implement it and provides useful examples.
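As a taste of one of the surveyed approaches, here is a minimal sketch of forward-mode AD using dual numbers. It is my own toy version, not code from the paper: the `Dual` class and the `sin` and `derivative` helpers are hypothetical names, and only addition, multiplication, and sine are supported.

```python
import math


class Dual:
    """A value paired with its derivative with respect to one input."""

    def __init__(self, value, deriv=0.0):
        self.value = value
        self.deriv = deriv

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.value + other.value, self.deriv + other.deriv)

    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # Product rule: (uv)' = u'v + uv'
        return Dual(
            self.value * other.value,
            self.deriv * other.value + self.value * other.deriv,
        )

    __rmul__ = __mul__


def sin(x):
    """Sine lifted to dual numbers via the chain rule."""
    return Dual(math.sin(x.value), math.cos(x.value) * x.deriv)


def derivative(f, x):
    """Evaluate df/dx at x by seeding the input derivative with 1."""
    return f(Dual(x, 1.0)).deriv
```

For example, `derivative(lambda x: x * x + 3 * x, 2.0)` propagates the derivative alongside the value in a single pass, with no symbolic algebra and no finite-difference error.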
Attention is all you need.
This revolutionary paper introduces the Transformer, an architecture built entirely on attention, one of the core mechanisms behind the now renowned GPT LLMs from OpenAI. A great read for understanding how a transformer (the 'T' in 'GPT') functions.
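The core operation of the paper, scaled dot-product attention, computes softmax(QK^T / sqrt(d_k)) V. Below is a rough pure-Python sketch for small lists of vectors. It is only meant to show the arithmetic: it has no batching, masking, multiple heads, or learned projections, and the function names are my own.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        outputs.append(
            [
                sum(w * v[j] for w, v in zip(weights, values))
                for j in range(len(values[0]))
            ]
        )
    return outputs
```

Each output row is a weighted average of the value vectors, where the weights reflect how strongly that query matches each key.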
Books
Distributed Systems
A great introductory book on the highly complex world of distributed systems. The textbook describes in great detail all the building blocks for creating a highly reliable, scalable, and fault-tolerant distributed system.
No Longer Human
A thoroughly existentialist book that follows the life of a person who never felt they really fit in or understood human thoughts and actions, providing a very unique perspective on how society functions.
Pursuit of Happiness
A beautiful book about the search for happiness, depicting a 'classic' rags-to-riches story of a single dad who goes from broker to entrepreneur. Its discussions of family trauma and crisis are unforgettable. Must read!