Three visions of 2028, the Industrial Revolution's unfinished lessons, and what the window before superintelligence demands of engineers today.
Cloud object storage can be expensive for CDC at scale. This post explores cost-saving alternatives: challenger storage providers with radically different pricing, and the role caching plays under Zipf access patterns to drive costs down further.
CDC chunks are the right logical unit for deduplication, but storing them as individual objects is prohibitively expensive. This post explores containers, the storage abstraction that makes CDC viable at scale, and the fragmentation, garbage collection, and restore challenges they introduce.
See CDC-based deduplication in action, learn where CDC is deployed today, and explore the frontier of structure-aware chunking for source code.
An exploration of FastCDC's Gear hash, normalized chunking with dual masks, and the 2020 two-byte-per-iteration optimization, with code in pseudocode, Rust, and TypeScript.
An introduction to content-defined chunking: why fixed-size splitting fails, how content-aware boundaries solve the deduplication problem, and a taxonomy of three CDC algorithm families.
How a simple grid of lines evolved into an organic, wind-driven canvas animation through iterative prompting.
10 years. 10 lessons. The most important things I learned building developer tools at GitHub.
Recursive promises, a Y-combinator, and an AI-generated poem walk into a browser—this post is what happens next.
An exploration of how Ruby's inject and zip methods can be implemented using nothing but anonymous functions and the Y-combinator. A silly, elegant, mind-bending combinator journey that is slower than you'd like.