A deep dive into Write-Ahead Logs (WAL) - the fundamental technique that ensures data durability in distributed systems, databases, and streaming platforms.
How readiness-based I/O (epoll/kqueue) lets C servers scale: level vs edge triggering, drain-until-EAGAIN, fair scheduling, timers, and backpressure—without 3 a.m. incidents.
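A minimal sketch of the drain-until-EAGAIN part, assuming an edge-triggered (EPOLLET) notification on a nonblocking fd; the `drain_fd` helper and its `handle_bytes` callback are illustrative names, not a fixed API.

```c
/* Sketch: after an EPOLLET wakeup, keep reading until EAGAIN or the
 * remaining bytes are silently stranded until the peer sends more. */
#include <errno.h>
#include <stdbool.h>
#include <sys/types.h>
#include <unistd.h>

/* Returns false if the peer closed the connection or a fatal error hit. */
static bool drain_fd(int fd, void (*handle_bytes)(const char *, ssize_t))
{
    char buf[4096];
    for (;;) {
        ssize_t n = read(fd, buf, sizeof buf);
        if (n > 0) {
            handle_bytes(buf, n);   /* consume what arrived */
            continue;               /* keep reading: ET won't re-notify */
        }
        if (n == 0)
            return false;           /* peer closed */
        if (errno == EAGAIN || errno == EWOULDBLOCK)
            return true;            /* fully drained; wait for the next event */
        if (errno == EINTR)
            continue;               /* interrupted by a signal: retry */
        return false;               /* real error: let the caller tear down */
    }
}
```

A real loop would also cap how many bytes one fd may drain per wakeup so a chatty peer cannot starve the others, which is where the fair-scheduling concern comes in.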
How the Linux page cache actually works, what mmap buys you over read/write, where readahead and writeback help (or hurt), and when O_DIRECT is the right tool—not the default.
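As a small taste of steering the page cache from userland, here is a sketch that wraps a one-pass sequential scan in `posix_fadvise` hints; the helper name and buffer size are illustrative.

```c
/* Sketch: hint readahead before a sequential scan, then drop the pages
 * afterwards so one-pass data doesn't evict hotter cache contents. */
#include <fcntl.h>
#include <sys/types.h>
#include <unistd.h>

int scan_file(const char *path)
{
    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;

    /* Tell the kernel we will read sequentially so readahead ramps up. */
    posix_fadvise(fd, 0, 0, POSIX_FADV_SEQUENTIAL);

    char buf[1 << 16];
    ssize_t n;
    while ((n = read(fd, buf, sizeof buf)) > 0) {
        /* ...process buf[0..n)... */
    }

    /* One-pass data: ask the kernel to drop it from the page cache. */
    posix_fadvise(fd, 0, 0, POSIX_FADV_DONTNEED);
    close(fd);
    return n < 0 ? -1 : 0;
}
```

O_DIRECT would bypass this machinery entirely, which is exactly why it should be an explicit choice rather than the default.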
Implement lock-free stacks/queues in C and reclaim memory safely: ABA, tagged pointers, hazard pointers, and epoch-based reclamation—what they are, when they win, and how to use them without footguns.
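A minimal sketch of the tagged-pointer idea against ABA, assuming a Treiber stack: the head carries a generation counter that bumps on every pop. Safe reclamation of popped nodes (hazard pointers or epochs) is deliberately out of scope here, and the 16-byte atomic head may fall back to a lock on platforms without a wide CAS.

```c
/* Sketch: a Treiber stack whose head is (pointer, tag) so a CAS cannot be
 * fooled by a node being freed and a look-alike reused at the same address. */
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

struct node { struct node *next; int value; };

struct head { struct node *ptr; uintptr_t tag; };  /* tag bumps on every pop */

static _Atomic struct head top;   /* may be lock-backed if 16-byte CAS is
                                     unavailable; semantics stay correct */

void push(struct node *n)
{
    struct head old = atomic_load(&top), new;
    do {
        n->next = old.ptr;
        new.ptr = n;
        new.tag = old.tag;            /* tag unchanged on push */
    } while (!atomic_compare_exchange_weak(&top, &old, new));
}

struct node *pop(void)
{
    struct head old = atomic_load(&top), new;
    do {
        if (old.ptr == NULL)
            return NULL;
        new.ptr = old.ptr->next;      /* only safe if old.ptr can't be freed
                                         concurrently: see hazard pointers */
        new.tag = old.tag + 1;        /* version bump defeats ABA on the head */
    } while (!atomic_compare_exchange_weak(&top, &old, new));
    return old.ptr;
}
```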
Pick the right engine, then exploit Apache Arrow and expression pipelines to get faster, more memory-efficient DataFrames in Python—without painting yourself into a corner.
A production-first tour of CPython’s memory model: what refcounts really guarantee, how the cyclic GC works, why RSS doesn’t always go down, and how to reason about growth without guesswork. Part 1 lays out the mental model and refcounting truths.
A practical, low-overhead toolbox for Python performance: what to use (and when), how to keep overhead in single digits, and how to read profiles you can trust.
Make Python types carry their weight: combine static checking (mypy/pyright) with fast runtime validation (Pydantic v2) to turn annotations into contracts that prevent bugs, speed up onboarding, and keep hot paths fast.
The shortest safe path from Python to C-speed: a practical roadmap for choosing Cython, cffi, HPy, or the raw C API, with a minimal wheelable extension, packaging/ABI mental models, and performance guardrails you can apply today.
Build async services that stay responsive under load: apply backpressure with bounded queues, adopt structured concurrency, and make cancellation a contract with deadlines.
A practical guide to Python’s GIL today—where threads help, where they don’t—and what the emerging no-GIL path means for how you write concurrent code, tune performance, and build extensions.
Techniques for making concurrent C code reproducible: deterministic RNG and seeding, time and clock control, schedule capture, and record/replay so tests fail once and explain why.
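A minimal sketch of the deterministic-seeding piece: one logged master seed (read here from an assumed `TEST_SEED` environment variable) fans out to independent per-thread splitmix64 streams, so a failing run can be replayed with exactly the same random choices.

```c
/* Sketch: derive per-thread deterministic RNGs from a single test seed. */
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

struct rng { uint64_t state; };

/* splitmix64 step: deterministic, fast, good enough for test scheduling. */
static uint64_t rng_next(struct rng *r)
{
    uint64_t z = (r->state += 0x9E3779B97F4A7C15ULL);
    z = (z ^ (z >> 30)) * 0xBF58476D1CE4E5B9ULL;
    z = (z ^ (z >> 27)) * 0x94D049BB133111EBULL;
    return z ^ (z >> 31);
}

/* One master seed fans out to independent per-thread streams. */
static void rng_init(struct rng *r, uint64_t test_seed, uint64_t thread_id)
{
    r->state = test_seed ^ (thread_id * 0x9E3779B97F4A7C15ULL);
}

int main(void)
{
    const char *env = getenv("TEST_SEED");            /* replay knob */
    uint64_t seed = env ? strtoull(env, NULL, 0) : 42;
    fprintf(stderr, "TEST_SEED=%llu\n", (unsigned long long)seed);

    struct rng r;
    rng_init(&r, seed, 0);
    printf("%llu\n", (unsigned long long)rng_next(&r));
    return 0;
}
```

Logging the seed on every run is the cheap half of the technique: when a test fails, the log line is the reproduction recipe.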
Demystify CPython 3.11's specializing/adaptive bytecode interpreter—quickening, inline caches, and the patterns that help your code hit the fast path without changing a line.
Demystifying strict aliasing and effective types in C: what the standard actually says, how miscompiles happen, and the safe, high-performance patterns that keep the optimizer on your side.
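A minimal sketch of the standard-blessed alternative to pointer-cast punning: copy the bytes with `memcpy` and let the optimizer collapse it into a single move, instead of reading through a pointer of the wrong effective type.

```c
/* Sketch: inspect the bits of a float without violating strict aliasing. */
#include <stdint.h>
#include <string.h>

static uint32_t float_bits(float f)
{
    uint32_t u;
    _Static_assert(sizeof f == sizeof u, "float must be 32-bit here");
    memcpy(&u, &f, sizeof u);   /* compilers turn this into a single move */
    return u;
}

/* What NOT to do: reading through a pointer of the wrong effective type
 * is undefined behavior, and real compilers do miscompile it.
 *
 *   uint32_t bad = *(uint32_t *)&f;   // strict-aliasing violation
 */
```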
Apply streaming and event-driven patterns to build robust C backends: bound queues, honor deadlines, retry the right way, and make handlers idempotent so failure doesn’t cascade.
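A minimal sketch of the "bound queues" half: a fixed-capacity queue whose push can refuse work, so overload becomes explicit backpressure instead of unbounded memory growth. The names, the capacity, and the single static instance are illustrative.

```c
/* Sketch: bounded queue with load shedding; static pthread initializers
 * keep the example self-contained, QCAP is an arbitrary example capacity. */
#include <pthread.h>
#include <stdbool.h>

#define QCAP 1024

struct bounded_queue {
    void *items[QCAP];
    int head, tail, count;
    pthread_mutex_t mu;
    pthread_cond_t not_empty;
};

static struct bounded_queue work_q = {
    .mu = PTHREAD_MUTEX_INITIALIZER,
    .not_empty = PTHREAD_COND_INITIALIZER,
};

/* Returns false when full: the producer sheds load, replies "busy", or
 * retries later instead of queueing unbounded work. */
bool bq_try_push(struct bounded_queue *q, void *item)
{
    pthread_mutex_lock(&q->mu);
    if (q->count == QCAP) {              /* full: apply backpressure */
        pthread_mutex_unlock(&q->mu);
        return false;
    }
    q->items[q->tail] = item;
    q->tail = (q->tail + 1) % QCAP;
    q->count++;
    pthread_cond_signal(&q->not_empty);
    pthread_mutex_unlock(&q->mu);
    return true;
}

/* Blocks until an item is available. */
void *bq_pop(struct bounded_queue *q)
{
    pthread_mutex_lock(&q->mu);
    while (q->count == 0)
        pthread_cond_wait(&q->not_empty, &q->mu);
    void *item = q->items[q->head];
    q->head = (q->head + 1) % QCAP;
    q->count--;
    pthread_mutex_unlock(&q->mu);
    return item;
}
```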
A hands-on tour of C’s memory model: from pre-C11 sequence points to C11 atomics, data races, fences, and practical patterns to avoid undefined behavior in systems code.
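A minimal sketch of release/acquire message passing with C11 atomics: the ordering on the flag is what makes the plain `payload` write visible, data-race-free, to the consumer.

```c
/* Sketch: publish data from one thread to another with release/acquire
 * ordering instead of racing on plain ints. */
#include <stdatomic.h>
#include <pthread.h>
#include <stdio.h>

static int payload;             /* plain data, protected by ordering */
static atomic_int ready = 0;    /* the flag that publishes it */

static void *producer(void *arg)
{
    (void)arg;
    payload = 42;                                            /* 1: write data */
    atomic_store_explicit(&ready, 1, memory_order_release);  /* 2: publish    */
    return NULL;
}

static void *consumer(void *arg)
{
    (void)arg;
    while (atomic_load_explicit(&ready, memory_order_acquire) == 0)
        ;                                        /* spin until the flag is seen */
    printf("%d\n", payload);                     /* prints 42, never garbage */
    return NULL;
}

int main(void)
{
    pthread_t p, c;
    pthread_create(&c, NULL, consumer, NULL);
    pthread_create(&p, NULL, producer, NULL);
    pthread_join(p, NULL);
    pthread_join(c, NULL);
    return 0;
}
```

The busy-wait keeps the example short; production code would park on a futex or condition variable instead of spinning.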
Build an LSM-ish key-value store in C: from append-only segments and fsync discipline to SSTable format, compaction (tiered vs leveled), and Bloom filters that dodge disk seeks.
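A minimal sketch of the Bloom-filter part: a negative answer is definitive, so the read path can skip an SSTable without touching disk, while a positive answer may be false and still costs a real lookup. The FNV-1a double hashing and the sizes are illustrative choices, not a tuned design.

```c
/* Sketch: a tiny Bloom filter over the keys of one SSTable. */
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define BLOOM_BITS   (1u << 16)         /* 65,536 bits = 8 KiB of filter */
#define BLOOM_HASHES 4

static uint8_t bloom[BLOOM_BITS / 8];

static uint64_t fnv1a(const char *key, size_t len, uint64_t seed)
{
    uint64_t h = 1469598103934665603ULL ^ seed;
    for (size_t i = 0; i < len; i++) {
        h ^= (uint8_t)key[i];
        h *= 1099511628211ULL;
    }
    return h;
}

void bloom_add(const char *key, size_t len)
{
    for (uint64_t i = 0; i < BLOOM_HASHES; i++) {
        uint64_t bit = fnv1a(key, len, i) % BLOOM_BITS;
        bloom[bit / 8] |= (uint8_t)(1u << (bit % 8));
    }
}

bool bloom_maybe_contains(const char *key, size_t len)
{
    for (uint64_t i = 0; i < BLOOM_HASHES; i++) {
        uint64_t bit = fnv1a(key, len, i) % BLOOM_BITS;
        if (!(bloom[bit / 8] & (1u << (bit % 8))))
            return false;               /* definitely absent: skip the seek */
    }
    return true;                        /* possibly present: go to disk */
}
```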
A minimal, production-minded Write-Ahead Log (WAL) in C: append-only records, checksums, segment management, fsync discipline, and recovery guarantees.
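A minimal sketch of one append in that spirit, assuming a `[length | crc32 | payload]` frame of my own choosing: checksum the payload, write the frame, and fsync before acknowledging. Recovery re-reads frames and stops at the first bad length or CRC.

```c
/* Sketch: append one WAL record and make it durable before acking. */
#include <errno.h>
#include <stdint.h>
#include <unistd.h>

/* Bitwise CRC-32 (reflected poly 0xEDB88320); table-driven is faster. */
static uint32_t crc32_buf(const void *data, size_t len)
{
    const uint8_t *p = data;
    uint32_t crc = 0xFFFFFFFFu;
    while (len--) {
        crc ^= *p++;
        for (int i = 0; i < 8; i++)
            crc = (crc >> 1) ^ (0xEDB88320u & -(crc & 1u));
    }
    return ~crc;
}

/* Write the whole buffer, retrying on EINTR and short writes. */
static int write_all(int fd, const void *buf, size_t len)
{
    const char *p = buf;
    while (len > 0) {
        ssize_t n = write(fd, p, len);
        if (n < 0) {
            if (errno == EINTR)
                continue;
            return -1;
        }
        p += n;
        len -= (size_t)n;
    }
    return 0;
}

int wal_append(int fd, const void *payload, uint32_t len)
{
    uint32_t hdr[2] = { len, crc32_buf(payload, len) };
    if (write_all(fd, hdr, sizeof hdr) < 0)
        return -1;
    if (write_all(fd, payload, len) < 0)
        return -1;
    return fsync(fd);   /* the durability point: don't ack before this */
}
```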
Async in C without 3 a.m. incidents: understand POSIX AIO, io_uring, and thread-pool strategies; compare latency/throughput/complexity; and build a thin, testable abstraction with real cancellation.
Struct/layout tactics, padding, and prefetch patterns to minimize cache misses and false sharing in real C code.
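A minimal sketch of the false-sharing fix: pad per-thread counters onto their own cache lines so relaxed increments from different cores don't invalidate each other. The 64-byte line size and the eight-slot array are assumptions.

```c
/* Sketch: one cache line per counter, so cores don't ping-pong the line. */
#include <stdalign.h>
#include <stdatomic.h>

#define CACHE_LINE 64

struct padded_counter {
    alignas(CACHE_LINE) atomic_long value;        /* one line per counter...   */
    char pad[CACHE_LINE - sizeof(atomic_long)];   /* ...and nothing sharing it */
};

/* Each worker increments only its own slot; a reader sums them. Without
 * the padding, adjacent counters share a line and every increment
 * invalidates the other cores' copies (false sharing). */
static struct padded_counter counters[8];

void worker_hit(int tid)
{
    atomic_fetch_add_explicit(&counters[tid].value, 1, memory_order_relaxed);
}

long total(void)
{
    long sum = 0;
    for (int i = 0; i < 8; i++)
        sum += atomic_load_explicit(&counters[i].value, memory_order_relaxed);
    return sum;
}
```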
Practical zero-copy techniques on Linux: when bytes can skip userland, how sendfile/splice/vmsplice/mmap actually move data, and how to avoid hidden copies and stalls.
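A minimal sketch of the sendfile(2) path: the kernel moves bytes from the page cache straight to the socket with no userland buffer in between. A blocking socket is assumed to keep the loop short; with a nonblocking one you would also handle EAGAIN via the event loop.

```c
/* Sketch: stream a whole file to a socket with sendfile(2). */
#include <sys/sendfile.h>
#include <sys/stat.h>
#include <errno.h>
#include <fcntl.h>
#include <unistd.h>

/* Returns 0 on success, -1 on error. */
int send_file(int sock_fd, const char *path)
{
    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;

    struct stat st;
    if (fstat(fd, &st) < 0) { close(fd); return -1; }

    off_t off = 0;
    while (off < st.st_size) {
        ssize_t n = sendfile(sock_fd, fd, &off, (size_t)(st.st_size - off));
        if (n < 0) {
            if (errno == EINTR)
                continue;        /* retry; sendfile already advanced off */
            close(fd);
            return -1;
        }
        if (n == 0)
            break;               /* file shrank underneath us: stop here */
    }
    close(fd);
    return 0;
}
```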
readv/writev, nonblocking I/O, partial writes, and EINTR-safe loops for production C.
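A minimal sketch tying those together: a `writev_all` helper (an illustrative name) that retries on EINTR and walks the iovec array forward across partial writes.

```c
/* Sketch: write a gathered header+body with writev(2), surviving EINTR
 * and short writes by advancing the iovec array in place. */
#include <errno.h>
#include <sys/uio.h>
#include <unistd.h>

/* Writes every byte described by iov[0..iovcnt) or returns -1. */
int writev_all(int fd, struct iovec *iov, int iovcnt)
{
    while (iovcnt > 0) {
        ssize_t n = writev(fd, iov, iovcnt);
        if (n < 0) {
            if (errno == EINTR)
                continue;        /* interrupted: just retry */
            return -1;           /* EAGAIN etc. left to the caller's loop */
        }
        /* Skip the iovecs that were fully written... */
        while (iovcnt > 0 && (size_t)n >= iov->iov_len) {
            n -= (ssize_t)iov->iov_len;
            iov++;
            iovcnt--;
        }
        /* ...and bump the partially written one. */
        if (iovcnt > 0) {
            iov->iov_base = (char *)iov->iov_base + n;
            iov->iov_len -= (size_t)n;
        }
    }
    return 0;
}
```

Note the helper mutates the caller's iovec array; that is the usual trade-off for avoiding an extra copy of the bookkeeping.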
Sampling vs tracing, symbolizing, and pinpointing cache-miss hotspots to guide real optimizations in C.