Apr 16, 2026
In-progress notes from building a JIRA-to-PR agent for Apache Pinot: the over-engineering instinct I started with, the signals that told me when to add each layer, and the things I'm still figuring out.
ai / llm / software-engineering / build-in-public
Apr 11, 2026
I spent a year hand-building visual explainers in Cursor. Then I tried to automate it with AI. The first month was all agent infrastructure and zero visuals.
ai / agents / software-engineering / build-in-public
Apr 02, 2026
A real incident walkthrough for when your system is healthy and completely wrong
distributed-systems / debugging / software-engineering / data-engineering
Mar 31, 2026
A practical explanation of snapshots, Delta Lake, Apache Iceberg and why data teams suddenly care about table formats.
data / data-engineering / big-data / apache-iceberg / delta-lake / software-engineering
Aug 12, 2025
The art of removing costs you didn't know you were paying on every row.
distributed-systems / data-engineering / java / apache-arrow / software-engineering
Jun 20, 2024
What it took to make a Stable Diffusion pipeline fast enough to replace Twitter profile pictures with anime versions as you scroll.
ai / stable-diffusion / chrome-extension / build-in-public / software-engineering
Feb 28, 2023
Achieve harmony in complex clusters using finite-state machines
big-data / software-engineering / distributed-systems / programming / apache-pinot
Jan 03, 2023
Unleashing the full potential of your RocksDB with the right configuration
database / programming / configuration / distributed-systems / software
Mar 02, 2021
Before diving into this article, I should state, as a developer, that any situation requiring you to place a language A project inside a language B package should occur very rarely. Most of the time it’s preferable to redesign the interaction between the language components. But what if the situation is unavoidable? Open source projects such as Apache Flink and Apache Spark serve as examples. These projects are written in JVM languages (Java and Scala) but also have Python modules available for those who don’t want to use the Java API.
java / apache / programming / python / software-development
Sep 29, 2020
analytics / software-development / sql / real-time-analytics / programming
Aug 18, 2020
One of the primary advantages of using Pinot is its pluggable architecture. Plugins make it easy to add support for any third-party system, whether an execution framework, a filesystem, or an input format.
distributed-systems / pinot / s3 / spark / big-data
Apr 09, 2020
big-data / database / software-development / machine-learning / programming
Apr 01, 2020
work-life-balance / personal / work-from-home / lifestyle / software-development
Mar 30, 2020
distributed-systems / technology / software-development / big-data / programming
Jan 20, 2020
A glimpse at the capabilities that make Airflow better than its predecessors
data / big-data / data-engineering / programming / software-engineering
Jan 11, 2020
Explore the various strategies for deploying ML models in Apache Flink, Apache Spark, or other real-time data streaming applications.
programming / machine-learning / software-engineering / big-data / data-science