Feb 28, 2023
Apache Helix: The Distributed System’s Orchestra Conductor
Achieve harmony in complex clusters using finite-state machines
big-data / software-engineering / distributed-systems / programming / apache-pinot
Writing archive
Build notes, systems writing, and project logs.
Feb 28, 2023
Achieve harmony in complex clusters using finite-state machines
big-data / software-engineering / distributed-systems / programming / apache-pinot
Jan 03, 2023
Unleashing the full potential of your RocksDB with the right configuration
database / programming / configuration / distributed-systems / software
Mar 02, 2021
Before diving into this article, I should state that — as a developer — any situation requiring placing a language A project into a language B package should occur very rarely. Most of the time it’s preferable to consider re-designing the interaction between various language components in these situations. But what if this situation is unavoidable? Open source projects such as Apache Flink and Apache Spark serve as examples. These projects have been written completely in Java but also have python modules available for those who don’t want to use the Java API.
java / apache / programming / python / software-development
Sep 29, 2020
Groovy Functions
analytics / software-development / sql / real-time-analytics / programming
Aug 18, 2020
One of the primary advantages of using Pinot is its pluggable architecture. The plugins make it easy to add support for any third-party system which can be an execution framework, a filesystem, or input format.
distributed-systems / pinot / s3 / spark / big-data
Apr 09, 2020
big-data / database / software-development / machine-learning / programming
Apr 01, 2020
work-life-balance / personal / work-from-home / lifestyle / software-development
Mar 30, 2020
Leader and Followers
distributed-systems / technology / software-development / big-data / programming
Jan 20, 2020
A glimpse at capabilities which makes Airflow better than its predecessors
data / big-data / data-engineering / programming / software-engineering
Jan 11, 2020
Explore the various strategies to deploy ML models in Apache Flink/Spark or other realtime data streaming applications.
programming / machine-learning / software-engineering / big-data / data-science