In this post, we will try to understand the different factors which control the write operations in MongoDB. We will try to tie in the common concepts like checkpointing, journaling, replication that we hear so often in the context of write operations...
Developers who learn or start with Golang are taught to treat goroutines as a very cheap version of threads. The minimum cost of spawning a goroutine has been decreasing with different versions and is now currently at 2 kB for Golang 1...
This article covers how to resolve python dependencies using Pythonβs Abstract Syntax Trees (AST). There are different and maybe better ways to understand the scope of your python dependencies...
This blog is the 2nd part of a blog post on how to write custom logic for your kubernetes objects. The first post can be found here [https://gsarmaonline...
In this post, we will be going over the fastest no-frills approach to getting your operator off the ground using kubebuilder. The post assumes knowledge of the following: Kubernetes and how it works Kubernetes custom resource definitions Kubernetes Operators and reconciliation loops Setting up a local cluster, I use kind for my k8s orchestration needs Golang The task is to create an operator that operates on a Kubernetes CRD TodoList...
Most of the problems that people mention with their Kafka implementations is that they donβt have the complete visibility required over the configuration and the API usage. Having more visibility into the commonly required tweaks can allow admins and developers to use Kafka as comfortably as a MySQL or PostgreSQL cluster...
Each kafka topic defines the number of partitions and replication factors when itβs created. However, once a topic is created, the partition count cannot be changed without affecting the ordering guarantees of the kafka partitions since kafka uses the following formula to calculate which partition a record should go to: partition\_id = partition\_key % number of partitions Kafka partitions are the gateway to concurrency and scalability...
I was recently looking at a managed Kafka service and came across services like AWS MSK and Kafka on Confluent Cloud. While comparing these services, I saw that there were limitations on the number of partitions allowed in a cluster...
Monarch is a planet-scale in-memory time series database developed by Google. It is mainly used by as a reliable monitoring system by most of Googleβs internal systems like Spanner, BigTable, Colossus, BlobStore...
What is virtual time and why do we need it. As distributed systems have progressed and been adopted over the last decade, there have been numerous technologies in different segments like databases, caches, message queues, etc which are built on top of other frameworks which abstract away the difficulty of managing distributed systems...