Is ClickHouse Better Than Snowflake?
I was pitching SIVARO's data infrastructure services to a fintech CTO in mid-2023. Their team had been bleeding money on Snowflake for 18 months. $2.3 millio...
Technical articles on ClickHouse consulting, data infrastructure, and production AI systems. Written by Nishaant Dixit, Founder & Lead Engineer at SIVARO.
Let me tell you a story. In 2021, I sat in a room with a fintech team who had just gotten their Snowflake bill. $47,000 for a month of analytics queries. T...
I've been building data infrastructure for over six years. I've burned real money — client money, investor money — testing both ClickHouse and Snowflake ...
I’ve lost count of how many times someone has asked me: "Is ClickHouse SQL or NoSQL?" Usually they’re staring at a columnar database that ingests 100K ro...
I'll tell you straight: is Kubernetes still relevant in 2026? Yes. But not for the reasons most people think. In 2022, I had a client, a mid-size fintech ...
I get this question every week. A founder at a Series A startup asks me, "Is Kubernetes the same as AWS?" A CTO at a mid-market company asks the same thing, ...
You’re building a streaming platform. Millions of users. Global traffic. Every second of downtime costs you subscribers. You...
I remember sitting in a conference room in Bangalore in 2019, convincing a skeptical CTO that Kubernetes wasn't just hype. His first question: "If Kubernetes...
I remember the exact moment I stopped caring about the title. 2019. I'm at a conference in Bangalore. A guy walks up to me, says he's a "Platform Engineer." ...
In 2023, my team at SIVARO was tasked with building a customer support agent that could autonomously resolve billing disputes. We thought we just needed a ...
I started SIVARO in 2018 thinking the hardest part of AI would be the models. I was wrong. The har...
You're building a retrieval-augmented generation system. You've got docs indexed, embeddings ready, and a language model waiting to answer questions. But you...
You've built a chatbot that answers questions. It's smart enough to sound human. But when someone...
I didn't start SIVARO to build AI agents. I started it because I was tired of watching companies spend millions on infrastructure that collapsed under produc...
Every week, another CEO asks me: "Nishaant, which AI agent should we bet on?" They've read the headlines. They've seen the demos. They're terrified of being ...
I’m Nishaant Dixit, founder of SIVARO. We build data infrastructure and production AI systems. Every day, someone asks me: what does a platform engineer do...
You've heard the hype. Every SaaS product now calls itself an "AI agent." Your boss wants you to deploy one by Friday. But when you strip away the marketing,...
You’ve heard the hype. Every vendor claims their chatbot is now an “agent.” Every demo shows a bot booking flights, filing expenses, writing code. But ...
Let me tell you about the first time I thought I understood AI agents. It was January 2023. One of our clients at SIVARO — a mid-size logistics company —...
Every week, a founder pitches me their "AI agent" startup. And every week, I ask them the same question: "What does an AI agent do exactly?" Most can't [ans...
You're sitting in a meeting, and someone says "we need to build...
Here's the short version: An AI agent is a system that perceives its environment, makes decisions, and takes actions to achieve goals — without you microma...
I was six months into building SIVARO when a potential client asked me flat out: "What does Kubernetes actually do?" Not "What is Kubernetes?" He kne...
I spent the first six months of my career hating Kubernetes. Not because it was hard. Because I couldn't answer the simplest question from my CEO: "What does...
I remember the exact moment I realized raw LLMs weren't going to cut it for production systems. It was January 202...
Kubernetes isn't a single thing. It's a contradiction. I've spent the last six years building...
I spent three years ignoring Kubernetes. Thought it was overhyped. Another tool for ops teams to justify their existence. Then I tried running a real produ...
I remember the exact moment I realized platform engineering wasn’t just DevOps with a new label. It was late 2019. We were building a data pipeline...
I spent two years building internal tools wrong. At SIVARO, we were shipping data pipelines for clients: event-driven systems...
I spent the first six months of 2024 watching my team try to get two Salesforce AI agents to talk to each other. It was a mess. One agent would fire off a ta...
I remember the first time a junior engineer on my team used GitHub Copilot to write a complex data pipeline. He was three months out of bootcamp. The code c...
I spent the first half of 2024 convinced that multi-agent systems were pure hype. No...
I remember the exact moment I stopped believing in “just connect the APIs.” We were building a fraud detection pipeline for a fintech client in mid-2022....
You’re staring at a monolithic database that’s crashing under 50K queries per second. Your team’s been told to “scale up”—buy bigger hardware, ad...
I remember the exact moment I stopped believing in "real-time" data warehouses. It was 2020. We were building a fraud detection pipeline for a fintech client...
I learned this the hard way. In 2019, my team at SIVARO built a monolithic system for a client. Three months later, a single database connection pool exha...
You’re feeding a 200-page legal document to GPT-4. Halfway through, it forgets what the plaintiff argued on page 3. You bump the context window to 128K tok...
You're building an AI system that needs to talk to databases, APIs, and file systems. Six months ago you'd wire up each integration by hand — custom code f...
I spent six months in 2023 watching a perfectly good AI system collapse under its own complexity. Three agents, each trained on different datasets, each with...
I remember the exact moment I stopped believing in magic. It was March 2023. My team at SIVARO had just spent six weeks building what we thought was a "smart...
You've run Kubernetes in production for six months. Your pods restart, your nodes ...
You're building something that needs to handle 10,000 requests per second. Or maybe you're migrating a monolith because Monday morning traffic killed your da...
I've spent the last six years building data infrastructure and production AI systems at SIVARO. I've burned through more orchestration tools than I care to c...
I've been asking myself this question since 2021. Back then, most "AI orchestration" meant piping three Python scripts together with Airflow. Today? The land...
I remember the first time I heard "Docker" in a team meeting back in 2015. Our lead engineer said "just containerize it with Docker" and everyone nodded. I d...
I was sitting in a product review last week when an engineer asked me: "Who are the big 4 AI agents? Like the FAANG of agents?" Good question. Bad framing. T...
The honeymoon is over. In 2020, I watched a team of twelve spend six months migrating their Rails monolith to Kubernetes. They wanted "cloud native." They wa...
I built SIVARO in 2018. We design data infrastructure and production AI systems. For years, Kubernetes was our default answer. Container orchestration? Kub...
You’re running a Kubernetes cluster in production. Everything’s fine. Then Slack blows ...
I spent three months building an AI agent system that failed in production within 12 minutes. The agent was brilliant in staging. It had perfect chain-of-tho...
Let me tell you about the moment I stopped being skeptical. It was March 2024. I was staring at a terminal window at 2 AM, watching an agent I'd built autono...
You're staring at a compliance...
You're staring at a SOC 2 audit request. The auditor wants to know every AI model in production...
I spent the first six months of 2024 building...
You've got three AI agents talking to each other. Each one calls a different LLM. A supervisor agent routes tasks between them. Everyone calls this "orchestr...
I was on a call with a client in Singapore when their dashboards went dark. Not just theirs — every customer. The AWS console showed nothing. No errors. No...
Four years ago, I watched a $50K Kafka cluster melt down because our AI orchestration layer couldn't handle two concurrent model calls. The pager went off at...
I’m Nishaant Dixit. I run SIVARO. We build...
Here's a hard truth most people won't tell you: "free AI" is never truly free. It comes with invisible costs—data privacy concerns, rate limits, and hidden...
I bought my first Ugreen NASync DXP6800 Pro six months ago. The specs looked incredible. Intel Celeron N100. 8GB RAM expandable to 64GB. Dual 2.5GbE ports. A...
I’ll cut straight to it: yes, UGREEN NAS can run Docker. But “can” is doing a lot of work. I have three UGREEN units on my desk right now—a DX4800, a...
I spent three months in 2024 building a chatbot for a logistics client. We tried GPT-4, Claude, fine-tuned models, the works. The CEO asked me one question t...
I spent three years building analytics on PostgreSQL. Then I hit a wall. A client wanted real-time dashboards over 200 million rows. PostgreSQL screamed. Que...
The phone rang at 2 AM. Our real-time analytics dashboard was down. Again. Everyone said Snowflake was the answer. Elastic. Scalable. The cloud data warehous...
I’ll never forget the moment our dashboard stopped loading. It was 2 AM. A room full of engineers staring at a spinning wheel. 50 million rows. Twelve join...
I'll never forget the call. 3 AM. Our Snowflake query was running for 47 minutes. The dashboard was dead. Users were screaming. The problem wasn't Snowflake....
I spent three years debugging why perfectly good data compression algorithms ran like garbage on GPUs. Not because the algorithms were wrong. Because the com...
You've got SOC2 certification. Congratulations. Now the real nightmare begins. I learned this the hard way at SIVARO. We spent six months implementing RAG pi...
Four years ago, I sat in a hotel lobby in Bangalore testing a "conversational AI" travel agent for a client. It took seven minutes to book a simple flight fr...
Most GPU developers think a kernel launch is simple. Call the function. Wait for results. Done. I learned the hard way this is wrong. In 2023, I was debuggin...
I'll be honest: when I first heard about DeepSeek, I dismissed it. Another Chinese AI lab claiming breakthrough? Seen that movie. Then I...
You've heard the buzz. DeepSeek V4 is out. The community is losing its mind over 1M context windows and pricing...
I've been building...
I'll never forget the call. A CTO friend of mine had just rolled out DeepSeek V4 across his customer support pipeline. Three weeks later, his cloud bill was ...
I spent the last month hammering on the DeepSeek V4 free trial API. Not because I'm cheap. I needed to know if it's produ...
Let me cut through the noise. I've spent the last three months running DeepSeek V4 Pro through our production pipelines...
I run a product engineering...
You don't care about benchmarks. You care about whether your CI pipeline stops failing. Whether that 2 AM deploy doesn't blow up. Whether the junior de...
You're looking at two models...
I’ve spent the last eight years building data infrastructure and production AI systems. I’ve seen the hype cycles. I’ve burned budgets on models that c...
I spent two weeks stress-testing both models against production workloads at SIVARO. Here's what I found. You've s...
I remember the first time someone told me to "just containerize it." This was 2016. I was debugging a Python app that worked on my laptop but crashed on stag...
I remember the exact moment Docker clicked for me. 2015. I was trying to deploy a Python app that worked perfectly on my MacBook but crashed on the Ubuntu se...
I still remember the moment clearly. A senior engineer asked me during a system design review: "Does ChatGPT use MCP under the hood?" I paused. The answer wa...
I get asked this question every week. Clients, engineers...
I spent three years building production AI systems before I asked the obvious question. Why does every LLM integration feel like duct-taping a firehose to a ...
Everyone builds chains. They're simple. They work for demos. But in production, chains break—hard. I learned this the hard way building AI systems at SIVAR...
I spent last Tuesday rebuilding a retrieval pipeline for the third time this year. Not because the data was bad. Because the context kept breaking. Then I go...
I spent three weeks debugging a cache miss issue in late 2025. The hash map was fine on paper. O(1) lookups, textbook implementation. But at 50,000 requests ...
I’ve been building production AI systems since 2018. I’ve watched teams burn millions on inference latency because they didn’t understand what happens ...
You’re building a system that needs to talk to other systems. Maybe it’s an AI...
Let me start with a story. August 2024. I'm sitting in a back room at a startup in Bangalore, watching two engineers argue for forty minutes about whether th...
I’ve been running Kubernetes in production since 2017. At SIVARO, we’ve built da...
I spent the first six months of 2026 inside the engine room of inference optimization. My team at SIVARO was tasked with cutting latency on...
I've sat through hundreds of interviews. On both sides of the table. Most candidates fumble when asked "Explain Docker." They recite definitions from tutoria...
I remember my first engineering leadership interview. The VP asked me to explain Docker containers. I rambled about "lightweight virtualization" and "isolate...
I spent the first half of 2025 convinced the bottleneck was model size. Bigger models, more GPUs, problem solved. Then my team at SIVARO hit a wall running p...
I spent two years watching teams fail at agent orchestration. Not because their models were bad. Not because their agents couldn't reason. Because they treat...
It was 3 AM in June 2024. I was sitting in a co-working space in Bangalore, staring at a CUDA out-of-memory error for the fourth time th...
I've spent the last six years building...
I’ve spent the last eight years building production AI systems. I’ve seen companies burn millions on “AI agents” that were just glorified chatbots. S...
Last week, a CTO told me he'd "deployed AI agents" across his company. He'd hooked ChatGPT up to a Slack bot. That's not an agent. That's a glorified chatbot...
I remember the exact moment I realized ChatGPT was not an AI agent. My team had built a pipeline that used ChatGPT to automatically triage support tickets. T...
I spent three years selling Snowflake. Then I spent two years building on ClickHouse. The question "is ClickHouse better than Snowflake?" isn't simple — bu...
I spent three years building a real-time analytics platform on Snowflake. We hit 50TB of data. Queries took minutes. The CFO asked why our infra bill looked ...
I still remember the panic. 2 AM. Dashboard frozen. The CEO was in a meeting with the board, and our Snowflake warehouse was spitting out 45-second query tim...
I spent five years building data pipelines at scale. 200,000 events per second. 50 TB of new data daily. The question everyone asked me: “ClickHouse or Sno...
I built a real-time analytics pipeline that cost $14,000 per month on Snowflake. ClickHouse did the same job for $1,800. Everyone talks about "cloud data war...
Two years ago, I watched a client burn $40,000 in a single week on Snowflake queries. Their dashboard was loading in 45 seconds. They were panicking. I told ...
I spent six months in 2023 migrating a client’s analytics stack from Snowflake to ClickHouse. Fifty terabytes of event data, 200 concurrent queries per sec...
Here's what nobody tells you about DeepSeek AI safety: the public debate is missing the real engineering problems. I run SIVARO. We build production AI syste...
I'll never forget the panic in a CTO's voice last month. His team had just deployed a DeepSeek R1-powered customer support system. Then an auditor asked: "Wh...
I’ll be straight with you: when DeepSeek R1 dropped in late 2024, I dismissed it as another Chinese LLM trying to catch up. Then my team at SIVARO started ...
I'll be straight with you: when I first heard about DeepSeek, I assumed it was another also-ran. "Chinese ChatGPT clone" — that's what everyone called it. ...
Everyone told me ChatGPT was the only option. Six months ago, I stopped listening. I run a product engineering shop. We build data-intensive systems that nee...
Back in 2023, I was burning cash on GPT-4 API calls like it was confetti. My team at SIVARO was building a real-time data pipeline that needed to summarize...
I spent last Tuesday rebuilding a production RAG pipeline. My team had two weeks to cut latency by 40% and reduce costs by 60%. The choice came down to two m...
I spent three weeks building a production RAG system. First with GPT-4o. Then with DeepSeek-V4. The results surprised me. Most people think this is a simple ...
I spent last Thursday testing five AI models side by side. My coffee went cold. My CPU hit 92°C. And I emerged with a clear answer to the question everyone ...
I spent last month stress-testing DeepSeek against GPT for a production RAG pipeline handling 50K queries daily. The results surprised me. Most AI comparison...
I spend my days building data pipelines and production AI systems at SIVARO. W...
Last week, I watched a $50M data pipeline fall apart because of a bad model choice. The CTO had picked an LLM for token optimization, but the latency killed ...
Here's what I learned the hard way: most people compare AI models like they're comparing specs on a laptop. More parameters. Faster inference. Cheaper tokens...
You're building something real. A product. A pipeline. A system that needs to work at scale, with predictable costs and consistent output. And someone in you...
Six months ago, I bet my production AI pipeline on DeepSeek. My team at SIVARO had been running GPT-4 on critical data infrastructure—real-time event proce...
Let me start with something I learned the hard way. In early 2025, I was building a real-time data pipeline for a client at SIVARO. We needed an LLM to cla...
Let me tell you a story. Two weeks ago, I was on a call with a CTO from a mid‑size logistics company. He'd just read about DeepSeek and asked me point‑bl...
I’ll cut straight to it. Everyone’s asking “is DeepSeek for free?” because DeepSeek launched with a zero-price API, open weights, and a narrative tha...
Look, I get it. You've heard the hype. Docker this, containers that. Someone on your team ...
I've had this conversation at least fifty times. A CTO leans across the table and says, "So Docker is basically a lightweight VM, right?" They're wrong. But ...
I've seen it a hundred times. A new engineer joins the team. They look at Docker. They say "Oh, it's like a lightweight VM." And suddenly we're debugging per...
I’ll never forget the look on my client’s face at a fintech startup in 2019. They’d just spent six months migrating their monolith into “containers.”...
I remember the exact moment I stopped pretending Docker was just a lightweight VM. We were debugging a memory leak in production. The container kept crashing...
I’ve been building production data systems since 2018. SIVARO started on AWS. We migrated core infrastructure to GCP. Then we ran hybrid for two years. The...
You're staring at a billion-dollar decision. Your CTO says "just use Kafka." Your lead engineer says "it's overkill." Both are right. Both are wrong. I've sp...
I remember the exact moment Kafka broke me. 3 AM. Pager screaming. Our production cluster had silently dropped 400GB of uncommitted data into the void. The o...
I spent three years running Kubernetes clusters that nearly broke my team. We hit 200K events per second, and every control plane failure taught me something...
I built my first Kubernetes cluster in 2018. Three weeks of sleepless nights. Countless broken YAML files. A production outage that cost us $40K in lost reve...
I’ve lost count of how many engineering leads have asked me: “Should we use Docker or Kubernetes?” The question itself is the problem. It’s like aski...
I spent three years avoiding MoE. Thought it was hype. Too complex. Too many failure modes. Then we hit a wall at SIVARO. A customer's production system need...
I remember the first time I tried to scale a streaming service. Everything broke. Databases melted. We hit rate limits we didn't know existed. That's when I ...
Let me kill the suspense: Yes, Netflix uses Kubernetes. But not the way you think. And not everywhere. And honestly, their relationship with Kubernetes is mo...
I spent three years building a platform team before I realized my biggest mistake. I hired five DevOps engineers and told them to build an internal platform....
I've built ten platforms from scratch. Eight of them failed. Not because the tech was wrong. Because we confused roles. Every CEO asked me: "Is platform engi...
I walked into a conference last month. Three different speakers used “platform engineering” and “DevOps” interchangeably. Five VPs in the audience no...
I've spent the last year helping engineering...
You're running Kubernetes...
I've spent the last six years building data infrastructure at scale. I've seen AWS bills that'd make a CFO cry. ...
Let me tell you a story. Two years ago, I was staring at an AWS bill that made my stomach drop. Our Kubernetes cluster...
You've got an EKS cluster running Cluster Autoscaler. It works...
The 3AM page hit my phone like a brick. Node failures across three availability zones. Pods stuck in Pending for eighteen minutes. My team was burning hours ...
I'm going to tell you something most cloud...
It's July 2026. I just spent six hours debugging a cluster that randomly scheduled a critical AI inference pod onto a node with failing memory. The alert fir...
I spent three weeks rebuilding a deployment pipeline in 2019. Manual SSH. Shell scripts. Prayers to the server gods. The system worked — until Black Friday...
I spent years building...
You’ve deployed your LLM. Prompt engineering is solid. The RAG pipeline works...
You're a CISO at a Series B company. Your board just asked for SOC 2 Type II by Q3. Your security team is you and a part-time intern. Your budget? Maybe $50K...
I spent six months optimizing a mixture-of-experts model last year. The result? A 3x throughput improvement in training. Everyone cheered. Then the productio...
I spent the first six months of 2025 convinced we'd run every production workload on GPT-4-class models. Then our AWS bill hit $47,000 in a single month. Tha...
Every week, another CTO asks me this. They're building something ambitious. Data pipelines breaking. Costs spiraling. And they just want to know: What does A...
I spent three nights debugging an agent-to-agent handoff that kept dropping context. The cards were wrong. The state was lost. Two production systems, each p...
Last year, I watched a team burn six months building a RAG system. Their demo was flawless. Their demo is always flawless. The production system? It hallucin...
I spent last Tuesday debugging a latency spike that nearly cost us a client. The setup looked perfect on paper — Claude for reasoning, Codex for code gen, ...
I thought I did. Seven years ago, I was building a "distributed" system with two microservices and a shared MySQL database. I called it scalable. Then we hit...
Six months post-close on a Series A. The customer procurement team from a Fortune 500 just flagged your SOC 2 Type II report. Exceptions. Contract rescinded....
Here's the thing nobody...
I spent last month touring semiconductor fabs in Pyeongtaek. What I saw changed how I think about automation. The clean rooms are getting quieter. Not becaus...
Your microservices are tangled. Your distributed transactions are failing. You're writing cron jobs to fix other cron jobs. I've been there. In July 2026, te...
It was 2:47 AM on a Tuesday in April 2026 when I saw the first alert. A single GitHub account—no avatar, no bio, created three hours earlier—had pushed 1...
I spent three months building an AI agent system that failed in production in the first hour. Seven different LLM calls. A hand-crafted routing layer. Custom...
I spent three months building a RAG system that returned worse results than raw keyword search. The problem wasn't embedding quality or LLM size. It was ever...
You're evaluating an AI vendor. Maybe it's a model...
I remember staring at AWS bills in 2024. $180,000 per month on GPU instances. Our LLM serving stack was hitting 45% utilization. Everyone said "buy more GPUs...
I've spent the last six years building...
I remember the exact moment I realized monoliths were killing us. It was 2021. Our team at SIVARO was running a single PostgreSQL cluster for everything—us...
I spent three months last year fighting a production system that kept losing the plot. Specifically, it couldn't remember what happened five pages ago in a l...
I built my first LLM-powered code generator in 2023. It was terrible. The code compiled but introduced three security vulnerabilities I didn't catch for two ...
I spent three months building an AI system with the wrong kind of LLM. Cost us $47,000 in wasted compute. The model kept hallucinating on structured data. We...
I spent six months last year watching a team burn $400,000 on the wrong LLM architecture. They picked a dense transformer because "bigger is better." Their i...
I spent three years building AI systems that broke in production. Every. Single. Time. The problem wasn't the model. It was the agent architecture. I kept sh...
Most teams build RAG the same way. They chunk documents, stuff them into a vector database, and call it a day. Three months later, they're wondering why thei...
I spent six months last year watching a RAG system hallucinate its way through production. The embedding model was wrong. The chunking strategy was a joke. T...
I spent last Thursday debugging a production RAG pipeline that should have worked. Documents were indexed. Embeddings looked clean. But the answers coming ou...
I spent three years building AI agents that broke in production. Not because the models...
The call came at 2 AM. Our customer's RAG pipeline was hallucinating financial data. The agent we'd built was supposed to verify every answer against their i...
It was 3 AM in December 2023. My team at SIVARO was training a 7B parameter model for a client in financial services. The single-GPU run was scheduled to fin...
I spent two years trying to hire a platform engineer. Every resume said the same thing: "Built internal tools." "Managed Kubernetes." "Loved DevOps." None of...
I spent three years building the wrong thing. My team called it “platform engineering.” We built beautiful internal tools, fancy dashboards, and self-ser...
I’ll never forget the day a CTO asked me: “What does a platform engineer actually do?” He’d hired three of them, spent $600K in salaries, and his dev...
I spent three years building infrastructure that no one used. Beautiful Kubernetes clusters. Perfect CI/CD pipelines. Zero adoption. The problem? I built wha...
Let me tell you a story. In 2023, a client came to me — let's call them FinFlow, a payments startup processing $2B annually. They'd built a chatbot using G...
I’m going to tell you a story about a database that broke my production system at ...
I spent three years building monolithic data systems before I learned the hard truth. Every query that crossed 100ms hurt. Every scaling decision required re...
I remember my first Kubernetes deployment like it was yesterday. We spent three weeks setting up a cluster. Two more weeks debugging networking. The system r...
If you've been in tech for more than five minutes, you've heard the Kubernetes pitch. "It's like Docker for your whole infrastructure." "It abstracts away th...
I remember the exact moment I realized standard LLMs were lying to me. We were building a customer support system at SIVARO. The model answered confidently. ...
I remember the exact moment I stopped believing that bigger models alone would solve enterprise AI. A client had deployed GPT-4-class models for customer sup...
I remember my first AWS bill. I was 24, building what I thought was a simple data pipeline. Three months later, I got a $47,000 surprise. My stomach dropped....
I remember my first AWS bill. Fresh out of startup land, I'd spun up 12 EC2 instances for a proof-of-concept. Three days later, I had a $4,000 surprise and z...
I was sitting in a client's war room at 3 AM. Their e-commerce platform was melting. The database CPU was pinned at 100%. Redis was evicting keys every secon...
I've been building data infrastructure since 2018. For the first three years, I thought Kubernetes was the answer to everything. Then I ran a 200-node cluste...
I remember the panic. 3 AM. A microservice was thrashing. My on-call phone wouldn’t stop buzzing. The old way? SSH into a box, restart a process, pray. Tha...
I was running 47 microservices on bare metal in 2018. Every deployment meant SSH-ing into servers. Every scaling decision meant guessing. Every crashed conta...
I spent three months debugging a payment system. The logs said "event received at 14:32:01." The database said "processed at 14:31:59." The customer's bank s...
I remember a conversation from early 2025. A CTO told me his team was building a "modern distributed system." No tiers. Just microservices everywhere. Event-...
I spent eighteen months building what I thought was the perfect inference stack. One giant GPU cluster. Monolithic serving architecture. Every model, every b...
I spent six months building a monolithic inference system. It worked great in testing. In production, it collapsed under 2,000 concurrent requests. The probl...
I spent three months rebuilding an inference stack that didn't need rebuilding. We had eight A100s in a single node. Latency was fine. Throughput was fine. T...
I spent three days debugging a production outage. The root cause? A developer’s laptop had Python 3.9. The production server ran 3.11. We lost $40,000 in r...
I built my first MoE system in 2023. It broke in production within three hours. The router kept sending every token to the same expert. We had a 10TB expert ...
I spent three years building RAG systems that worked great in demos and fell apart in production. Every integration was a bespoke mess. Different APIs. Diffe...
I spent three months building an AI agent system that could answer customer queries from our internal docs. It worked perfectly in staging. Then we hit produ...
I spent six months building an internal platform that nobody used. The code was clean. The architecture was elegant. The CI/CD pipeline was a work of art. Bu...
I remember the exact moment I realized our platform engineering bet had paid off. We were running a data pipeline processing 200,000 events per second. Our M...
I spent three months in 2024 building what I thought was a perfect RAG system. It worked beautifully in the demo. Then we put it in production. The chatbot s...
I’ll tell you what I told a CTO at a Series B fintech last month: if you think agent-to-agent protocol is just another API layer...
You're running three AI agents in [production](/articles/what-is-a-model-context-protocol-the-missing-layer-for-ai). One handles customer intake. Another d...
I spent six months building a multi-agent system for a logistics client. Three agents. Two different frameworks. Zero interoperability. We had a planning age...
Last year, I watched a senior engineer rewrite 800 lines of Kafka consumer logic in 45 minutes. Not alone—with an AI pair. The code passed code review on f...
Last month, a CTO showed me his RAG pipeline. It had seven separate scripts, three cron jobs, and a manual step to re-index when the embedding model changed....
I was debugging a production RAG pipeline at 2 AM last week. The LLM kept hallucinating because the retrieval step was running before embeddings finished upd...
I remember the moment clearly. Midnight on a Tuesday, three years ago. Our multi-agent system was processing customer support tickets, and everything seemed ...
I spent three months building an AI system that failed in production. Every agent stepped on each other. Data went missing. Costs spiraled. The problem wasn'...
I spent three months building a multi-agent system the wrong way. Each AI agent worked in isolation. They never shared context. The result? A tangled mess of...
I spent three nights debugging a production outage last month. Our FastAPI service was serving 40,000 requests per minute. Then it stopped. Not because FastA...
I spent three months building a RAG pipeline that worked perfectly in testing. Then we pushed it to production. The system fell apart within hours. Latency s...
You're staring at a dashboard. Two AI agents are supposed to be talking to each other. One is supposed to query a database. The other is supposed to format a...
I spent two years watching AI agents fail in production. The problem wasn't the models. It was the handshake. Every team I advised was building agents in iso...
Let me tell you a story. I was building a data pipeline for a client in early 2023. They had two [systems](/articles/what-is-a-model-context-protoco...
I built production AI systems for five years before I truly understood agentic orchestration. My team spent three months wiring together LangChain agents, Ka...
I spent three months building what I thought was the perfect AI pipeline. Six models. Four custom agents. A dozen API calls chained together like a beautiful...
Here’s a story from the trenches. Two years ago, I watched a team spend three months trying to scale a monolithic ClickHouse deployment. They added RAM. Th...
I learned this lesson the hard way. In 2019, our streaming pipeline collapsed during a customer demo. 47 seconds of downtime. Lost the deal. The problem wasn...
I remember the exact moment I stopped trusting my data pipeline. Four AM. A production alert. Our ClickHouse cluster was choking on 200K events per second be...
I still remember the panic in 2019. Our event pipeline was melting down at 50,000 events per second. Postgres was choking. Redis was filling up. The CTO aske...
I spent three months building a real-time analytics pipeline with Kafka before I realized I'd completely misunderstood the tool. We were processing 50,000 ev...
I remember my first Kafka deployment like a bad hangover. We were processing 50K events per second. Everything crashed. The producers backed up. Consumers fe...
Building data pipelines at 200K events per second taught me one hard truth: cloud choices aren't academic. They're survival decisions. I've watched teams bur...
Most [people](/articles/is-kubernetes-still-relevant-in-2026-2) think Azure is just Microsoft’s [answer](/article...
I remember sitting in a cluttered conference room in 2019, watching a team of five engineers manually move CSV files between S3 buckets. They had no pipeline...
Back in 2018, I was at a client site in Bangalore, staring at a cluster of Spark jobs that took 14 hours to run. The team had built everything on-prem — 20...
I've spent the last eight years building data infrastructure at SIVARO. My team has deployed systems processing 200,000 events per second. We've migrated pet...
I spent three years fighting with data infrastructure at my last startup. We were processing 200K events per second on a shoestring budget. Cloud costs were ...
I’ve been wrong about Azure more than once. Back in 2019, I told a client that Azure was just “Microsoft’s AWS clone” — a catch-up play with a diff...
My phone buzzed at 3:47 AM. By 3:49 AM, I was staring at five red dashboards. Our entire production pipeline was dark. ClickHouse clusters unreachable. Kafka...
I’ll never forget the moment our real-time dashboard froze at 3 AM during a Black Friday sale. 200K events per second streaming in. PostgreSQL on its knees...
I’ll never forget the day our PostgreSQL cluster started gasping. 10 TB of time-series data. 200,000 writes per second. Queries that took 45 seconds. The e...
I spent two years at a fintech startup rebuilding our analytics pipeline three times. First PostgreSQL. Then Cassandra. Then Elasticsearch. Each time, the sa...
Most engineering leads hear "DeepSeek" and think cheap API token prices. That’s a mistake. The real value isn’t cost. It’s how this model family solves...
Last month, I watched a production cluster burn $8,000 in GPU hours in under three hours. The cause? A classic monolithic inference setup. One model deployme...
Last month, one of my clients called me in a panic. Their LLM inference costs had hit $480,000 for the quarter. They were serving a chatbot that barely worke...
Last year, I watched a cluster of 8 H100s eat $12,000 in GPU time serving 150 concurrent users. The problem wasn't inference speed—it was wasted compute du...
I remember the exact moment our inference pipeline broke. 3:47 AM. A customer running a complex multi-hop RAG query. The GPU memory screamed, then died. We l...
Most teams building LLM inference systems are wasting money. They don’t know it yet. I’ve seen this pattern repeat across a dozen companies: spinning up ...
I spent three years building single-node LLM systems at my last startup. Every request was a prayer to the latency gods. Then we hit 10,000 concurrent users,...
Distributed software architecture isn’t what most people imagine. Six years ago, I watched my first production system collapse during a Black Friday sale. ...
I’ll never forget the call at 2 AM. Our monolithic application had just handled 50,000 concurrent users during a flash sale. The database locked. The queue...
It's 2014. I'm staring at a production outage. The app works perfectly on my MacBook. The staging server runs it fine. But production? Dead. The error mess...
I still remember the first time I saw a production system go down because of the "but it works on my machine" problem. 2017. A client's data pipeline was fai...
I’ll never forget the first production meltdown I caused. It was 2019. My team had built a Python microservice that worked perfectly on my MacBook. We push...
I remember the exact moment I almost rage-quit infrastructure engineering. It was 3 AM. My team had just spent 12 hours debugging why a Python app worked per...
I spent three weeks debugging a pipeline that should have taken three days. The culprit? A single query pattern that worked fine with PostgreSQL but complete...
I was building a RAG pipeline last year. Documents were scattered across three databases. The retrieval pipeline was brittle. Every query felt like gambling....
Let me tell you a story. It's 2023. My team at SIVARO just shipped a RAG system for a healthcare client. The retrieval latency was 200ms—acceptable in demo...
I remember the exact moment Kafka broke my production system. 3 AM. 200,000 events per second. A rogue consumer group lagging by 47 minutes. The pager didn't...
You're building a system that needs to handle data in motion. Maybe it's clickstream events from a million users. Maybe it's IoT sensor readings from a facto...
I learned this the hard way. My first Kafka deployment nearly took down production. We had 50 microservices, each consuming from the same topic. I thought I ...
Ten years ago, I watched a startup's entire data pipeline collapse during a Black Friday sale. The CEO turned to me and asked, "Why can't we just use a datab...
I’ve been called at 2 AM more times than I care to admit. The symptom? “Our pipeline is down.” The cause? Not Kafka. It was how we wired it. Here’s t...
I learned the hard way that connecting AI models to real data sources is a mess. Every system spoke a different protocol. Every integration broke differently...
I spent six months building a RAG pipeline that broke every time we changed a data source. The connectors were custom. The authentication was fragile. The sc...
We built a RAG system last year. Three months of work. Every time we added a new data source—PostgreSQL, Salesforce, Notion—we rewrote connectors. The co...
I learned this the hard way. We spent three months building an agent system that could answer any customer query. It was beautiful. Smart. Self-correcting. W...
I spent three years at a fintech in 2020 watching teams confuse "availability" with "reliability" in Kubernetes. They'd brag about 99.99% uptime while their ...
Last year, I watched a team burn $2 million on an AI project. Six months of work. Zero production value. The CEO told me: "We hired the best ML engineers. Bo...
I learned the 30% Rule the hard way. Two years ago, my team spent six months building a custom LLM-powered recommendation engine. We poured $200K into infras...
I spent three months last year trying to get two SAP agents to talk to each other. One handled procurement. The other managed inventory. They should have bee...
I spent six months last year trying to connect two SAP agents. One handled procurement. The other managed inventory. They should have talked to each other. T...
I remember the exact moment my first distributed system died. 3 AM. My phone lit up with alerts. A Kafka cluster had split into two brain-halves, and our Cli...
I spent six months building a RAG system that broke every week. The LLM calls failed. The vector search timed out. The agent loops spiraled into infinite ret...
I spent six months building an agent system that fell apart in three hours. The problem wasn't the model. It was orchestration. Every tool promised "seamless...
I spent six months building a RAG pipeline that failed in production. The orchestrator wasn't the problem. My assumptions were. Everyone talks about which AI...
I spent six months building an AI-powered customer service system that fell over every three hours. Not because the models were bad. Not because the data was...
I spent six months last year choosing the wrong orchestration tool. My team at SIVARO was building a multi-agent system for a logistics client—real-time in...
Last quarter, I watched a team spend six weeks wiring together LangChain, a custom agent loop, and a half-baked RAG pipeline. Their first demo crashed in fro...
I spent three years building data pipelines that broke every weekend. The problem wasn't the individual tools. Kafka worked. ClickHouse worked. The problem w...
Four years ago, I watched a team burn six weeks building a RAG pipeline from scratch. They had the models. They had the data. They had the ambition. What the...
I've spent the last six years building data infrastructure and production AI systems at SIVARO. I've watched the tooling landscape shift from bespoke script...
You're building a system that needs to think, decide, and act autonomously. Not a chatbot. Not a workflow. An agent. The market is flooded with options. Ever...
I spent three months building an agent orchestration platform last year. It failed spectacularly. Not because the code was bad—but because I picked the wro...
I spent four years building on Kubernetes. At SIVARO, we ran 200+ microservices across multiple clusters. We had all the bells and whistles — service meshe...
I spent two years building a real-time analytics platform at a startup that shall remain unnamed. We started with Snowflake. By month six, we were bleeding c...
I walked into our SIVARO war room last month. Three junior engineers—all Gen Z—were debating Kafka partition strategies like they were arguing about a vi...
You push code to staging. Five minutes later, your pod disappears. No error in the logs. No obvious OOM footprint. Just a terse "Killed" event in kubectl get...
It was 3 AM. Our production ClickHouse cluster started serving 503s. I ssh'd into a node, and every pod was disappearing. Not crashing. Not failing health ch...
Production systems fail silently. That's what I learned the hard way. Four years ago, one of our pipelines processing 200K events/sec started dropping data e...
Let me tell you a story that broke last month. May 11, 2026. A major European energy grid operator detected anomalous outbound traffic from three control sys...
I spent three years building Kubernetes clusters for clients. Two years undoing them. The hardest lesson? K8s isn't evil. But it's been sold as the default s...
I once watched a $150,000 cluster of A100s hit 12% utilization for three days straight. The team blamed the framework. They blamed the network. They blamed e...
I've built seven production AI systems from scratch. Three failed spectacularly. The other four taught me lessons no textbook ever could. Everyone talks abou...
You know that feeling. Slack goes quiet. Your dashboards go gray. Someone in the #engineering channel [types](/articles/what-is-ai-assisted-development-a-pr...
I spent two weeks last December trying to figure out why my Game Boy emulator ran slower than a TI-84 on JavaScript. Then I scrapped the whole thing and buil...
Every week, someone asks me: "is chatgpt an ai agent?" Usually it's a founder trying to decide what to build. Or an engineer who's been told to "build an AI ...
I spent six months migrating a client off Snowflake to ClickHouse in 2023. The CTO thought I was insane. "Everyone uses Snowflake," he said. He wasn't wrong....
Last year, a CTO I know spent $80,000 on GPU clusters to serve a custom chatbot. Three months later, the project was dead. Not because the model was bad. But...
You’ve probably heard the rumor: Netflix runs everything on Kubernetes. Every microservice, every recommendation engine, every stream. It’s a nice story....
I’ll never forget the moment I realized I’d been thinking about models all wrong. It was late 2022. My team at SIVARO was trying to serve a single 175B-p...
I spent six months in 2023 building what I thought was the perfect RAG system. It failed. Not because the retrieval was bad or the generation was weak — bu...
I’m Nishaant Dixit. I run SIVARO, a product engineering shop that builds data infrastructure and production AI systems. In the last 18 months, I’ve watch...
You’re building something. Maybe it’s an automated customer support pipeline. Maybe it’s a system that writes code, or manages inventory, or negotiates...
I’ll cut the preamble. You’re here because you’ve heard the AWS vs. GCP debate a hundred times, and you’re tired of vague “both are good” answers...
I spent 18 months building a production AI system that failed — not because the models were bad, but because we couldn't get them to work together. Each ag...
I spent six months in 2023 convinced that Retrieval-Augmented Generation was just one thing: take a query, find documents, feed them to an LLM. Simple. Then ...
I remember the exact moment I stopped treating Kafka like a message queue and started treating it like what it actually is. It was 2019. We were building a f...
You’re building something with an LLM. Maybe a customer support agent that reads entire chat histories. Maybe a code assistant that needs full function bod...
You've got a cluster. Pods are running. The dashboard is green. Then it's 2 AM and your checkout service is returning 503s because a node died and etcd had a...
I’ve been building production AI systems since 2018. At SIVARO, we’ve shipped MoE models into real-world pipelines. I’ve seen the hype. I’ve also see...
I spent three months in 2023 convincing a healthcare client NOT to use Kubernetes. They had twelve microservices, three developers, and zero SRE experience. ...
I spent two years at a fintech in 2023 debugging why our RAG system kept serving garbage answers to customer support queries. The embeddings were fine. The v...
I spent five years building data pipelines before I let an AI tool touch my production code. That changed in early 2023 when my team faced a 12-week backlog ...
I spent six months in 2023 building what I thought was an "agentic" system. It wasn't. It was a fancy API orchestrator with a loop. The difference mattered...
I spent six months in 2023 trying to make a Mixture of Experts (MoE) model work for a client's real-time recommendation system. Six months. The paper said it...
I spent three months in 2022 trying to cram a 175B parameter model onto a single GPU node. It was stupid. We burned $80K on HGX boxes before I admitted the e...
I’ll be straight with you: most explanations of A2A (Agent-to-Agent) are either too abstract or too trivial. They say “it’s about agents talking to eac...
I remember the first time I hit sub-second query times on a billion-row dataset. I was skeptical. "This has to be cached," I thought. It wasn't. That was 201...
Last year I sat through a demo at a major cloud provider. The team was proud: their LLM serving stack handled 10K requests per second. Then they showed me th...
I remember the exact moment I realized single models were dead ends. It was 2019. We were building a recommendation system at SIVARO for a client. The data w...
Here's the short version: it depends on what you're building. I'm Nishaant Dixit, founder of SIVARO. My team builds data infrastructure and production AI sys...
I spent three months in 2021 trying to hire a "DevOps engineer" for a client building a real-time ML inference pipeline. We interviewed 40 people. Every sing...
Let me cut through the noise. I've been building production AI systems since 2018 at SIVARO. In 2023, I watched a dozen startups raise millions on "RAG-power...
Here's the thing about RAG pipelines: everyone talks about them, most implement them badly, and almost nobody admits how much they struggled getting them to ...
I've been building data systems since 2018. Before that, I was just another engineer who thought he understood streaming. Then I spent eighteen months migrat...
Most people think Apache Kafka is a message queue. It's not. At least, using it like one is a mistake I've seen destroy three projects before they shipped. I...
I spent six months in 2023 trying to squeeze 10x more throughput out of our LLM serving stack at SIVARO. We were handling production inference for a client p...
By Nishaant Dixit, Founder of SIVARO You're building an AI system that reads customer emails. At first, it works fine. Then someone sends a 3-page contract r...
Here’s the short version before we go deep: MCP stands for Model Context Protocol, and it’s the missing piece in making large language models actually us...
I spent two years at a fintech in 2021 watching our Kubernetes clusters fail in ways no one predicted. We had 47 microservices, three observability platforms...
You can absolutely train an LLM with your own data. But here’s the thing most people get wrong: they think "training" means one thing. It doesn’t. I run ...
I’ve been asked this question more times than I can count. Usually it comes from a founder who’s about to spend $500K on hardware. Or a CTO who just read...
I remember the exact moment I stopped caring about the terminology. It was March 2024, and I was staring at a production pipeline that kept hallucinating inv...
I've been asked this question more times than I can count. Usually by engineers who've been burned by VM sprawl. Sometimes by CTOs trying to cut cloud bills....
I got this question three times last week. Two from founders, one from a CTO who’d already spent $80K on infrastructure that didn’t work. Is Kubernetes t...
I spent last spring debugging an agent that kept booking conference rooms for meetings that didn’t exist. The agent had all the right tools—calendar APIs...
My first RAG system was a disaster. We spent three months building what we thought was a cutting-edge retrieval pipeline. The demos looked amazing. Then we p...
I hired my first platform engineer in 2019. I thought I knew what the role was. I was wrong. Back then, I needed someone to "manage our infrastructure." Six ...
I walked into a client's office in late 2022. They had 17 microservices, 4 different CI/CD pipelines, and a team of 40 engineers spending 30% of their ...
I spent six months in 2023 building a chatbot for a logistics client. We used a fine-tuned GPT-3.5. It cost us $12,000 API credits, hallucinated shipment dat...
I remember the day I first hit Kafka’s wall. Late 2019. We were building a real-time fraud detection pipeline for a payments client. The system would inges...
I spent three months in 2023 trying to figure out why our GPU cluster was burning money. We had 32 A100s. We were serving a 70B parameter model. Our utilizat...
I spent six months in 2023 thinking the Model Context Protocol was just another API spec. I was wrong. We were building an AI system for a logistics client a...
It’s late 2023. I’m sitting in a room with a CTO from a mid-sized logistics company. He’s just watched a demo of a multi-agent system booking freight, ...
I’ve spent the last six years building data infrastructure and production AI systems at SIVARO. We process 200K events per second. We run stateful workload...
I've sat on both sides of the table. As a founder hiring for SIVARO, I've watched candidates tank the Docker question in under 30 seconds. Not because they d...
Look, I spent two years ignoring Kubernetes. Thought it was overengineered. Another Google brainchild that solves problems you don't have. Then we hit 50 mic...
I've been building data infrastructure since 2018. Before SIVARO, I spent years watching teams throw Kubernetes at problems that didn't need it — and avoid...
I was sitting in a Bangalore conference room in 2017, watching a deployment fail for the fourth time that week. The developer said "it works on my machine." ...
I’ll tell you a story. Back in 2019, I was engineering a real-time recommendation system for a retail client. We needed to process 50,000 user events per s...
I’ve spent the last six years building and running production Kubernetes clusters at SIVARO. We process 200K events per second through our data infrastruct...
I spent four years building on Kubernetes. I sold it to clients. I wrote migration playbooks. And in 2023, I started helping teams move off it. Let me be cle...
You've got an AI coding agent. It writes beautiful PRs in the morning. By afternoon, it's hallucinating API endpoints and checking in...
Let me be direct: this isn't...
I get this question at least once a week. Founders, engineers, even VCs ask me: "does jeff bezos own aws?" Usually followed by a conspiracy theory about Jeff...
April 2025. I'm sitting in a customer meeting in Bangalore. The CTO leans forward. "Just tell me," he says. "Is ChatGPT an AI agent or not? Because my team k...
I'll keep it simple: ChatGPT is not an AI agent — but it can act like one, and that distinction is costing companies real money. Here's the problem. In 202...
I spent last Thursday night in a hotel room in Bangalore, running 47 parallel benchmarks against OpenAI’s GPT-4o and DeepSeek’s latest models. Not becaus...
I was sitting in a conference room in Bangalore, 2021, when a VP of Engineering asked me flat out: "is kubernetes the same as aws?" He wasn't joking. His tea...
I run a product [engineering](/blog/temporal-workflow-engine-comparison-what-actually-works-in)...
I’ve spent the last six years shipping AI systems into production at SIVARO. Not demos. Not Jupyter notebooks that never left the laptop. Real systems hand...
I spent three years building data pipelines for a fintech that eventually hit 200K events per second. My biggest mistake? Choosing the wrong agent architectu...
I built my first agent in 2020. It was a glorified if-else loop with an API call. I called it an "AI agent." I was wrong. Three years and a few burned-down p...
Most people think AWS is just servers in the cloud. They're wrong. I've spent years building data infrastructure and...
You're staring at a cluster of servers. Maybe 10. Maybe 1000. Each one running containers — Docker, Containerd, maybe Podman. And you're thinking: "I need ...
I remember the exact moment I realized temporal was the key we'd been missing. 2019. SIVARO was building a real-time fraud detection pipeline for a payments ...
I've spent the last six years building production AI systems at SIVARO. And I've watched too many teams burn months trying to stitch together AI components t...
You’re running an e-commerce checkout flow. A user clicks "buy" and nothing happens. Your support team lights up. Your CEO is on Slack. And the dashboard s...
You're running a real-time analytics dashboard. It's 2 AM. Your PostgreSQL instance is drowning under 50 million rows per hour. Queries that took 200ms yeste...
I spent 2018 to 2021 building data pipelines that kept collapsing under their own weight. We'd start with PostgreSQL, hit 50 million rows, and suddenly dashb...
Let me tell you a story. In 2019, I was building a real-time analytics dashboard for a logistics client. PostgreSQL was choking on 50 million rows per day. W...
I spent six years building data infrastructure. ClickHouse kept coming up in every architecture review, every POC, every "can you just make this query faster...
Most people think Gemini is just "the twins": two-faced, indecisive, chatty. That's surface-level stuff. I've spent years studying how systems work. Dat...
Ask ten DevOps engineers what Kubernetes is, and you'll get ten answers, most of them wrong. I learned this the hard way. In 2018, my team at SIVARO was bu...
You're building an AI system. You've got the models. You've got the data. And you're watching your accuracy metrics climb: 70%, 80%, 90%. Feels good. Then...
I spent six months trying to answer this question for a production system at a fintech client. We tested eight tools. Burned through three architectures. Lo...
I spent six weeks last year trying to answer this question for a client. Three engineers, twelve tools tested in production, one blown-up staging environmen...
I spent last Thursday evening in a Slack thread that turned into a therapy session. The CTO of a Series B data company — let's call him Ravi — was explai...
I was staring at a Kafka cluster crash log at 3 AM when it hit me. The system wasn't the problem. The data was. That moment taught me something about Franz K...
You’re sipping coffee, Slack goes silent. Then your app returns a 503. Then your smart fridge stops talking to your phone. Panic sets in. Everyone blames...
I’ve been building data infrastructure since 2018. Started SIVARO to help companies stop treating data like a side project. And I’ve lost count of how ma...
I was at a fintech meetup in Berlin back in 2019. Someone asked the panel: “why is apache kafka so popular?” The CTO of a payments startup shrugged and s...
I spent last Thursday debugging a stream processing pipeline. Kafka topic lag was spiking. Consumer group rebalancing was thrashing. My phone buzzed — a Sl...
Let me tell you a story. In 2019, my team at SIVARO was building a real-time data pipeline for a fintech client. We had microservices. We had containers. We ...
Most engineers I talk to assume Snowflake and ClickHouse solve the same problem. They don't. I learned this the hard way after burning six months rebuilding ...
Let me tell you a story. In 2019, I was at a startup that ran 47 microservices on bare metal. Deployments took 45 minutes. We had a "deployment committee"...
Let me tell you about the first time I saw AI orchestration fail spectacularly. It was March 2024. A fintech client had built a multi-agent system for fraud ...
I spent six months last year building an AI agent system for a logistics client. We tested every architecture pattern I could find. Some worked. Most didn't....
I spent three years helping a fintech company run Kubernetes in production. By year four, we were migrating off it. Not because we couldn't make it work...
You're reading this because you've heard the noise. Everyone's talking about AI agents. But when you strip away the marketing hype, what actually works in...
I’ve spent the last six [years](/articles/best-ai-orchestration-tool-heres-what-4-years-of-building) building data infrastructure and AI systems. In 202...
You're staring at a petabyte of event data. Your dashboard queries take 45 seconds. Your analytics team is quietly building shadow data pipelines in Python b...