<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Yogreet Global — Engineering Blog</title>
    <link>https://yogreet.com/blog/</link>
    <atom:link href="https://yogreet.com/blog/rss.xml" rel="self" type="application/rss+xml"/>
    <description>Daily, specific engineering insights and guides on AI/LLM cost, microservices, cloud cost, performance and scaling — from Yogreet Global, an infrastructure-first product engineering studio.</description>
    <language>en</language>
    <lastBuildDate>Thu, 25 Jun 2026 14:36:04 GMT</lastBuildDate>
    <item>
      <title>Choosing the Right Model-Routing Threshold for Frontier Models</title>
      <link>https://yogreet.com/blog/choosing-the-right-model-routing-threshold-for-frontier-models</link>
      <guid isPermaLink="true">https://yogreet.com/blog/choosing-the-right-model-routing-threshold-for-frontier-models</guid>
      <pubDate>Thu, 25 Jun 2026 14:36:04 GMT</pubDate>
      <description>Learn how to effectively decide which requests escalate to frontier models in AI systems, optimizing performance and cost.</description>
    </item>
    <item>
      <title>Semantic Caching: Cost Reduction and Accuracy Risks in LLMs</title>
      <link>https://yogreet.com/blog/semantic-caching-cost-reduction-and-accuracy-risks-in-llms</link>
      <guid isPermaLink="true">https://yogreet.com/blog/semantic-caching-cost-reduction-and-accuracy-risks-in-llms</guid>
      <pubDate>Thu, 25 Jun 2026 12:21:00 GMT</pubDate>
      <description>Explore semantic caching for LLM apps to cut costs by 70%, while understanding potential accuracy pitfalls.</description>
    </item>
    <item>
      <title>Why Startups End Up Rewriting Their Architecture (and How to Avoid It)</title>
      <link>https://yogreet.com/blog/why-startups-rewrite-architecture</link>
      <guid isPermaLink="true">https://yogreet.com/blog/why-startups-rewrite-architecture</guid>
      <pubDate>Thu, 25 Jun 2026 03:30:00 GMT</pubDate>
      <description>The four causes of the expensive rewrite that hits when growth arrives — and how designing clean seams early avoids it.</description>
    </item>
    <item>
      <title>How to Reduce AI API &amp; Token Costs: A Practical Guide</title>
      <link>https://yogreet.com/blog/how-to-reduce-ai-api-costs</link>
      <guid isPermaLink="true">https://yogreet.com/blog/how-to-reduce-ai-api-costs</guid>
      <pubDate>Thu, 25 Jun 2026 03:30:00 GMT</pubDate>
      <description>The four levers that move an AI bill most — prompt caching, model routing, batching and output discipline.</description>
    </item>
    <item>
      <title>Microservices vs Monolith for a Startup: How to Actually Decide</title>
      <link>https://yogreet.com/blog/microservices-vs-monolith-startup</link>
      <guid isPermaLink="true">https://yogreet.com/blog/microservices-vs-monolith-startup</guid>
      <pubDate>Thu, 25 Jun 2026 03:30:00 GMT</pubDate>
      <description>When a modular monolith wins, the three signals that justify splitting, and how to migrate without a rewrite.</description>
    </item>
    <item>
      <title>How Much Does It Cost to Scale an AI App?</title>
      <link>https://yogreet.com/blog/cost-to-scale-ai-app</link>
      <guid isPermaLink="true">https://yogreet.com/blog/cost-to-scale-ai-app</guid>
      <pubDate>Thu, 25 Jun 2026 03:30:00 GMT</pubDate>
      <description>The real cost drivers, why per-user cost creeps up, and how to keep it flat from 100 to 100,000 users.</description>
    </item>
    <item>
      <title>Seven Lines of Code: How Stripe Built Infrastructure-First</title>
      <link>https://yogreet.com/blog/stripe-api-first-infrastructure</link>
      <guid isPermaLink="true">https://yogreet.com/blog/stripe-api-first-infrastructure</guid>
      <pubDate>Tue, 23 Jun 2026 03:30:00 GMT</pubDate>
      <description>Treating payments as infrastructure, not a feature, meant the core API never needed a rewrite.</description>
    </item>
    <item>
      <title>The Three-Day Outage That Rebuilt Netflix</title>
      <link>https://yogreet.com/blog/netflix-microservices-rebuild</link>
      <guid isPermaLink="true">https://yogreet.com/blog/netflix-microservices-rebuild</guid>
      <pubDate>Tue, 23 Jun 2026 03:30:00 GMT</pubDate>
      <description>How a 2008 database corruption triggered a seven-year migration to 1,000+ cloud microservices.</description>
    </item>
    <item>
      <title>50 Engineers, 2 Billion Users: WhatsApp Infrastructure Efficiency</title>
      <link>https://yogreet.com/blog/whatsapp-infrastructure-efficiency</link>
      <guid isPermaLink="true">https://yogreet.com/blog/whatsapp-infrastructure-efficiency</guid>
      <pubDate>Tue, 23 Jun 2026 03:30:00 GMT</pubDate>
      <description>How one unfashionable language choice let a ~50-person team serve hundreds of millions of users.</description>
    </item>
    <item>
      <title>13 People, One Server, a Billion-Dollar App: Instagram</title>
      <link>https://yogreet.com/blog/instagram-engineering-journey</link>
      <guid isPermaLink="true">https://yogreet.com/blog/instagram-engineering-journey</guid>
      <pubDate>Tue, 23 Jun 2026 03:30:00 GMT</pubDate>
      <description>How a tiny team avoided a database rewrite by designing to shard cleanly before they needed to.</description>
    </item>
    <item>
      <title>The Game That Failed Twice — Then Became Slack</title>
      <link>https://yogreet.com/blog/slack-from-failed-game-to-saas</link>
      <guid isPermaLink="true">https://yogreet.com/blog/slack-from-failed-game-to-saas</guid>
      <pubDate>Tue, 23 Jun 2026 03:30:00 GMT</pubDate>
      <description>How an internal tool built to support a failing game studio became the actual business.</description>
    </item>
  </channel>
</rss>
