#production

22 posts

May 5, 2026 · 6 min read Part 1

Built with Agentic Engineering -- The New Way Software Gets Made

Forget vibe coding. This is a real workflow: plan with agents, verify with QA, review as human, ship to production. Here is what it actually looks like in practice.

May 5, 2026 · 8 min read Part 5

From Agent to Production -- The Pipeline That Lets You Sleep

The full CI/CD pipeline for agentic code: spec to deploy in one workflow. Real examples from luonghongthuan.com and CubLearn. How to build the system that runs while you sleep.

agentic engineering ci-cd Cloudflare Pages +3

Apr 20, 2026 · 16 min read Part 12

Cost, Model Selection, and Taking Your AI Team to Production

The finale. Optimize costs, match models to agents, calculate ROI, and make your AI team production-ready.

ai multi-agent vibe-coding +4

Apr 18, 2026 · 18 min read Part 2

Build an Agentic AI System From Scratch: Step-by-Step Implementation Guide

Complete hands-on guide to building a production-ready agentic AI system. From project setup to deployment — every layer implemented with working code, tests, and Docker compose.

ai agentic-ai tutorial +7

Apr 18, 2026 · 12 min read Part 1

Build a $0 Agentic AI System: Architecture That Scales from Prototype to Production

A complete technical guide to building a profitable agentic AI system using only open-source tools — with retrieval, orchestration, tool use, and observability. Includes architecture diagrams and real cost analysis.

ai agentic-ai architecture +7

Apr 6, 2026 · 6 min read

Governing AI Agents in Production: What Microsoft's New Toolkit Gets Right

Microsoft released the Agent Governance Toolkit on April 3, 2026 — seven packages covering policy enforcement, cryptographic identity, runtime isolation, and compliance automation. Here's a practical breakdown from someone building production agents.

ai-agents security governance +3

Mar 27, 2026 · 6 min read

AI Agents in Production 2026: The Shift from Copilot to Autopilot

90% of developers now use AI at work. But the real shift in March 2026 is agents moving from suggestion-mode to autonomous execution. Here's what that actually looks like in production systems and what breaks when you go too far too fast.

ai agents developer-tools +2

Mar 25, 2026 · 5 min read

Mistral 3 in Production: What Open-Source AI Gets Right (and Wrong) in 2026

Mistral Large 3's MoE architecture delivers 92% of GPT-5.2's performance at 15% of the cost. As a technical lead who has run open-source LLMs in production, here's where it works and where it fails.

mistral open-source-ai llm +2

Mar 24, 2026 · 6 min read

What Production AI Agents Actually Look Like in 2026 — Not the Demo, the Reality

Gartner says 40% of enterprise apps will embed AI agents this year. But 40% of agentic projects will be scrapped by 2027. Here's what separates the teams that ship production agents from those that get stuck in pilots forever.

ai ai-agents enterprise +2

Mar 16, 2026 · 19 min read

Pipecat Voice Agent in Production: Complete Guide to Issues, Optimization & Scalable Architecture

Deep-dive into 26+ real production issues with Pipecat voice agents — latency, audio quality, memory leaks, VAD problems, and pipeline freezes — plus battle-tested optimization strategies for building scalable voice AI systems.

pipecat voice-ai production +1

Mar 10, 2026 · 22 min read Part 8

Production Voice AI for Research at Scale: Deployment and Go-Live — From Docker Compose to 200 Concurrent Sessions

The complete deployment guide: Docker multi-stage builds, Kubernetes orchestration, CI/CD with GitHub Actions, zero-downtime deploys, go-live checklist, production monitoring with Prometheus/Grafana, and the operational runbook that keeps voice AI running at scale.

voice-ai s2s research +7

Mar 8, 2026 · 17 min read Part 7

Production Voice AI for Research at Scale: Multi-Language Voice AI — When Your Agent Needs to Think in Japanese

Multi-language voice AI for research: language detection, provider routing (Gemini Live for 30+ languages, OpenAI Realtime for English), locale-aware VAD tuning, i18n prompt packs, and cross-language analysis pipelines.

voice-ai s2s research +5

Mar 6, 2026 · 34 min read Part 9

Building KidSpark: Production — Analytics, Monitoring, Crash Reporting, and Iteration

Launch day is just the beginning. Privacy-compliant analytics, crash reporting that respects child data, and a feedback loop that actually improves the product.

analytics monitoring mobile +2

Mar 6, 2026 · 13 min read Part 6

Production Voice AI for Research at Scale: What Breaks at 200 Concurrent Sessions

Scaling from 10 sessions/week to 200 concurrent. The enrichment bottleneck (30,000 API calls), session recovery for dropped WebRTC connections, provider failover, and the operational metrics that keep it all visible.

voice-ai s2s research +5

Mar 4, 2026 · 12 min read Part 5

Production Voice AI for Research at Scale: The Real Cost

Real-time per-minute cost tracking, provider comparison (OpenAI Realtime ~$0.053/min vs Gemini Live ~$0.029/min), budget enforcement with soft/hard limits, and the self-hosting math that saves 90% on transport.

voice-ai s2s research +4

Mar 2, 2026 · 13 min read Part 4

Production Voice AI for Research at Scale: From Recording to Insight

The 3-stage automatic pipeline that turns raw interview recordings into enriched, queryable research data in 3-7 minutes. Transcription, enrichment, analysis — with the transcript batching trick that cut DB load by 80%.

voice-ai s2s research +5

Mar 1, 2026 · 8 min read Part 6

Tech Coffee Break #6: Cool Demo, But Will It Work Monday Morning?

Putting AI into production is nothing like building a demo. Two tech leads discuss costs, hallucinations, latency, guard rails, and what actually breaks when real users hit your AI features.

tech-coffee ai production +2

Feb 28, 2026 · 11 min read Part 3

Production Voice AI for Research at Scale: Multi-Phase State Machines

Research interviews follow structured protocols with distinct phases. How to build an LLM-driven state machine with next_phase() function calling and dynamic instruction swapping via set_chat_ctx().

voice-ai s2s research +5

Feb 27, 2026 · 6 min read Part 7

Deploy, Scale & Pricing for Azure Voice Live in Production

Your demo works. Now make it production-ready for 1,000 concurrent interviews. Full guide to deployment options, WebSocket scaling, Azure pricing breakdown, cost estimations, and monitoring for Azure Voice Live.

azure-voice-live azure voice-ai +5

Feb 26, 2026 · 12 min read Part 2

Production Voice AI for Research at Scale: Zombie Agents, Pre-Warming, and the 5 Bugs That Cost Us Weeks

The production pain points nobody warns you about: zombie agents, metadata latency, pre-warming for 1-2s time-to-first-voice, VAD tuning for research respondents, and provider quirks.

voice-ai s2s research +4

Feb 24, 2026 · 10 min read Part 1

Production Voice AI for Research at Scale: The Architecture Nobody Warns You About

Why research interviews need server-side voice agents, the three-tier architecture, room metadata as configuration transport, and the 100-500ms propagation latency nobody tells you about.

voice-ai s2s research +4

Feb 18, 2026 · 7 min read Part 10

AWS Full-Stack Mastery: Production Readiness, Multi-Region & Disaster Recovery

Take Kids Learn production-ready: multi-region Active-Passive with Aurora Global Database, Route 53 health-checked failover, auto-scaling strategies, load testing with Artillery, chaos engineering, runbooks, and a comprehensive launch checklist.

aws multi-region disaster-recovery +2

← All posts

#production

Stay in the loop