<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>LLMOps.si - Articles</title><description>Practical, vendor-neutral guides on running large language models in production.</description><link>https://llmops.si/</link><language>en</language><item><title>How to build your first eval dataset</title><link>https://llmops.si/articles/how-to-build-your-first-eval-dataset/</link><guid isPermaLink="true">https://llmops.si/articles/how-to-build-your-first-eval-dataset/</guid><description>A practical, step-by-step guide to building an LLM eval dataset from real traffic - what a row looks like, how to score it, how many cases you need, and how to wire it into CI.</description><pubDate>Sat, 20 Jun 2026 00:00:00 GMT</pubDate><category>evaluation</category><category>how-to</category><category>evals</category></item><item><title>What to log in an LLM trace</title><link>https://llmops.si/articles/what-to-log-in-an-llm-trace/</link><guid isPermaLink="true">https://llmops.si/articles/what-to-log-in-an-llm-trace/</guid><description>A field-by-field guide to what belongs in a production LLM trace - request IDs, prompt versions, retrieval, tokens, latency, cost and outcome - plus what to redact.</description><pubDate>Fri, 19 Jun 2026 00:00:00 GMT</pubDate><category>observability</category><category>tracing</category><category>how-to</category></item><item><title>How to calculate hallucination rate</title><link>https://llmops.si/articles/how-to-calculate-hallucination-rate/</link><guid isPermaLink="true">https://llmops.si/articles/how-to-calculate-hallucination-rate/</guid><description>A practical method for measuring LLM hallucination (faithfulness) rate in production - how to define it, sample it, judge it, and track it over time.</description><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate><category>evaluation</category><category>hallucination</category><category>how-to</category></item><item><title>Prompt versioning with GitHub</title><link>https://llmops.si/articles/prompt-versioning-with-github/</link><guid isPermaLink="true">https://llmops.si/articles/prompt-versioning-with-github/</guid><description>A concrete workflow for versioning LLM prompts in GitHub - file layout, pull-request review, eval gating in CI, and one-step rollback - without buying a dedicated tool.</description><pubDate>Wed, 17 Jun 2026 00:00:00 GMT</pubDate><category>prompt management</category><category>how-to</category><category>ci-cd</category></item><item><title>RAG freshness monitoring checklist</title><link>https://llmops.si/articles/rag-freshness-monitoring-checklist/</link><guid isPermaLink="true">https://llmops.si/articles/rag-freshness-monitoring-checklist/</guid><description>A focused checklist for keeping a RAG index fresh in production - detecting stale content, missing documents, embedding drift and re-index failures before users do.</description><pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate><category>rag</category><category>monitoring</category><category>checklist</category></item><item><title>LLM incident response template</title><link>https://llmops.si/articles/llm-incident-response-template/</link><guid isPermaLink="true">https://llmops.si/articles/llm-incident-response-template/</guid><description>A ready-to-adapt incident response template for LLM applications - severity levels, the first 15 minutes, mitigation levers unique to LLMs, and a post-mortem structure.</description><pubDate>Mon, 15 Jun 2026 00:00:00 GMT</pubDate><category>governance</category><category>incident-response</category><category>template</category></item><item><title>How to choose between Langfuse, LangSmith, Braintrust and Helicone</title><link>https://llmops.si/articles/choosing-langfuse-langsmith-braintrust-helicone/</link><guid isPermaLink="true">https://llmops.si/articles/choosing-langfuse-langsmith-braintrust-helicone/</guid><description>A decision framework for picking an LLM observability and evals platform - how Langfuse, LangSmith, Braintrust and Helicone differ, and which fits your team.</description><pubDate>Sun, 14 Jun 2026 00:00:00 GMT</pubDate><category>observability</category><category>evals</category><category>tools</category><category>comparison</category></item><item><title>What your CTO should ask before approving an LLM launch</title><link>https://llmops.si/articles/cto-questions-before-approving-llm-launch/</link><guid isPermaLink="true">https://llmops.si/articles/cto-questions-before-approving-llm-launch/</guid><description>The questions a technical leader should ask before signing off on shipping an LLM feature to production - covering evals, observability, cost, security, rollback and governance.</description><pubDate>Sat, 13 Jun 2026 00:00:00 GMT</pubDate><category>governance</category><category>leadership</category><category>checklist</category></item><item><title>LLM Governance: Audit trails, approvals and explainability</title><link>https://llmops.si/articles/llm-governance-audit-trails-and-human-review/</link><guid isPermaLink="true">https://llmops.si/articles/llm-governance-audit-trails-and-human-review/</guid><description>A practical guide to governing LLM applications - audit trails, human-in-the-loop approval gates, explainability and reproducibility, and data retention you could show an auditor.</description><pubDate>Mon, 08 Jun 2026 00:00:00 GMT</pubDate><category>governance</category><category>compliance</category><category>audit</category></item><item><title>LLM Deployment: CI/CD, staging and one-step rollback</title><link>https://llmops.si/articles/llm-deployment-ci-cd-staging-rollback/</link><guid isPermaLink="true">https://llmops.si/articles/llm-deployment-ci-cd-staging-rollback/</guid><description>How to ship LLM changes safely - CI with eval gates, a staging mirror, progressive rollout, config-driven rollback and provider fallback. The deployment layer of the LLMOps stack.</description><pubDate>Mon, 08 Jun 2026 00:00:00 GMT</pubDate><category>deployment</category><category>ci-cd</category><category>rollback</category></item><item><title>What is LLMOps?</title><link>https://llmops.si/articles/what-is-llmops/</link><guid isPermaLink="true">https://llmops.si/articles/what-is-llmops/</guid><description>LLMOps is the discipline of deploying, monitoring and improving large language model applications after the prototype works. A practical guide to what it covers, why it matters, and where to start.</description><pubDate>Fri, 05 Jun 2026 00:00:00 GMT</pubDate><category>fundamentals</category><category>definition</category></item><item><title>LLMOps vs MLOps: What changes with large language models?</title><link>https://llmops.si/articles/llmops-vs-mlops/</link><guid isPermaLink="true">https://llmops.si/articles/llmops-vs-mlops/</guid><description>LLMOps builds on MLOps but adds prompts-as-code, non-determinism, LLM-judged evaluation, prompt-injection security and live token budgets. A practical guide to what actually changes - and what carries over.</description><pubDate>Thu, 04 Jun 2026 00:00:00 GMT</pubDate><category>fundamentals</category><category>mlops</category></item><item><title>The LLMOps Stack: The 8 layers of production LLM systems</title><link>https://llmops.si/articles/llmops-stack-8-layers-of-production-llm-systems/</link><guid isPermaLink="true">https://llmops.si/articles/llmops-stack-8-layers-of-production-llm-systems/</guid><description>Prompt management, evaluation, observability, cost control, RAG operations, security, governance and deployment - a deep dive into the eight layers between an LLM prototype and production, with failure modes and a checklist for each.</description><pubDate>Wed, 03 Jun 2026 00:00:00 GMT</pubDate><category>stack</category><category>architecture</category></item><item><title>LLM Observability: What to monitor in production</title><link>https://llmops.si/articles/llm-observability-what-to-monitor-in-production/</link><guid isPermaLink="true">https://llmops.si/articles/llm-observability-what-to-monitor-in-production/</guid><description>A production guide to LLM observability - the signals that matter, how to instrument with OpenTelemetry or Langfuse, what breaks without it, and a minimal-vs-mature path.</description><pubDate>Tue, 02 Jun 2026 00:00:00 GMT</pubDate><category>observability</category><category>monitoring</category><category>tracing</category></item><item><title>LLM Evaluation: How to test prompts, RAG and agents</title><link>https://llmops.si/articles/llm-evaluation-testing-prompts-rag-agents/</link><guid isPermaLink="true">https://llmops.si/articles/llm-evaluation-testing-prompts-rag-agents/</guid><description>A production-grade guide to LLM evaluation - what breaks without it, what to measure, how to write an LLM-as-judge and an eval runner, and how to gate releases the way you gate code on tests.</description><pubDate>Mon, 01 Jun 2026 00:00:00 GMT</pubDate><category>evaluation</category><category>testing</category><category>evals</category></item><item><title>Prompt Versioning: Why prompts should be treated like code</title><link>https://llmops.si/articles/prompt-versioning-treat-prompts-like-code/</link><guid isPermaLink="true">https://llmops.si/articles/prompt-versioning-treat-prompts-like-code/</guid><description>A one-line prompt edit can move behaviour as much as a model swap. A production guide to treating prompts as versioned artifacts - change-tested, attributable and rollbackable in one step.</description><pubDate>Sun, 31 May 2026 00:00:00 GMT</pubDate><category>prompts</category><category>versioning</category><category>prompt-management</category></item><item><title>RAGOps: How to monitor retrieval quality</title><link>https://llmops.si/articles/ragops-how-to-monitor-retrieval-quality/</link><guid isPermaLink="true">https://llmops.si/articles/ragops-how-to-monitor-retrieval-quality/</guid><description>Most &quot;the model is wrong&quot; bugs are retrieval bugs. A production guide to measuring retrieval quality, tuning chunking and embeddings, keeping the index fresh, and failing safe when nothing matches.</description><pubDate>Sat, 30 May 2026 00:00:00 GMT</pubDate><category>rag</category><category>retrieval</category><category>embeddings</category></item><item><title>LLM Cost Control: Tokens, caching and model routing</title><link>https://llmops.si/articles/llm-cost-control-tokens-caching-model-routing/</link><guid isPermaLink="true">https://llmops.si/articles/llm-cost-control-tokens-caching-model-routing/</guid><description>Token spend compounds quietly until a finance review forces a panic. A production guide to unit economics, caching, model routing, right-sizing and budget alerts - before the bill, not after.</description><pubDate>Fri, 29 May 2026 00:00:00 GMT</pubDate><category>cost</category><category>caching</category><category>routing</category></item><item><title>LLM Security: Prompt injection, data leakage and guardrails</title><link>https://llmops.si/articles/llm-security-prompt-injection-data-leakage-guardrails/</link><guid isPermaLink="true">https://llmops.si/articles/llm-security-prompt-injection-data-leakage-guardrails/</guid><description>An LLM with tools and data access is an attack surface. A practical guide to prompt injection, data leakage and excessive agency - with guardrail patterns and the controls that contain them.</description><pubDate>Thu, 28 May 2026 00:00:00 GMT</pubDate><category>security</category><category>guardrails</category><category>prompt-injection</category></item><item><title>LLMOps Checklist: From prototype to production</title><link>https://llmops.si/articles/llmops-checklist-from-prototype-to-production/</link><guid isPermaLink="true">https://llmops.si/articles/llmops-checklist-from-prototype-to-production/</guid><description>The narrative companion to the production-readiness checklist - the two questions that separate a demo from a system, walked across all eight layers of the LLMOps stack.</description><pubDate>Wed, 27 May 2026 00:00:00 GMT</pubDate><category>checklist</category><category>production</category></item></channel></rss>