Directory

Tools & Platforms

The LLMOps landscape by category - filterable by what actually drives the decision: open source, self-hostable, OpenTelemetry, compliance and team fit. Editorial, not paid placements.

Observability Evals Prompt management RAG / vector Guardrails Deployment Cost tracking

Attributes last verified June 2026. Attributes (especially SOC 2* and OpenTelemetry) are best-effort and change often - verify with the vendor before relying on them for procurement.

01 Observability

Logs, traces, latency and token usage for LLM applications.

Maps to the Observability layer of the stack.

Langfuse

Open-source tracing, evals and prompt management.

OSS Self-host OTel SOC 2* startupenterprise

Open source + cloud · data self-host or SaaS

Visit → Observability

LangSmith

Tracing and evals from the LangChain team.

Self-host SOC 2* startupenterprise

Free tier + paid · data self-host or SaaS

Visit → Observability

Helicone

Proxy-based logging, caching and cost tracking.

OSS Self-host OTel SOC 2* startup

Free tier + usage · data self-host or SaaS

Visit → Observability

Arize Phoenix

Open-source LLM tracing and evaluation.

OSS Self-host OTel startupenterprise

Open source + cloud · data self-host or SaaS

Visit → Observability

Fiddler AI

Model monitoring and observability platform.

Self-host SOC 2* enterprise

Enterprise · data self-host or SaaS

02 Evals

Datasets, scoring and regression testing for prompts, RAG and agents.

Maps to the Evaluation layer of the stack.

Braintrust

Eval and experimentation platform for LLM apps.

Self-host SOC 2* startupenterprise

Free tier + paid · data self-host or SaaS

Visit → Evals

LangSmith

Datasets, LLM-as-judge and offline evals.

Self-host SOC 2* startupenterprise

Free tier + paid · data self-host or SaaS

Visit → Evals

OpenAI Evals

Open-source framework for model evaluations.

OSS Self-host startup

Open source · data stays in your env

Visit → Evals

Giskard

Open-source testing for ML and LLM systems.

OSS Self-host startupenterprise

Open source + cloud · data self-host or SaaS

03 Prompt management

Versioning, change testing and rollback for prompts and configs.

Maps to the Prompt management layer of the stack.

Prompt management

PromptLayer

Prompt registry, versioning and analytics.

Free tier + paid · data leaves your env

Visit → Prompt management

Humanloop

Prompt management, evals and collaboration.

Self-host SOC 2* enterprise

Paid · data self-host or SaaS

Visit → Prompt management

Langfuse

Versioned prompts alongside traces and evals.

OSS Self-host OTel SOC 2* startupenterprise

Open source + cloud · data self-host or SaaS

04 RAG / vector

Vector storage and retrieval infrastructure for grounded answers.

Maps to the RAG operations layer of the stack.

Pinecone

Managed vector database at scale.

SOC 2* startupenterprise

Usage-based · data leaves your env

Visit → RAG / vector

Weaviate

Open-source vector database with hybrid search.

OSS Self-host SOC 2* startupenterprise

Open source + cloud · data self-host or SaaS

Visit → RAG / vector

Qdrant

Open-source vector search engine.

OSS Self-host SOC 2* startupenterprise

Open source + cloud · data self-host or SaaS

Visit → RAG / vector

Redis

Vector search on top of in-memory data.

OSS Self-host SOC 2* enterprise

Open source + cloud · data self-host or SaaS

05 Guardrails

Input/output validation, PII, injection and safety controls.

Maps to the Security layer of the stack.

Guardrails AI

Open-source validation of LLM inputs and outputs.

OSS Self-host startup

Open source · data stays in your env

Visit → Guardrails

Lakera

Prompt-injection and AI security guardrails.

SOC 2* enterprise

Paid · data leaves your env

Visit → Guardrails

Protect AI

Security tooling for the AI/ML lifecycle.

Self-host SOC 2* enterprise

Enterprise · data self-host or SaaS

06 Deployment

Serving, scaling and shipping LLM applications to production.

Maps to the Deployment layer of the stack.

BentoML

Open-source model serving and packaging.

OSS Self-host startupenterprise

Open source + cloud · data self-host or SaaS

Visit → Deployment

Modal

Serverless compute for AI workloads.

Usage-based · data leaves your env

Visit → Deployment

Replicate

Run and deploy models via API.

Usage-based · data leaves your env

Visit → Deployment

Databricks

Data + AI platform with model serving.

SOC 2* enterprise

Enterprise · data self-host or SaaS

07 Cost tracking

Token accounting, caching and routing to keep spend in check.

Maps to the Cost control layer of the stack.

Helicone

Per-request cost tracking and caching.

OSS Self-host OTel SOC 2* startup

Free tier + usage · data self-host or SaaS

Visit → Cost tracking

Langfuse

Cost and token usage per trace and feature.

OSS Self-host OTel SOC 2* startupenterprise

Open source + cloud · data self-host or SaaS

Visit → Cost tracking

Portkey

AI gateway with routing, caching and budgets.

OSS Self-host OTel SOC 2* startupenterprise

Free tier + paid · data self-host or SaaS

Missing a tool, or an attribute out of date?

This directory is community-maintained and vendor-neutral. Suggest a tool, or flag an attribute we should correct.

Contribute or correct →