Seldon Core

TL;DR

Seldon Core is an open-source model-serving framework from London-based Seldon Technologies, focused on multi-model inference graphs, A/B testing, and explainability.
Seldon Core v1 (Apache 2.0) is widely deployed in production; Seldon Core v2 (BSL-licensed since 2023) is the current generation with a redesigned data-plane and dataflow-graph model.
Strengths: composable inference graphs (predictor + transformer + explainer + router as a DAG), strong observability, native A/B and multi-armed-bandit routing.
Often compared with KServe — Seldon is the more featureful platform for compositional inference, KServe is the lighter and more CNCF-aligned choice for single-model serving.

Background#

Seldon Technologies was founded in London in 2014 by Alex Housley and Clive Cox, originally as a recommendation engine and pivoting to ML infrastructure. Seldon Core launched in 2018 as one of the first Kubernetes-native model-serving frameworks — predating KFServing/KServe — and shaped much of the early thinking around inference graphs on Kubernetes.

The project follows an open-core commercial model. Seldon Core v1 is Apache 2.0; v2 (the current generation) moved to a Business Source License in 2023, with a paid Seldon Enterprise edition layered on top. The licence change was contentious but has not slowed adoption in their target enterprise market.

Inference Graphs#

Seldon's headline abstraction is the inference graph — a DAG of components (predictors, transformers, explainers, routers, combiners) that requests flow through. A typical production graph might be: input transformer (normalisation) → router (A/B split or contextual bandit) → predictor (the model) → output transformer (post-processing) → explainer (sidecar).

Each node is a containerised microservice, communicating via gRPC or REST. The runtime handles tracing across the graph, propagates request metadata, and exposes per-node metrics. The same pattern can be assembled in KServe with transformers and explainers, but Seldon's graph model is more expressive — multi-step routing, ensemble combiners, and shadow testing are first-class.

Seldon Core v1 vs v2#

Property	Seldon Core v1	Seldon Core v2
First release	2018	2023
License	Apache 2.0	BSL (source-available)
Architecture	Per-graph deployment	Shared data-plane (MLServer)
Graph model	Static SeldonDeployment	Pipeline + Model + Experiment CRDs
Multi-model serving	Limited	Native (LRU model loading)
Streaming	No	Yes (Kafka native)
Recommendation	Maintenance only	Active development

A/B Testing and Multi-Armed Bandits#

Seldon ships built-in routers for weighted A/B splits and Thompson-sampling multi-armed bandits. This is the differentiator vs simpler servers — production teams running experiments on customer-facing models often pick Seldon for the routing layer alone, even when the model serving could be done in vLLM or Triton.

Explainability#

Seldon authored Alibi and Alibi Detect, the open-source libraries for model explanations (SHAP, anchors, counterfactuals) and outlier/drift detection respectively. Both integrate as graph nodes — an Explainer sidecar can be added to any SeldonDeployment to produce per-request feature attributions. This matters in regulated industries (financial services, healthcare) where every production prediction must be explainable.

For UK financial-services workloads where FCA expects model explainability evidence, Seldon's Alibi integration is the most direct path. KServe can do the same via custom explainer containers but with more wiring.

When to Pick Seldon vs KServe#

Pick Seldon Core (v2) when your workload is fundamentally compositional — multi-step pipelines, A/B-tested ensembles, drift detection wired into the request path — and you accept a commercial licence. Pick KServe when you serve mostly single models behind an OpenAI-compatible endpoint, want CNCF-aligned governance, and prefer Apache 2.0 throughout.

Mid-2026, KServe has the wider open-source adoption — particularly for LLMs via vLLM — while Seldon retains a strong enterprise foothold in regulated industries where its routing and explainability features pay off.

References

Seldon Core Documentation · Seldon Technologies
seldon-core on GitHub · GitHub
Alibi (explanations library) · GitHub

Background#

Inference Graphs#

Seldon Core v1 vs v2#

Property	Seldon Core v1	Seldon Core v2
First release	2018	2023
License	Apache 2.0	BSL (source-available)
Architecture	Per-graph deployment	Shared data-plane (MLServer)
Graph model	Static SeldonDeployment	Pipeline + Model + Experiment CRDs
Multi-model serving	Limited	Native (LRU model loading)
Streaming	No	Yes (Kafka native)
Recommendation	Maintenance only	Active development

A/B Testing and Multi-Armed Bandits#

Explainability#

When to Pick Seldon vs KServe#

Seldon Core

Background#

Inference Graphs#

Seldon Core v1 vs v2#

A/B Testing and Multi-Armed Bandits#

Explainability#

When to Pick Seldon vs KServe#

References

Browse all entries

Deploy on Yobitel

Seldon Core

Background#

Inference Graphs#

Seldon Core v1 vs v2#

A/B Testing and Multi-Armed Bandits#

Explainability#

When to Pick Seldon vs KServe#

References

Browse all entries

Deploy on Yobitel