TL;DR
- Omniscient Compute is Yobitel's vendor-neutral compute search engine — a Yobitel-operated marketplace that lets customers search, filter, and rank GPU and accelerator capacity across 25+ public, neocloud, regional, sovereign, and community providers from a single surface.
- Sovereignty is a first-class filter: every result carries a compliance tag (UK NCSC OFFICIAL, EU Data Boundary, US FedRAMP-equivalent, HIPAA-eligible, ISO 27001 and SOC 2 attested) so a regulated workload never surfaces non-eligible capacity.
- Coverage spans every major GPU family, every common term type (on-demand, spot, reserved 1-year and 3-year, savings plans), refreshed on a six-hour cadence and ranked by a Composite Value Score sourced from InferenceBench.
- Customer-facing surfaces today: the Marketplace UI at omniscient.yobitel.com (live, included with every Yobitel workspace) and the FOCUS 1.1 billing export (live, to customer-owned object storage). The omniscient CLI, Python and TypeScript SDKs, and REST API are in active development and will be released into early access.
- Distributed and multi-provider placement is first-class: search for clusters of 8 to 2,048 nodes, filter by interconnect (NVLink within node, InfiniBand within rack, Ethernet across racks, WAN across regions), and blend reservation + spot capacity across providers when one alone cannot satisfy the request.
- Yobitel runs the index, the refresh cadence, and the ranking on the customer's behalf; the customer owns search, decisions, and procurement.
Overview#
Modern AI workloads no longer fit inside one cloud. GPU capacity lives on neoclouds, sovereign regions hold regulated data, hyperscalers carry the bulk of supporting services, and regional operators frequently undercut the headline rate on specific SKUs. No single vendor sells the right combination at the right price in the right region, and no procurement team can hold tens of thousands of SKUs across twenty-five providers in their head. Omniscient Compute is the surface Yobitel built to make that problem tractable.
Omniscient Compute is a vendor-neutral compute search engine, operated end-to-end by Yobitel. The customer opens the marketplace, narrows the search by GPU family, region, term type, sovereignty pin, and price ceiling, and gets back a ranked list of providers and SKUs. Every result is annotated with a Composite Value Score — a single number that fuses benchmark quality, throughput, and dollar-per-token from InferenceBench so customers can compare a tier-1 hyperscaler quote against a sovereign neocloud quote in the same view. Pricing is refreshed every six hours.
Compared with manual cross-provider procurement — calling each vendor's pricing API, normalising units and currencies by hand, then cross-referencing each provider's compliance posture against a sovereignty requirement — Omniscient consolidates the question into one query. Compared with single-cloud FinOps tools, Omniscient is genuinely multi-provider; compared with bespoke in-house comparison tooling, Omniscient carries Yobitel's compliance metadata model and a FOCUS-aligned billing export.
Omniscient sits at the layer below Yobibyte in the Yobitel stack. Yobibyte calls Omniscient internally at admission and reconciliation, so every Inference and FineTune placement decision is grounded in current capacity and pricing. FinOps teams, platform teams, and procurement teams use Omniscient directly without ever adopting Yobibyte. Yobitel Communications, the UK-headquartered AI infrastructure company that operates Omniscient, indexes its own NeoCloud capacity alongside every other provider without preferential ranking.
Quick start#
The customer experience for getting a price-ranked, sovereignty-checked compute shortlist out of Omniscient takes minutes and no command-line setup. Sign in to the Yobitel console with your corporate identity provider, open the Omniscient marketplace at omniscient.yobitel.com (or under Marketplace in the main console navigation), and use the filter rail on the left to narrow the search. The standard filters are GPU family, region, term type, sovereignty pin, and price ceiling; the search runs against pricing data refreshed in the last six hours.
Results land as a ranked list. Every row shows the provider, the SKU, the per-GPU effective rate in USD, the term type, the sovereignty tags the SKU carries, and the Composite Value Score sourced from InferenceBench so a tier-1 hyperscaler quote is directly comparable to a sovereign neocloud quote. Click a row to open the per-provider detail panel — the panel shows the full price breakdown, the compliance posture, the freshness window, and the provisioning hand-off.
From the detail panel the customer either provisions directly through Yobitel (where the provider supports it) or copies the procurement detail for offline purchase. Search itself is free and unlimited; the value Omniscient bills for is the consolidated price stream and the FOCUS-aligned billing export that lands hourly in the customer's own object storage.
The same Marketplace UI is what Yobibyte itself calls at admission time. A workload that runs cleanly under Yobibyte is, by definition, a workload that Omniscient could rank for the customer directly.
Surfaces and access modes#
Omniscient is reachable through several customer-facing surfaces. The Marketplace UI and the FOCUS billing export are live today; the CLI, the language SDKs, and the REST API are in active development and will be released into early access. The table below is honest about the status of each surface.
| Surface | Status | What you get | Notes |
|---|---|---|---|
| Marketplace UI (web) | Live | Search + filter + rank + click-through to provider | omniscient.yobitel.com — included with every Yobitel workspace. |
| `omniscient` CLI | Preview | The same searches as the UI, scriptable from terminal and CI. | Auth modelled on `aws configure`: `omniscient login`, then an API token written to `~/.config/omniscient/credentials`. |
| `omniscient-py` Python SDK | Preview | Programmatic embed in pipelines and FinOps tooling. | Standard pip install; env-var-driven auth. |
| `@yobitel/sdk` TypeScript SDK | Preview | Browser and Node.js client. | npm package; OIDC and API-key auth. |
| REST API | Preview | Direct HTTP integration for custom tooling. | OpenAPI 3.1 spec published at GA; signed-request auth. |
| FOCUS billing export | Live | Hourly export of the normalised cost stream. | Delivered to customer-owned S3, Azure Blob, or GCS. |
# PREVIEW — the omniscient CLI is in active development; the snippet
# below is the planned shape and will NOT run today. Sign up at
# /products/omniscient-compute to be notified when the CLI ships.
# omniscient login # one-time, like `aws configure`
# omniscient search compute \
# --gpu h100 \
# --region uk \
# --term reserved-1y \
# --compliance ncsc-official \
# --max-price 1.80 \
# --output tableThe Marketplace UI and the FOCUS billing export are live today. The CLI, the Python and TypeScript SDKs, and the REST API are in active development and will be released into early access; sign up at /products/omniscient-compute to be notified. The intent is parity with the Marketplace UI — every search the UI runs, the CLI and SDKs run, with the same filter set and the same ranking.
Concepts#
Omniscient exposes a small set of concepts that map to how customers think about cross-provider compute. The mental model is provider at the top, with SKUs and regions inside, term types governing pricing, and compliance plus sovereignty tags governing what a regulated workload can see.
- Provider — a single source of compute capacity, classified by tier: hyperscaler (AWS, Microsoft Azure, Google Cloud, Oracle Cloud), neocloud (CoreWeave, Lambda, Crusoe, Yobitel NeoCloud, Scaleway), regional (Equinix Metal, Vultr, DigitalOcean, Hetzner, Latitude.sh), sovereign (UKCloud, Civo, regulated-region operators), and community (Akash, io.net, Render Network, Salad Cloud).
- SKU — a specific accelerator configuration. AWS calls it `p5.48xlarge`, GCP calls it `a3-highgpu-8g`, Azure calls it `Standard_ND96isr_H100_v5`. Omniscient surfaces all of them against the same underlying accelerator family so customers can compare like-for-like.
- Region — a geographic placement with a sovereignty tag attached. Omniscient indexes every public region from every indexed provider; sovereignty filters are applied per region.
- Term — how the capacity is purchased. On-demand, spot, reserved 1-year, reserved 3-year, or savings plan. Term influences the effective price and the discount-amortisation calculation that drives the Composite Value Score.
- Compliance Tag — the regulatory posture of a SKU. The current set includes NCSC OFFICIAL (UK), EU Data Boundary, FedRAMP-equivalent (US), HIPAA-eligible, ISO 27001-certified, SOC 2 Type II attested, and DORA-aligned. A workload pinned to a tag never sees non-eligible capacity.
- Composite Value Score — a single rank combining benchmark quality, throughput, and dollar-per-token. Inputs come directly from InferenceBench so the same model and accelerator combination has the same ranking across the index.
- Freshness Window — the maximum age of the price a customer is looking at. Pricing refreshes on a six-hour cadence; the freshness window is shown on every result row so customers can tell at a glance how recent the number is.
- Topology — the interconnect tier the workload requires. NVLink-fabric (all-units inside one 8-GPU node), InfiniBand-rack (units inside one rack, typical for distributed training), Ethernet-DC (units inside one data centre, tolerating ~10 microsecond extra hop), WAN (units across regions, latency-tolerant batch and serving). Omniscient ranks results by topology fit; if no provider in the indexed catalogue can satisfy the request at the strictest tier, the search returns next-best matches with the gap surfaced explicitly.
- Placement Profile — how the customer wants the result spread across the catalogue. `single-provider` packs the whole request on one provider for simplest billing and gang scheduling. `multi-provider-blend` spreads across two or three providers to combine reservation baselines with opportunistic spot. `multi-region` spans two or more regions for DR or latency-distributed serving, with the sovereignty boundary preserved per region. The profile drives ranking weights and the shape of the returned plan.
Distributed and multi-provider placement#
Most production AI workloads are not single-node. Training a 70B model needs 64 to 512 nodes wired with InfiniBand or NVLink Switch. Production inference often spreads across 2-4 regions for DR. Omniscient supports the search side of those workloads natively, with multi-node, multi-provider, and multi-region placement profiles as first-class filters.
The scope split is intentional and honest: Omniscient finds and ranks the capacity. Yobibyte (or the customer's own orchestrator such as KubeRay, Slurm, or a managed training service) actually runs the distributed workload. The two layers are decoupled — search and match in Omniscient, schedule and run in Yobibyte.
A common request is a 64-node H100 cluster for a 3-week training run. One sovereign neocloud may only have 32 nodes in the customer's region; one regional provider has 24; a hyperscaler has spot capacity for the remainder. The `multi-provider-blend` placement profile returns a plan that satisfies the 64 nodes from the three providers, preserving the sovereignty pin per node and surfacing the blended effective price.
Results are ranked not only by Composite Value Score but also by how strictly the requested topology is satisfied. A 32-node IB-rack request that fits inside one rack ranks above one that spans two racks at a higher Composite Value Score, because rack-spanning materially changes NCCL latency for collective communications.
- Cluster sizes searchable from 1 up to 2,048 nodes.
- Interconnect tiers ranked: NVLink-fabric > InfiniBand-rack > InfiniBand-row > Ethernet-DC > WAN.
- Placement profiles supported: single-provider (default), multi-provider-blend, multi-region.
- Gang availability filter: `--gang` returns only plans where the full cluster is provisionable at once, never partial.
# PREVIEW - Omniscient CLI is in active development; this is the planned shape.
# 64-node H100 cluster, single provider, InfiniBand rack, 3-year reserved, UK sovereign:
# omniscient search compute \
# --gpu h100 \
# --min-nodes 64 \
# --interconnect ib-rack \
# --placement single-provider \
# --region uk \
# --term reserved-3y \
# --compliance ncsc-official \
# --gang \
# --max-price 1.95 \
# --output table
#
# Same workload blended across providers when no single provider has 64:
# omniscient search compute --gpu h100 --min-nodes 64 --interconnect ib-rack \
# --placement multi-provider-blend --region uk --term reserved-3y \
# --compliance ncsc-official --gang --max-price 2.10 --output plan`--gang` is the safety filter for distributed training: it returns only plans where Omniscient can attest that the full cluster is available at provisioning time. Without `--gang`, a result that satisfies the average cluster size may not satisfy the peak; for collective-communication workloads that is a silent failure mode you do not want.
Search query reference#
The customer-facing filter set is the same in the Marketplace UI and the Preview CLI / SDK / REST surfaces. The table below lists the filters customers use; all filters are optional and combine with AND semantics.
| Filter | Type | Description | Example values |
|---|---|---|---|
| gpu | string | Specific accelerator model. | `h100`, `h200`, `b200`, `mi300x`, `l40s` |
| gpu-family | string | Accelerator family — broader than a model. | `hopper`, `blackwell`, `ada`, `ampere`, `cdna3` |
| region | string | Geographic region; accepts canonical or provider-native IDs. | `uk`, `eu`, `us`, `uk-london`, `eu-frankfurt` |
| sovereignty | enum | Sovereignty pin; restricts results to eligible capacity. | `uk-official`, `eu-data-boundary`, `us-fedramp` |
| provider | string | Specific provider name. | `aws`, `azure`, `gcp`, `coreweave`, `yobitel-neocloud` |
| provider-tier | enum | Provider classification. | `hyperscaler`, `neocloud`, `regional`, `sovereign`, `community` |
| term | enum | Purchase term. | `on-demand`, `spot`, `reserved-1y`, `reserved-3y`, `savings-plan` |
| max-price | number | Upper price bound in USD per GPU per hour. | `1.80`, `3.20`, `5.00` |
| currency | enum | Display currency; only USD is supported. | `USD` |
| min-availability | enum | Minimum reported provider availability. | `in-stock`, `queue`, `quote` |
| compliance | string[] | One or more compliance tags the SKU must carry. | `ncsc-official`, `hipaa`, `iso27001`, `soc2` |
| sustainability-tier | enum | Carbon-intensity tier of the underlying region. | `tier1`, `tier2`, `tier3` |
| instance-size | enum | Size class of the underlying instance. | `single-gpu`, `4x`, `8x`, `node` |
| vcpus | number | Minimum vCPU count alongside the accelerator. | `32`, `96`, `192` |
| max-results | integer | Maximum number of rows returned (UI defaults to 50). | `25`, `100`, `200` |
| sort | enum | Ranking criterion. | `value-score`, `price-asc`, `freshness`, `throughput` |
| min-nodes | integer | Minimum cluster size — the search returns plans that aggregate at least this many nodes of the requested SKU. | `8`, `64`, `512`, `2048` |
| interconnect | enum | Required interconnect tier; results below the requested tier are filtered out (or down-ranked when `--allow-degraded` is set). | `nvlink-fabric`, `ib-rack`, `ib-row`, `ethernet-dc`, `wan` |
| max-internode-latency-ms | float | Hard upper bound on point-to-point latency between any two nodes in the returned plan, in milliseconds. | `0.005` (NVLink), `0.5` (IB rack), `5` (Ethernet DC), `50` (WAN) |
| placement | enum | How the result is allowed to spread across the catalogue. | `single-provider` (default), `multi-provider-blend`, `multi-region` |
| gang | flag | All-or-nothing availability — the search returns only plans where the full cluster is attested provisionable. | `--gang` to enable |
Provider coverage#
Omniscient indexes more than 25 providers across five tiers. The table below summarises what the customer can expect by tier — representative providers, SKU coverage, and the regions covered. Every tier refreshes on the same six-hour cadence; the cadence is identical regardless of provider class, so a hyperscaler quote and a sovereign neocloud quote share the same freshness window.
| Tier | Representative providers | Indexed SKUs | Regions covered | Refresh cadence |
|---|---|---|---|---|
| Hyperscaler | AWS, Microsoft Azure, Google Cloud, Oracle Cloud | ~22,000 SKUs | All public regions worldwide | Every 6 hours |
| Neocloud | CoreWeave, Lambda, Crusoe, Scaleway, OVHcloud, Yobitel NeoCloud | ~18,000 SKUs | UK, EU, US, plus partner-region presence | Every 6 hours |
| Regional | Equinix Metal, Vultr, DigitalOcean, Hetzner, Latitude.sh, Genesis Cloud | ~7,500 SKUs | Single-region and multi-region operators | Every 6 hours |
| Sovereign | UKCloud (Six Degrees), Civo, regulated-region operators, Yobitel NeoCloud sovereign tier | ~3,000 SKUs | UK NCSC OFFICIAL, EU Data Boundary, US FedRAMP-equivalent | Every 6 hours |
| Community | Akash Network, io.net, Render Network, Salad Cloud | ~2,500 SKUs | Distributed; advisory pricing | Every 6 hours |
Yobitel NeoCloud is indexed alongside every other provider without preferential ranking. If a customer's workload is better served on a competing neocloud or hyperscaler, that is what Omniscient surfaces — the index is honest about its own first-party capacity.
Limits and quotas#
Limits are split into two groups: limits that apply to the live Marketplace UI and FOCUS export today, and limits that will apply to the Preview CLI, SDKs, and REST API once they reach early access. Both sets are raisable on request.
| Resource | Surface | Default | Enterprise ceiling | Status |
|---|---|---|---|---|
| Search results per query | Marketplace UI | 50 rows | 500 rows | Live |
| Concurrent sessions per workspace | Marketplace UI | 10 sessions | 100 sessions | Live |
| Saved searches per workspace | Marketplace UI | 25 saved | 1,000 saved | Live |
| FOCUS export delivery SLA | FOCUS export | Hourly export window | Hourly export window | Live |
| FOCUS export rows per day | FOCUS export | 1 million rows/day | 1 billion rows/day | Live |
| Search requests per minute | CLI / SDK / REST | 120 req/min | 10,000 req/min | Preview |
| Max results per page | CLI / SDK / REST | 200 results | 1,000 results | Preview |
| Concurrent API tokens per workspace | CLI / SDK / REST | 10 tokens | 200 tokens | Preview |
| Webhook subscriptions | CLI / SDK / REST | 10 per saved search | 100 per saved search | Preview |
| Sub-account scopes per workspace | CLI / SDK / REST | 20 scopes | 1,000 scopes | Preview |
Observability#
Omniscient surfaces freshness, coverage, and availability as first-class signals so customers can tell at a glance whether a result they are looking at is current. Inside the Marketplace UI, every result row carries a per-provider freshness timestamp showing when the price was last refreshed; the workspace-level dashboard shows the last successful refresh per provider and surfaces any indexed region that has gone missing.
Customers running Yobibyte alongside Omniscient see the same signals propagated through Yobibyte's hosted Grafana — freshness lag per provider, missing-region alerts, and last-crawl timestamps. The alert below is the one most teams add first; it pages when a provider's pricing has drifted more than 24 hours stale, which is the threshold at which spend forecasts start to disagree with the real bill.
groups:
- name: omniscient-freshness
interval: 60s
rules:
- alert: OmniscientProviderPriceStale
expr: |
(
time()
- omniscient_provider_pricing_last_refresh_seconds
) > 86400
for: 15m
labels: { severity: page }
annotations:
summary: "{{ $labels.provider }} pricing > 24h stale"
runbook: https://docs.yobitel.com/runbooks/omniscient-freshness
- alert: OmniscientSovereignCapacityEmpty
expr: omniscient_search_zero_results_total{compliance="ncsc-official"} > 0
for: 30m
labels: { severity: warn }
annotations:
summary: "NCSC-OFFICIAL searches returning zero results for {{ $labels.gpu }}"Pair the freshness alert with the missing-region signal; the two together catch the most common failure mode — a provider going dark in a single sovereign region while the rest of the index continues to look healthy.
Cost and FinOps#
Search itself is free. What Omniscient bills for is the value downstream of search: the consolidated price stream and the FOCUS 1.1-aligned billing export that lets customers reconcile cross-provider spend against the same schema. The export is hourly, lands directly in the customer's S3, Azure Blob, or GCS bucket, and carries every column the FOCUS specification requires.
The columns below are the FOCUS fields Omniscient populates on every line item. Downstream tooling — Cloudability, Apptio, Vantage, or a customer-built BigQuery or Snowflake lakehouse — consumes the export against the FOCUS schema without bespoke transformation.
| FOCUS column | What Omniscient populates |
|---|---|
| BilledCost | Provider-billed amount in USD per line item, normalised across the indexed providers. |
| EffectiveCost | Discount-amortised cost over the realistic utilisation window. |
| ListCost | Provider list price before any reservation, savings-plan, or term discount. |
| ChargePeriodStart | ISO-8601 timestamp marking the start of the charge window. |
| ChargePeriodEnd | ISO-8601 timestamp marking the end of the charge window. |
| ServiceCategory | FOCUS-canonical service category (Compute, Storage, Networking, AI/ML). |
| ChargeCategory | Usage, Purchase, Tax, Adjustment, or Credit. |
| ProviderName | The indexed provider (AWS, Azure, Google Cloud, CoreWeave, etc.). |
| RegionName | Canonical region name with sovereignty pin attached. |
| SkuName | Canonical SKU identifier with provider-native ID alongside. |
Omniscient normalises non-USD provider quotes to USD by default before they reach the FOCUS export. Customers needing the provider's native-currency quote alongside USD can request both columns at workspace creation.
Security and compliance#
Compliance posture is a first-class attribute of every SKU in the index. The customer pins a workload to a compliance tag, and Omniscient guarantees that non-eligible capacity never appears in the result set; a downstream provisioning hand-off cannot mix eligible and non-eligible SKUs. The tag set is published, versioned, and surfaced on every result row.
Sub-processor transparency is part of the same surface. Each indexed provider has a sub-processor manifest published and tracked by Omniscient; changes are versioned and surfaced via the workspace alert channel so customers with regulatory notification obligations can satisfy them mechanically. Customer-controlled audit-log streaming sends every search, every export delivery, and every provisioning hand-off to a customer-owned destination (S3, Azure Blob, GCS, or a syslog endpoint).
- NCSC Cloud Security Principles — per-SKU mapping; UK OFFICIAL-tier regions filtered at index time.
- G-Cloud framework — listed under Cloud Software (Lot 2) and Cloud Support (Lot 3).
- EU Data Boundary — every result tagged eu-data-boundary is restricted to EU capacity at search time.
- GDPR and UK DPA 2018 — per-provider sub-processor lists; standard DPA template provided.
- FedRAMP-equivalent — Moderate-baseline-aligned controls surfaced via partner regions in the US.
- HIPAA — BAA-eligible SKUs labelled at the result row for healthcare workloads.
- ISO 27001:2022, ISO 27017, ISO 27018 — per-provider attestations attached to each SKU.
- SOC 2 Type II — current attestations available under NDA per provider.
- Sub-processor transparency — versioned manifests, workspace-channel notifications on change.
- Customer-controlled audit-log streaming — every search, export, and provisioning event delivered to a customer-owned destination.
Migration and alternatives#
Most teams reach Omniscient after a single-cloud deployment outgrows the bill, or after a manual cross-cloud price comparison breaks down at scale. The comparison table below sets the alternatives the customer is realistically weighing; the code block underneath shows the manual incumbent workflow that Omniscient consolidates. The code is honest about what the customer does today — calling each provider's pricing API individually and reconciling the shapes by hand — and the bullet at the end of the section sets out why that manual workflow is the thing Omniscient replaces.
| Concern | Omniscient Compute | Single-cloud procurement | DIY cross-cloud tooling | FinOps tooling (Vantage / Cloudability / Apptio) |
|---|---|---|---|---|
| Provider coverage | 25+ providers across five tiers | One cloud | Whatever the team has scripted | Hyperscaler-centric; neocloud coverage via add-ons |
| Refresh cadence | Every 6 hours, per provider | Whatever the cloud's API serves | DIY; whatever the cron runs at | Daily or longer |
| Sovereignty filter | Admission-enforced per SKU | Region tag only | DIY | DIY |
| Composite Value Score | InferenceBench-sourced ranking | None | None | None |
| Provisioning hand-off | Direct or copy-out per provider | Native to the cloud | DIY | Reporting only |
| FOCUS billing export | Live — hourly to customer storage | Proprietary export | DIY | Available, often as paid add-on |
| Operational ownership | Yobitel runs the index | You run the procurement loop | You run the comparison loop | Vendor runs the dashboard |
| Pricing model | Free search; paid FOCUS export and FinOps surface | Cloud bill | Your engineering cost | Per-spend SaaS |
# Manual incumbent workflow — what cross-provider compute procurement
# looks like today without Omniscient. Each provider exposes a different
# pricing API with a different shape; results have to be reconciled by hand.
# 1. AWS — reserved-instance price for p5.48xlarge (8x H100 SXM5) in London
aws pricing get-products --service-code AmazonEC2 --region us-east-1 \
--filters 'Type=TERM_MATCH,Field=instanceType,Value=p5.48xlarge' \
'Type=TERM_MATCH,Field=location,Value=Europe (London)' \
--query 'PriceList[0]' --output text | jq '.terms.Reserved'
# 2. AWS — current spot price history
aws ec2 describe-spot-price-history \
--instance-types p5.48xlarge \
--availability-zone eu-west-2a \
--product-descriptions "Linux/UNIX" \
--max-results 5 --output table
# 3. Azure — ND-H100-v5 retail prices in uksouth (anonymous API)
curl -s "https://prices.azure.com/api/retail/prices?\$filter=serviceName%20eq%20'Virtual%20Machines'%20and%20armRegionName%20eq%20'uksouth'%20and%20contains(skuName,%20'ND')" \
| jq '.Items[] | {skuName, retailPrice, reservationTerm}'
# 4. GCP — a3-highgpu-8g SKU pricing via Cloud Billing Catalog API
curl -s -H "Authorization: Bearer $(gcloud auth print-access-token)" \
"https://cloudbilling.googleapis.com/v1/services/6F81-5844-456A/skus?pageSize=200" \
| jq '.skus[] | select(.description | contains("A3"))'
# 5. CoreWeave / OVHcloud / sovereign UK — no structured public API; scrape or
# request a partner price feed and rebuild the comparison spreadsheet.
# Each of the above returns a different shape, in a different unit. Provider
# quotes that arrive in non-USD must be normalised to USD by hand before the
# comparison is meaningful. Omniscient Compute consolidates every step of this
# workflow into one ranked query against a canonical SKU schema.The manual workflow above is what most procurement and platform teams do today without Omniscient. The consolidation Omniscient provides — one query, one schema, USD-normalised pricing, and a sovereignty-aware filter — is the value the FOCUS export and the FinOps surface are billed against.
Troubleshooting#
The errors below cover the failure modes seen most often during onboarding and the first weeks of running searches against Omniscient. Each row is the Omniscient-shaped error code, the cause the customer most commonly hits it for, and the customer-facing fix. The full runbook library is at docs.yobitel.com/runbooks.
| Error | Cause | Fix |
|---|---|---|
| StaleProvider | A provider's pricing has not refreshed within the expected window; the last successful refresh is older than the freshness SLA. | Treat the affected rows as advisory; retry the search after the next six-hour refresh, or filter the provider out with `--provider-tier`. |
| MissingRegion | A region the customer asked for is not in the canonical region table — usually a newly-launched provider region. | Use the canonical parent region (for example `uk` instead of `uk-london-3`), or contact Yobitel to open a coverage request. |
| CurrencyNormalisationLag | A provider returned a non-USD quote and the daily normalisation rate has not yet been applied. | Omniscient normalises non-USD provider quotes to USD by default. Retry the search; transient lag clears within a few minutes. |
| TermIneligible | The requested term (for example spot) is not offered for the SKU in the region the customer asked for. | Remove the term filter, or pick a region where the term is sold; the per-SKU detail panel lists eligible terms. |
| SkuNotIndexed | The customer asked for a SKU that the index does not currently cover. | Use the SKU family filter (`gpu-family=hopper`) instead of the exact SKU, or open a coverage request via the account team. |
| ComplianceTagMissing | A provider published a SKU without sufficient compliance metadata to apply a tag. | Reduce the compliance filter, or contact Yobitel — Omniscient withholds the tag rather than risk a false positive. |
| ProviderCrawlFailure | Omniscient could not reach a provider's pricing surface during the last refresh window. | The affected provider is excluded from results until the next successful refresh; the workspace alert channel surfaces it. |
| MarketplaceSearchTimeout | A search ran past the configured timeout, usually because the filter set was overly broad. | Narrow the search with a tighter region or GPU-family filter, or lower `max-results`. |
| FocusExportEmpty | The FOCUS export bucket policy does not allow Omniscient to write objects. | Apply the bucket policy snippet shown in the workspace Billing tab and wait one export window; exports retry hourly. |
| ComplianceFilterEmpty | A sovereignty pin (for example `compliance=ncsc-official`) returned no eligible capacity in the search window. | Widen the region filter, or contact the account team — sovereign capacity availability is published per provider. |
| TopologyConstraintUnsatisfiable: ib-rack across 64 nodes | No indexed provider in the region has 64 H100 SXM5 nodes inside one InfiniBand rack at the requested term. | Loosen the interconnect to `ib-row` if the workload tolerates a rack-spanning hop, widen the region set, or accept a `multi-provider-blend` plan that ships 32+32 across two providers. |
| InsufficientGangCapacity: gang requested, 48/64 attested | Gang filter requested 64 nodes; only 48 are attested provisionable at search time. | Drop `--gang` to see the partial plan, request a smaller cluster, or queue the request (`--queue-for` parameter) for an availability window. |
| PlacementImpossible: sovereign + spot + ib-rack | No indexed provider holds spot-priced H100 capacity inside an IB rack in a UK NCSC OFFICIAL region. | Switch to `--term reserved-1y` or `--term on-demand`; sovereign IB-rack capacity is almost never sold on spot in 2026. |
Where Omniscient Compute fits in the Yobitel stack#
Omniscient is the layer below Yobibyte in the Yobitel stack. Yobibyte calls Omniscient at admission and reconciliation time, so every Inference, FineTune, and AIApplication placement decision is grounded in current capacity and pricing — a workload Yobibyte schedules has, by construction, been priced and sovereignty-checked against the live Omniscient index. InferenceBench supplies the benchmark and economics data that drives the Composite Value Score so the ranking a customer sees in the Marketplace UI is the same ranking Yobibyte uses internally.
The Yobitel AI Applications suite — MediQuery and the rest of the vertical applications — sits on top of Yobibyte and consumes Omniscient indirectly through it. Yobitel NeoCloud is itself one node in the Omniscient index without preferential ranking; if a customer's workload is better served on a competing neocloud or hyperscaler, that is what Omniscient returns.
Practically, the question 'which compute should this workload run on?' is the same question whether asked by a FinOps team in a procurement meeting, by a platform team scripting a CI hand-off, or by Yobibyte's admission webhook. Omniscient is the answer surface for that question across all three contexts. FinOps and platform teams reach Omniscient directly through the Marketplace UI today, and through the CLI, SDKs, and REST API when those surfaces leave Preview.
Distributed workloads reach Omniscient in one of two ways. Customers using Yobibyte get Omniscient placement decisions automatically: when a Yobibyte `Inference` or `FineTune` declares a multi-node requirement, Yobibyte queries Omniscient for the cluster plan at admission time and the resulting placement carries through to the data plane. Customers running their own orchestrators (KubeRay, Slurm, in-house) reach Omniscient directly through the Marketplace UI today and through the CLI, SDK, or REST API once those Preview surfaces are GA. Either path returns the same plan, ranked the same way, against the same indexed catalogue.
References
- Omniscient Compute product page · Yobitel
- Yobibyte platform · Yobitel
- InferenceBench · Yobitel
- FOCUS — FinOps Open Cost and Usage Specification · FinOps Foundation
- NCSC Cloud Security Principles · NCSC
- AWS Pricing API · AWS
- Google Cloud Billing Catalog API · Google Cloud
- Azure Retail Prices API · Microsoft