01DETECTOR BENCHMARK

Detector Benchmark.


Public snapshot of the current DataSitr detector benchmark: current gate, public-safe suite results, and explicit claim boundaries.

Current benchmark: pass
self-run, dated in-repo snapshot · not an external audit Public gate; excludes research corpora — see 05
Overall gate Pass

Depends on both the required-quality gate and the performance gate.

floor: required-quality AND performance gates
Required suites 12/12

Required suites currently passing.

floor: all required suites must pass
English 1K p95 47.9 ms

Current gating latency check for English 1K text.

floor: p95 < 75 ms target
Frozen cases 534

Total cases across the current frozen suites.

count across the current frozen suites

02PERFORMANCE

Performance snapshot


The public page highlights the current 1K-character gating snapshot rather than publishing the full internal performance report.

// p95 vs the 75 ms gate, by language path

03PRECISION / RECALL

Detector precision / recall snapshot


Per-entity-type precision/recall is published against an adversarial-FP and Saudi-name-recall corpus. Dated in-repo snapshots on curated corpora — not an external audit and not a claim of production-wide coverage.

Precision / recall gate Pass

Dated detector precision/recall snapshot.

Total cases 350

Total measured cases in the public JSON artifact.

False positives 2

Total false positives across the published slices.

False negatives 0

Total false negatives across the published slices.

These misses are itemized in the linked JSON, with case id and character span (synthetic curated test tokens — not live customer traffic).

// each artifact is dated independently — see 06

04PUBLISHED SUITES

Current benchmark suites


These are the suites currently included in the public benchmark gate.

05RESEARCH CORPORA

Research corpora coverage


These suites were folded in from the 2026-04-26 detector research package. They expand evaluation coverage across public-domain Arabic literature, Saudi code-switched business text, and adversarial PII attacks. This section is shown separately from the frozen public gate until the new corpora complete trend-history stabilization.

Dated in-repo benchmark over curated public-domain and synthetic research corpora; not an external audit or production-wide coverage guarantee.

Research corpora gate Pass

Separate from the frozen public gate.

Total research records 1283

Across all research suites.

False positives 4

Across research corpora.

False negatives 5

Across research corpora.

Per-suite breakdown

    Adversarial attack-class metrics

      // shown separately from the frozen public gate

      06ARTIFACT & METHOD

      Artifact and method


      This page reads the published public JSON summary. It is intentionally narrower than the internal benchmark report.

      Last public update: 2026-04-29T14:40:52Z

      The precision/recall artifact (03) and the suite/gate artifact (this section) are dated independently — each is open-able above.

      07CLAIM BOUNDARY

      Claim boundary


      This page is a benchmark snapshot, not a blanket claim that every customer payload or every future detector change is perfect. Public benchmark language should stay tied to a fresh artifact.

      The public benchmark page shows only the suites currently included in buyer-facing claims.


      Evaluate the product with the evidence in hand.

      Request a pilot →