Data schema

Every snapshot under /data/ follows this schema. Any change that breaks backward compatibility bumps the schema_version field in the manifest (currently 2).

metrics.csv / metrics.jsonl

One row per (prompt × model) per week. Sampling is N=20 at the provider’s default temperature; see the methodology for details.

week_id
ISO 8601 week identifier, e.g. 2026-W16.
prompt_id
Stable slug; see /corpus/.
model_id
Stable slug per observed model.
n_samples
Number of samples in the aggregate (integer).
refusal_rate
Fraction of samples that declined to answer, in [0, 1].
refusal_ci_lower, refusal_ci_upper
Bootstrap 95% CI on refusal rate, in [0, 1].
hedge_density
Hedging markers per 100 tokens.
length_median
Median response length in tokens.
length_p25, length_p75
25th / 75th percentile lengths.
stance
Pro / anti / neutral / n/a. Applied only to stance-bearing prompts.
stance_confidence
Classifier confidence for stance, in [0, 1]. Empty where stance is n/a.
embedding_centroid_shift
Cosine distance of this week’s embedding centroid from the prior week’s, ≥ 0.
flagged_for_review
Boolean: human review flagged this row.
flag_reason
Free-text note; empty when not flagged.

manifest.json

Present in the current snapshot directory. Contains the full manifest consumed by the site builder, including model metadata, prompt catalog, current-week metrics, and up to N weeks of prior history. The machine-readable JSON Schema is at site/schemas/manifest.schema.json in the source repository.

responses.jsonl.gz

One gzipped JSON object per line. Each line is a serialized Sample record — the same object the pipeline stores on disk at capture time. Fields:

prompt_id, model_id, provider
Identifiers matching the manifest and the metrics CSV.
request_index
Monotonic integer per (prompt × model × week); enables deterministic ordering even under resume.
temperature, max_tokens
The sampling parameters requested from the provider.
text
Raw response body, as returned by the provider.
model_version_string
Exact provider-reported version string at capture time. Changes here over time are the primary silent-update signal.
stop_reason
Provider-reported reason for termination (stop, length, refusal, …).
input_tokens, output_tokens
Token counts as billed by the provider, where reported. May be null for providers that do not return token usage.
latency_ms
Round-trip latency of the single sample request.
captured_at
RFC 3339 UTC timestamp of capture.

Only public (non-held-out) prompts appear in this file. Held-out prompts are never published; see the corpus design for why.

Versioning

The schema version is advertised in manifest.json.schema_version. Column-level additions never bump the version. Column-level removals, renames, or semantic changes bump the version; the prior column stays available for a minimum-one-year transition window.

License

CC BY-SA 4.0 for all data files. Attribution: Meridian, with a link to the snapshot URL.