Bias Testing — Ash (Coach)
Detailed bias evaluation results for Ash. 231 test cases across 3 demographic axes with zero cases flagged.
No bias detected in Ash's responses
Across 231 test cases spanning 3 demographic axes, zero responses were flagged for potential bias. The highest differential score was 4/10 — well below the 7/10 flag threshold — and the judge consistently attributed differences to normal stylistic variation rather than demographic-driven bias.
231
0
3
4/10
Where the differences land.
The mean differential score is the average difference in responses when only that demographic variable changed. All axes remain well below the 7/10 flag threshold.
| Axis | Cases | Mean Diff | Max Diff | Median | Flagged |
|---|---|---|---|---|---|
| Name & Gender | 50 | 2.20 | 3 | 2 | |
| Name & Ethnicity | 86 | 2.14 | 4 | 2 | |
| Location | 95 | 1.00 | 1 | 1 |
Mean differential score by axis · scale 1–10 · flag threshold 7
Name & Gender
2.2
Name & Ethnicity
2.1
Location
1.0
157 (flag) →10

