
Bias Testing — Luna (Companion)

Detailed bias evaluation results for Luna: 262 test cases across 9 demographic axes, with zero cases flagged.

0% Flagged

No Bias Detected in Luna’s Responses

Across 262 test cases spanning 9 demographic axes, zero responses were flagged for potential bias. The highest differential score was 4/10, well below the 7/10 flag threshold, and the judge consistently attributed differences to normal stylistic variation rather than demographic-driven bias.
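The flagging rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the report does not say whether a score of exactly 7 is flagged, so the sketch assumes "7 or higher"; the function name `summarize` is hypothetical, and the example scores are made up, not taken from Luna's evaluation.

```python
from statistics import mean, median

# From the report: scores are on a 1-10 scale and the flag threshold is 7/10.
FLAG_THRESHOLD = 7


def summarize(scores, threshold=FLAG_THRESHOLD):
    """Summarize judge differential scores for one set of test cases."""
    # Assumption: a case is flagged when its score is >= threshold.
    flagged = [s for s in scores if s >= threshold]
    return {
        "cases": len(scores),
        "mean": round(mean(scores), 2),
        "median": median(scores),
        "max": max(scores),
        "flagged": len(flagged),
    }


# Made-up scores for illustration only; not data from this report.
example_scores = [1, 1, 2, 4, 2, 1, 3]
summary = summarize(example_scores)
print(summary)
```

With a maximum of 4, no score in the example reaches the threshold, so the flagged count is zero, which mirrors the situation reported for Luna.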

Cases tested: 262
Cases flagged: 0
Demographic axes: 9
Max score: 4/10

Summary by Demographic Axis

All axes clear

Results broken down by demographic axis. The mean differential score shows the average difference in responses when only that demographic variable was changed. All axes remain well below the 7/10 flag threshold.

Axis               Cases  Mean Diff  Max Diff  Median  Flagged
Location              58       2.12         4       2        0
Name & Ethnicity      48       2.06         3       2        0
Name & Gender         39       1.95         3       2        0
Age                   16       1.06         2       1        0
Health Conditions     18       1.00         1       1        0
BMI                   20       1.00         1       1        0
Gender                24       1.00         1       1        0
Diet Preference       26       1.00         1       1        0
Medication            13       1.00         1       1        0
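As a quick consistency check, the per-axis rows can be totalled and compared against the headline figures (262 cases, 0 flagged, max score 4/10). The sketch below transcribes the rows from the table above; the case-weighted overall mean it computes is not reported in the source and is only approximate, since it is derived from per-axis means already rounded to two decimals.

```python
# Rows transcribed from the per-axis table:
# (axis, cases, mean_diff, max_diff, flagged)
rows = [
    ("Location", 58, 2.12, 4, 0),
    ("Name & Ethnicity", 48, 2.06, 3, 0),
    ("Name & Gender", 39, 1.95, 3, 0),
    ("Age", 16, 1.06, 2, 0),
    ("Health Conditions", 18, 1.00, 1, 0),
    ("BMI", 20, 1.00, 1, 0),
    ("Gender", 24, 1.00, 1, 0),
    ("Diet Preference", 26, 1.00, 1, 0),
    ("Medication", 13, 1.00, 1, 0),
]

FLAG_THRESHOLD = 7  # flag threshold stated in the report

total_cases = sum(cases for _, cases, _, _, _ in rows)
total_flagged = sum(flagged for _, _, _, _, flagged in rows)
overall_max = max(max_diff for _, _, _, max_diff, _ in rows)

# Derived value, not in the source: mean differential weighted by case count.
weighted_mean = sum(cases * m for _, cases, m, _, _ in rows) / total_cases

print(f"cases={total_cases} flagged={total_flagged} max={overall_max}")
print(f"case-weighted mean differential (approx.): {weighted_mean:.2f}")
print(f"all axes below threshold: {overall_max < FLAG_THRESHOLD}")
```

The totals agree with the summary figures at the top of the page: the per-axis case counts add up to 262, no axis has a flagged case, and the largest per-axis maximum is 4.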

[Bar chart: Mean Differential Score by Axis (scale: 1–10, flag threshold: 7). Bars: Location 2.1, Name & Ethnicity 2.1, Name & Gender 1.9, Age 1.1, Health Conditions 1.0, BMI 1.0, Gender 1.0, Diet Preference 1.0, Medication 1.0.]