
Bias Testing — Ivy (Dietitian)

Detailed bias evaluation results for Ivy. 466 test cases across 9 demographic axes with zero cases flagged.

0% Flagged

No Bias Detected in Ivy’s Responses

Across 466 test cases spanning 9 demographic axes, zero responses were flagged for potential bias. The highest differential score was 4/10 (recorded on the Name & Ethnicity axis), well below the 7/10 flag threshold, and the judge consistently attributed differences to normal stylistic variation rather than demographic-driven bias.
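The headline figures can be cross-checked against the per-axis rows reported further down this page. The snippet below is a minimal sketch of that check: the case counts, maximum scores, and flag counts are copied from the per-axis table, while the variable names are purely illustrative and not part of any evaluation tooling.

```python
# Per-axis rows copied from the summary table: (axis, cases, max_diff, flagged).
ROWS = [
    ("Name & Gender", 58, 3, 0),
    ("Age", 36, 3, 0),
    ("Name & Ethnicity", 95, 4, 0),
    ("Medication", 33, 3, 0),
    ("Health Conditions", 33, 3, 0),
    ("Diet Preference", 35, 3, 0),
    ("Location", 106, 3, 0),
    ("BMI", 32, 2, 0),
    ("Gender", 38, 3, 0),
]

FLAG_THRESHOLD = 7  # flag threshold stated in the report (scale 1-10)

total_cases = sum(cases for _, cases, _, _ in ROWS)
total_flagged = sum(flagged for _, _, _, flagged in ROWS)
overall_max = max(max_diff for _, _, max_diff, _ in ROWS)
flag_rate = total_flagged / total_cases

print(f"{total_cases} cases, {total_flagged} flagged ({flag_rate:.0%}), "
      f"{len(ROWS)} axes, max score {overall_max}/10")
```

The totals agree with the summary: 466 cases, 0 flagged, 9 axes, and a maximum score of 4, which sits below the threshold of 7.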

Cases tested: 466
Cases flagged: 0
Demographic axes: 9
Max score: 4/10

Summary by Demographic Axis

All axes clear

Results broken down by demographic axis. The mean differential score shows the average difference in responses when only that demographic variable was changed. All axes remain well below the 7/10 flag threshold.

Axis               Cases  Mean Diff  Max Diff  Median  Flagged
Name & Gender         58       1.90         3       2        0
Age                   36       1.89         3       2        0
Name & Ethnicity      95       1.85         4       2        0
Medication            33       1.85         3       2        0
Health Conditions     33       1.85         3       2        0
Diet Preference       35       1.83         3       2        0
Location             106       1.81         3       2        0
BMI                   32       1.81         2       2        0
Gender                38       1.58         3       2        0
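
As an illustration of how the columns in this table could be derived, the sketch below summarizes one axis from its per-case differential scores. It is not the report's actual pipeline: the function name `summarize_axis`, the example scores, and the rule that a case is flagged when its score is at or above 7 (rather than strictly above) are all assumptions made for the example.

```python
from statistics import mean, median

FLAG_THRESHOLD = 7  # assumption: a score at or above 7 counts as flagged


def summarize_axis(scores, threshold=FLAG_THRESHOLD):
    """Compute the table columns for one axis from per-case scores (1-10)."""
    if not scores:
        raise ValueError("an axis needs at least one scored case")
    return {
        "cases": len(scores),
        "mean_diff": round(mean(scores), 2),
        "max_diff": max(scores),
        "median": median(scores),
        "flagged": sum(1 for s in scores if s >= threshold),
    }


# Hypothetical scores for illustration only; not taken from the report.
example_scores = [1, 2, 2, 3, 2, 1, 2]
summary = summarize_axis(example_scores)
print(summary)
```

With scores clustered around 2, as in the hypothetical input, the mean stays near 1.9 and no case approaches the threshold, which is the pattern the table shows for every axis.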

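Mean Differential Score by Axis (scale: 1–10, flag threshold: 7)

[Bar chart: one bar per axis showing the mean differential score rounded to one decimal place: Name & Gender 1.9, Age 1.9, Name & Ethnicity 1.9, Medication 1.9, Health Conditions 1.9, Diet Preference 1.8, Location 1.8, BMI 1.8, Gender 1.6. Horizontal axis ticks at 1, 5, 7 (flag), and 10.]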