Publishing process signals: MODERATE — reflects the venue and review process. — venue and review process.

Dataset bias reduces reliability in otitis media AI

Research area:MedicineOtorhinolaryngologyEar Surgery and Otitis Media

What the study found

Dataset biases and inconsistencies can reduce how reliably artificial intelligence (AI) classifies otitis media, a middle-ear infection, from otoscopic images. The study found that some datasets produced high performance internally but did not generalize well to new data, often because of dataset-specific artifacts.

Why the authors say this matters

The authors conclude that addressing these biases is crucial for developing robust AI solutions. They say this is important for improving high-quality healthcare access and enhancing diagnostic accuracy.

What the researchers tested

The researchers retrospectively evaluated three public otoscopic image datasets from Chile, Ohio (USA), and Türkiye using quantitative and qualitative methods. They also ran two counterfactual experiments: one masked clinically relevant features to test reliance on non-clinical artifacts, and the other examined how hue, saturation, and value affected diagnostic outcomes.

What worked and what didn't

Quantitative analysis found significant biases in the Chile and Ohio datasets. In the first counterfactual experiment, models showed high internal performance (area under the curve, or AUC, above 0.90) but poor external generalization; the Türkiye dataset had fewer biases, and its AUC fell from 0.86 to 0.65 as masking increased, suggesting greater reliance on clinically meaningful features. In the second experiment, common artifacts were identified in the Chile and Ohio datasets, and a logistic regression model trained on clinically irrelevant features from the Chile dataset still achieved high internal AUC (0.89) and external AUC in Ohio (0.87). Qualitative analysis also found redundancy in all datasets and stylistic biases in the Ohio dataset that correlated with clinical outcomes.

What to keep in mind

The abstract describes a retrospective study of three public datasets, so the findings are limited to those datasets and methods. It also notes several sources of bias and inconsistency, but it does not provide additional limitations beyond what is summarized here.

Key points

The study found that dataset bias can undermine AI models for otitis media classification from otoscopic images.
Chile and Ohio datasets showed significant biases, while the Türkiye dataset had fewer biases.
Models could achieve high internal AUC yet perform poorly on external data because of dataset-specific artifacts.
Masking clinically relevant features reduced AUC in the Türkiye dataset from 0.86 to 0.65.
A model trained on clinically irrelevant features from the Chile dataset still achieved high internal and external AUC.
The authors say standardized imaging protocols, diverse datasets, and improved labeling are crucial.

Disclosure

Research title:: Dataset bias reduces reliability in otitis media AI
Image credit:: Photo by shogun on Pixabay

AI provenance: AI provenance information is not available for this post.

Dataset bias reduces reliability in otitis media AI

What the study found

Why the authors say this matters

What the researchers tested

What worked and what didn't

What to keep in mind

Disclosure

More posts

Allograft augmentation was the most cost-effective option in rotator cuff repair

Framework for studying infrastructure failure

NATPS enables efficient sampling of nonadiabatic trajectories

Renovations are linked to tenant relocations in Sweden