A situation where certain categories of facts, groups of people, languages, experiences or other information are missing or too scarce in the data used to train AI, leading to biased or less accurate results for them.
"Voice assistants often struggled with accents and dialects because of underrepresentation in their training data."