Quiz 4 - Question 2
This case highlights how relying on narrow or unrepresentative data sources can distort cultural meaning and exclude communities. This relates to the discussion of why ethical reflection on bias, representation, and cultural context is central to building fair and accountable datasets.
Representation and context
A team develops a translation model for Swahili using mostly English religious texts translated into Swahili. While technically functional, the system struggles with everyday cultural expressions and mistranslates common greetings, sometimes producing offensive or inaccurate results.
Question: Which ethical issues about dataset design are highlighted in this example?
This exercise is part of the course
Google DeepMind: Represent Your Language Data
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
Start Exercise