Quiz 4 - Question 2
This case highlights how relying on narrow or unrepresentative data sources can distort cultural meaning and exclude communities. This relates to the discussion of why ethical reflection on bias, representation, and cultural context is central to building fair and accountable datasets.
Representation and context
A team develops a translation model for Swahili using mostly English religious texts translated into Swahili. While technically functional, the system struggles with everyday cultural expressions and mistranslates common greetings, sometimes producing offensive or inaccurate results.
Question: Which ethical issues about dataset design are highlighted in this example?
Diese Übung ist Teil des Kurses
Google DeepMind: Represent Your Language Data
Interaktive Übung
In dieser interaktiven Übung kannst du die Theorie in die Praxis umsetzen.
Übung starten