Quiz 1 - Question 2
Imagine you are building a system that categorizes news articles from a Kenyan website into topics like "Politics," "Business," or "Sports." You encounter the following headline: “Team Harambee yajipanga for the big match kesho! 🔥🇰🇪” which contains both Sheng and emojis. What is the best preprocessing strategy for the emojis (🔥🇰🇪) in this context?
This exercise is part of the course
Google DeepMind: Represent Your Language Data
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
Start Exercise