Why document labeling methods?
You have been tasked with writing up documentation for your model training process. As part of this documentation, you realize that the data you originally started with doesn't have proper labels attached to it and you have no way of knowing how it came to be this way because there is no documentation. Frankly, you're lucky to even have noticed it!
To remedy this, you put together a team of people with deep knowledge of the subject to create a properly attributed response variable for a supervised task. You figure it will be important to document this process because then others will know how the labels were created and can critique them if needed.
You spend the time to write up how you came to the labels, but you have a sense that there is another reason you wrote all of this up. What is another reason to document data labeling methods for a supervised task?
This exercise is part of the course
Developing Machine Learning Models for Production
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
