This chapter navigates the selection and integration of data sources within the context of responsible data practices. It highlights the importance of data origin, nature, and temporality, emphasizing legal compliance, diversity, and fairness. By exploring types of bias and their origins, we look at data fairness and representation to create a comprehensive dataset for modeling.