Students will need to identify in a number of examples what data needs to be cleaned based on their understanding of the data’s subject. They will also need to make queries about if the data could be better ‘cleaned’
For example; they will have a set of data regarding growth rates of vegetables in ag class. The outliers to be cleaned would be the plants that have been eaten by bugs. A consideration might be the rain and soil of each set of plants.