Document what decisions and actions were taken to address the data quality problems reported during the Verify Data Quality task
of the Understanding Data activity. Transformations of the data for cleaning purposes and the possible impact on the
analysis results should be considered. Consider also the following questions when creating your documentation:
What types of noise occurred in the data?
What approaches did you use to remove the noise? Which techniques were successful?
Are there any cases or attributes that could not be salvaged? Be sure to note data excluded due to noise.
|
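One possible way to keep this documentation alongside the cleaning itself is to log each decision as it is made. The sketch below is only an illustration: the `CleaningLog` class, the `clean` function, the sample records, and the two noise types (a missing age, a negative income) are all hypothetical and not taken from any particular project.

```python
from dataclasses import dataclass, field
from statistics import median


@dataclass
class CleaningLog:
    """Collects one entry per cleaning decision so it can be reported later."""
    entries: list = field(default_factory=list)

    def record(self, attribute, noise_type, approach, successful, excluded_ids=()):
        self.entries.append({
            "attribute": attribute,
            "noise_type": noise_type,
            "approach": approach,
            "successful": successful,
            "excluded_ids": list(excluded_ids),
        })

    def excluded(self):
        """Return the sorted ids of all cases dropped because of noise."""
        return sorted({i for e in self.entries for i in e["excluded_ids"]})


def clean(records, log):
    """Impute missing ages with the median and drop cases with negative income."""
    known_ages = [r["age"] for r in records if r["age"] is not None]
    fill = median(known_ages)
    missing = [r["id"] for r in records if r["age"] is None]
    if missing:
        log.record("age", "missing value", f"imputed median ({fill})", True)
    bad = [r["id"] for r in records if r["income"] < 0]
    if bad:
        # These cases could not be salvaged, so their ids are noted explicitly.
        log.record("income", "impossible negative value", "case excluded", False, bad)
    return [
        {**r, "age": fill if r["age"] is None else r["age"]}
        for r in records if r["income"] >= 0
    ]


# Hypothetical sample data, for illustration only.
records = [
    {"id": 1, "age": 34, "income": 52000},
    {"id": 2, "age": None, "income": 48000},
    {"id": 3, "age": 29, "income": -1},
    {"id": 4, "age": 41, "income": 61000},
]

log = CleaningLog()
cleaned = clean(records, log)
for entry in log.entries:
    print(entry)
print("excluded ids:", log.excluded())
```

Each log entry answers the questions above for one attribute: what kind of noise occurred, which approach was used, whether the data could be salvaged, and which cases were excluded. The list of excluded ids also makes it easier to assess later how the exclusions may have affected the analysis results.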