A data cleaning checklist is a list of tasks to make sure your data is accurate, consistent, and ready for analysis.
- Data Profiling
- Identify primary data sources
- Assess data volume
- Survey data types
- Record data age
- Profile data completeness
- Data Cleaning
- Remove duplicates
- Handle missing values
- Correct data entry errors
- Normalize data
- Standardize formats
- Data Validation
- Verify data accuracy
- Validate data ranges
- Cross-check with external sources
- Confirm consistency rules
- Review data relationships
- Data Transformation
- Standardize units of measure
- Convert data types
- Aggregate data
- Map values to common formats
- Reorganize data structures
- Data Enrichment
- Append missing information from trusted sources
- Derive new variables
- Integrate external datasets
- Classify data into categories
- Tag data for easier retrieval
- Data Security
- Mask sensitive information
- Encrypt critical data
- Implement access controls
- Conduct security audits
- Ensure compliance with data regulations