From: Big data quality framework: a holistic approach to continuous quality management
Data quality issues vs data quality dimensions (DQD) | Accuracy | Completeness | Consistency | ||
---|---|---|---|---|---|
Single data source | Cell instance level | Missing data | ● | ● | |
Incorrect data and references, Data entry errors and Misspelling | ● | ||||
Irrelevant data | ● | ||||
Outdated data | ● | ||||
Misfielded and contradictory values | ● | ● | ● | ||
Dataset schema level | Domain and Uniqueness constrains, Functional dependency violation | ● | |||
Wrong data type, poor schema design | ● | ||||
Referential integrity violation, lack of integrity constraints | ● | ● | ● | ||
Multiple data source | Cell instance level | Different units, representations, Structural conflicts | ● | ||
Different aggregation levels, Inconsistent aggregation | ● | ● | |||
Temporal mismatch, inconsistent timing | ● | ||||
Dataset schema level | Heterogeneous data models and schema design | ● | ● | ● | |
Different encoding formats | ● |