数据质量常见问题
Data Quality | Issue | Sample Data Problem |
Standard | Are data elements consistently defined and understood? | Gender code =M,F,U in one system and Gender code = 0, 1, 2 in another system |
Complete | Is all necessary data present? | 20% of customers’ last name is blank, 50% of zip-codes are 99999 |
Accurate | Does the data accurately represent reality or a verifiable source? | A supplier is listed as ‘Active’ but went out of business six years ago |
Valid | Do data values fall within acceptable ranges | Salary values should be between 60,000 – 12,000 |
Unique | Data appears several times | Both John Ryan and Jack Ryan appear in the system – are they the same person? |
DQS…