Interview Questions

Why is it important to clean data? - ETL

ETL (extract, transform, and load) Testing Interview Questions



55. Why is it important to clean data? - ETL

Data in the warehouse is used for important analysis and reporting, so its quality directly affects the results. Cleaning data means finding and reconciling inconsistent values that refer to the same thing, so that records are categorized correctly. For example, records labeled ‘store’ and ‘stores’ may fall into separate groupings because the values are not exactly the same. Data cleansing ensures that categorically similar information is grouped together, which makes the analysis more efficient and the information easier to find and read. It is an important step to complete before loading: cleaning data after it is already in the warehouse is very difficult and time-consuming.
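The ‘store’ versus ‘stores’ case can be illustrated with a minimal Python sketch of a transform step. The names `CANONICAL`, `clean_label`, and `raw_rows`, along with the sample values, are assumptions made for illustration only; a real ETL job would usually take its variant mappings from a maintained reference table rather than a hard-coded dictionary.

```python
from collections import Counter

# Hypothetical mapping of known variants to one canonical label
# (an assumption for illustration, not a standard list).
CANONICAL = {"stores": "store"}


def clean_label(raw):
    """Trim whitespace, lowercase, and map known variants to a canonical label."""
    label = raw.strip().lower()
    return CANONICAL.get(label, label)


# Made-up category values as they might arrive from a source system.
raw_rows = ["store", "Stores", " store ", "warehouse"]

# Grouping the raw values treats every spelling as its own category.
before = Counter(raw_rows)

# Grouping after cleaning merges the variants into one category.
after = Counter(clean_label(r) for r in raw_rows)

print(len(before))  # 4 distinct groupings
print(dict(after))  # {'store': 3, 'warehouse': 1}
```

Without the cleaning step, a report grouped on this column would show four categories instead of two. Running the step before the load keeps that inconsistency out of the warehouse.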
