**Note:** This area prepares you for courses D426 and D427.

image.png

The DIKW (Data, Information, Knowledge, Wisdom) pyramid illustrates the progression from raw data to valuable insights.

Each IT role works with certain levels of the pyramid, and sometimes with all of them.
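As a rough illustration of that progression, the sketch below walks one small dataset up the four levels. The response-time readings, the 300 ms threshold, and all variable names are hypothetical and chosen only for this example.

```python
from statistics import mean

# Data: raw values with no context (hypothetical response times in ms).
raw_readings = [120, 135, 128, 410, 125]

# Information: the data summarized so it carries meaning.
average_ms = mean(raw_readings)
slowest_ms = max(raw_readings)

# Knowledge: the information interpreted against a rule.
# The threshold is an assumed value, not a standard.
SLOW_THRESHOLD_MS = 300
has_slow_request = slowest_ms > SLOW_THRESHOLD_MS

# Wisdom: a judgment about what to do with that knowledge.
if has_slow_request:
    recommendation = "investigate the slow request"
else:
    recommendation = "no action needed"

print(average_ms, slowest_ms, recommendation)
```

Each step uses only the output of the step before it, which is the point of the pyramid: the higher levels are built from the lower ones.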

Data comes in one of two forms: structured or unstructured.

image.png

(https://prime46.com/structured-and-unstructured-data-for-strategic-decision-making)

Structured data follows a predefined format, such as the tables of a database, while unstructured data has no predefined format or comes in forms that are harder to organize.
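A minimal sketch of the difference, using made-up customer records and a made-up social media post. With structured data you can ask for a field by name; with unstructured text you have to search for what you want.

```python
import re

# Structured: every record has the same predefined fields,
# much like rows in a database table (sample data is invented).
structured_rows = [
    {"customer_id": 1, "name": "Ana", "city": "Denver"},
    {"customer_id": 2, "name": "Ben", "city": "Austin"},
]
# Querying is simple because the "city" field is always present.
denver_customers = [row["name"] for row in structured_rows if row["city"] == "Denver"]

# Unstructured: free text with no predefined fields.
unstructured_post = "Loved the store in Denver! Ana at the front desk was super helpful."
# There is no "city" field, so the text must be searched for known city names.
cities_mentioned = re.findall(r"\b(Denver|Austin)\b", unstructured_post)

print(denver_customers, cities_mentioned)
```

The search only finds cities already listed in the pattern, which hints at why unstructured data is harder to organize.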

As the world has grown and companies continue to gather data from unusual sources, there is a growing collection of unstructured data known as Big Data.

Big Data is a collection of data so large that previous generations of analytical tools cannot process it. Big Data consists largely of unstructured data retrieved from sources such as social media and web pages.

Data Lake – data stored in raw format before it is processed at a warehouse.

Data Hygiene – the cleanliness of data, meaning it is error free: no duplication, no incomplete or outdated data, and no mistakes introduced as data is entered, stored, and managed.

Data Scrubbing – process of amending or removing data in a database that is incorrect, incomplete, improperly formatted, or duplicated.
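To make data scrubbing concrete, here is a small sketch that amends or removes each kind of bad record named above. The `scrub` function, the sample records, and the very rough email check are all assumptions for illustration, not a standard routine.

```python
def scrub(records):
    """Return cleaned records: amended where possible, removed otherwise."""
    seen_emails = set()
    cleaned = []
    for record in records:
        # Amend: trim stray spaces and normalize capitalization.
        name = (record.get("name") or "").strip().title()
        email = (record.get("email") or "").strip().lower()
        # Remove: incomplete records (a required field is missing).
        if not name or not email:
            continue
        # Remove: improperly formatted emails (deliberately rough check).
        if "@" not in email:
            continue
        # Remove: duplicates, matched on the normalized email.
        if email in seen_emails:
            continue
        seen_emails.add(email)
        cleaned.append({"name": name, "email": email})
    return cleaned


raw_records = [
    {"name": "  ana lopez ", "email": "Ana@Example.com"},  # needs amending
    {"name": "Ana Lopez", "email": "ana@example.com"},     # duplicate
    {"name": "Ben Ortiz", "email": None},                  # incomplete
    {"name": "Cy Young", "email": "cy.example.com"},       # bad format
]
clean_records = scrub(raw_records)
print(clean_records)
```

Normalizing before checking for duplicates matters here: the first two records only look identical once spacing and capitalization are cleaned up.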

Globalization refers to the growing interdependence of the world’s economies and cultures, brought about by the trade of goods and services and the flow of information and people.

image.png