-
Hadoop
An open-source software framework for storing data efficiently, processing large datasets (from gigabytes to petabytes), and running applications on clusters of hardware. Hadoop provides very large distributed storage and lets multiple computers analyze a dataset in parallel, which is typically faster than using one large computer.
-
Data Lake
A vast pool of data kept in its raw format, whether structured or unstructured. Users can store data at any scale. The Data Lake was developed to address the limitations of Data Warehouses.
-
Data Warehouse
A large collection of business data that organizations analyze before making decisions. Unlike a Data Lake, a Data Warehouse stores data from multiple databases in a more organized, structured way, which makes it easier to analyze.
-
Data Ingestion: ETL
"Extract-Transform-Load" is a specific type of data ingestion. The process consists of three steps: extract data from different sources, transform the data into a usable and trusted resource, and load that data into either a Data Lake or a Data Warehouse.
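
The three steps can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the sample CSV text, the "orders" table, and the function names extract, transform, load, and run_etl are all hypothetical, and an in-memory SQLite database stands in for the Data Warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; a real pipeline would read from files, APIs, or databases.
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,
3,carol,7.25
"""


def extract(text):
    """Extract: read raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing amount and normalize the rest."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # incomplete record, not trusted enough to load
        cleaned.append(
            (int(row["order_id"]), row["customer"].strip().lower(), float(amount))
        )
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(text, conn):
    """Run the three steps in order."""
    load(transform(extract(text)), conn)
```

Calling run_etl(RAW_CSV, sqlite3.connect(":memory:")) loads only the rows that pass the transform step; the row with the empty amount is dropped.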
-
BI Dashboard
"Business Intelligence Dashboard" is an information management tool used to analyze data and visualize information. The information can be displayed in various forms, including charts, graphs, and reports. A BI Dashboard gives readers a big-picture view of each piece of information being assessed.
-
Real Time Messaging
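Continuous delivery of data, as messages, from data sources to a data storage system or data analysis system over the internet. Each message is sent as soon as it is produced instead of being held for later.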
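
The idea can be sketched with a producer and a consumer that exchange messages through a queue. This is a rough in-process stand-in, assuming Python's standard queue and threading modules in place of a real message broker; the names producer, consumer, and stream, and the None end-of-stream marker, are hypothetical.

```python
import queue
import threading


def producer(q, events):
    """Send each event as soon as it is available, then signal the end."""
    for event in events:
        q.put(event)
    q.put(None)  # hypothetical end-of-stream marker


def consumer(q, sink):
    """Receive messages one at a time and hand them to the storage/analysis side."""
    while True:
        message = q.get()
        if message is None:
            break
        sink.append(message)  # stand-in for writing to storage or updating a metric


def stream(events):
    """Run producer and consumer concurrently and return what was received."""
    q = queue.Queue()
    sink = []
    worker = threading.Thread(target=consumer, args=(q, sink))
    worker.start()
    producer(q, events)
    worker.join()
    return sink
```

The consumer starts handling messages while the producer is still sending, which is the main contrast with Batch Load.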
-
Batch Load
Processing or loading a large amount of data together as a group (a batch) instead of one record at a time, which helps reduce the overall processing time.
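
A minimal sketch of a batch load, assuming an in-memory SQLite database as the target. The "events" table, the default batch size of 1000, and the function names batches and batch_load are hypothetical choices for illustration.

```python
import sqlite3


def batches(records, batch_size):
    """Yield successive lists of at most batch_size records."""
    batch = []
    for record in records:
        batch.append(record)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly smaller, batch


def batch_load(records, conn, batch_size=1000):
    """Insert records in groups, committing once per batch. Returns rows loaded."""
    conn.execute("CREATE TABLE IF NOT EXISTS events (id INTEGER, payload TEXT)")
    loaded = 0
    for batch in batches(records, batch_size):
        with conn:  # commits the batch on success, rolls it back on error
            conn.executemany("INSERT INTO events VALUES (?, ?)", batch)
        loaded += len(batch)
    return loaded
```

Grouping the inserts means the database commits once per batch rather than once per record, which is where most of the time saving usually comes from.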