🔑Data engineering is a method of handling and working with data to convert raw data into useful information.
💡Structured data is organized in rows and columns, while unstructured data lacks a predefined structure.
⏱️Hadoop is an open-source framework used for handling big data and consists of components such as HDFS, YARN, and MapReduce.
🌐The 4 V's of big data are volume, veracity, velocity, and variety.
🔧Blocks and block scanner are important concepts in HDFS, where data is stored in blocks and the scanner verifies the integrity of the blocks.