- Data distribution across the nodes.
- Resource allocation and database connections.
- Execution life-cycle on submitting a Job.
- Storage of data.
- Details related to the Metastore.
**Note: Refer to the links mentioned below under each ecosystem for a detailed explanation.**
-
HDFS 🐘
-
SQOOP
- Sqoop Incremental Load:
-
HIVE 🐝
-
SPARK 💥
-
HBASE 🐋