(The tutorial covers the mainstream technologies of the Apache big data ecosystem and related tools: Hadoop, Hive, Spark, Flume, Kafka, Azkaban, Zookeeper, Sqoop, Atlas, Kylin, Presto, Kerberos, Ranger, Zabbix, and others, each in its latest stable version. The data governance content is more comprehensive: the course builds an integrated Kerberos+Ranger permission management system, uses Python+Shell scripts to automate data quality monitoring, uses Zabbix+Grafana to monitor cluster performance, and uses Atlas, currently a mainstream metadata management application, to manage data lineage, ensuring the security, consistency, and reliability of the data warehouse.)