(The tutorial covers all mainstream technologies of Apache Ecology: Hadoop, hive, spark, flume, Kafka, Azkaban, zookeeper, sqoop, Atlas, kylin, presto, Kerberos, Ranger, ZABBIX, etc., all of which use the latest stable version. The content of data governance is more comprehensive. An integrated rights management system of Kerberos + Ranger is built. Python + shell script is used to realize automatic data quality monitoring. Blood relationship management ensures the safety, consistency and reliability of the warehouse.)