52ky posted on 2022-4-17 15:16:38

Big Data Project: 尚品汇 (Shangpinhui) E-commerce Data Warehouse

The tutorial covers the mainstream technologies of the Apache big data ecosystem and the tools commonly used alongside it: Hadoop, Hive, Spark, Flume, Kafka, Azkaban, Zookeeper, Sqoop, Atlas, Kylin, Presto, Kerberos, Ranger, Zabbix, and others, each in its latest stable version. Data governance is covered more comprehensively. An integrated permission management system is built on Kerberos + Ranger. Python + Shell scripts implement automated data quality monitoring. Data lineage management ensures the security, consistency, and reliability of the warehouse.






