Big Data Project: Shangpinhui (尚品汇) E-commerce Data Warehouse
The tutorial covers the mainstream technologies of the Apache big data ecosystem and related tools: Hadoop, Hive, Spark, Flume, Kafka, Azkaban, Zookeeper, Sqoop, Atlas, Kylin, Presto, Kerberos, Ranger, Zabbix, and others, all in their latest stable versions. Data governance is covered more comprehensively. An integrated permission management system is built with Kerberos + Ranger. Python + Shell scripts implement automated data quality monitoring. Data lineage management ensures the security, consistency, and reliability of the warehouse.