论文部分内容阅读
针对煤炭企业数据处理能力的不足,将云计算技术应用到煤炭企业数据共享体系中,提出了一个利用Hadoop平台对煤炭企业数据进行高效共享的系统。首先,介绍了Hadoop平台及其关键技术;而后建立了应用于煤炭企业的数据共享模型,该模型抽取源数据并通过数据集成存储于数据仓库;最后结合煤炭企业实际需求,设计了数据管理平台,完成了数据集成、模型和并行关联算法的设计。
In view of the lack of data processing capability of coal enterprises, cloud computing technology is applied to the data sharing system of coal enterprises, and a system for efficiently sharing data of coal enterprises by using Hadoop platform is proposed. First of all, the Hadoop platform and its key technologies are introduced. Then a data sharing model applied to coal enterprises is established. The model extracts the source data and stores it in the data warehouse through data integration. Finally, based on the actual needs of the coal enterprises, a data management platform is designed, Completed the data integration, model and parallel association algorithm design.