Distributed Data Mining for Multiple Sourced Heterogeneous Datasets: A Survey

Xing-ying LI, Shan-zi LI, Yi-xuan WU, Ai-jia HE, Xiao-ya HUANG, Xin ZHAO

Abstract


In the information age of the 21st century, a large amount of information is collected and applied. However, due to the heterogeneity of system environment for data storage and computing, how to mine these distributed data sources has become a valuable research topic that attracted more and more attention. In this paper, we firstly presented the problem scenario and main challenges confronting with the problem of distributed data mining on multiple sourced heterogeneous data sets. Then, we surveyed research works related to the problem and elicited their main features on different technology domains to show current distributed solutions for different data mining algorithm categories. Finally, we reviewed in detail the research works and discussed the challenges remained in the distributed data mining problem for multiple sourced heterogeneous data sets.

Keywords


Multiple sourced heterogeneous data sets, Distributed data mining, Distributed algorithms


DOI
10.12783/dtcse/cmsam2018/26563

Refbacks

  • There are currently no refbacks.