%A HE Dongzhi, ZHANG Jifeng, ZHAO Pengfei %T Parallel implementing probabilistic spreading algorithm using MapReduce programming mode %0 Journal Article %D 0 %J Journal of Shandong University(Engineering Science) %R 10.6040/j.issn.1672-3961.0.2014.367 %P 22-28 %V %N %U {http://gxbwk.njournal.sdu.edu.cn/CN/abstract/article_93.shtml} %8 2018-10-20 %X In order to overcome the limitations of the serial probabilistic spreading algorithm in dealing with large-scale dataset, a parallelization of the algorithm was put forth by using MapReduce. The complex computing tasks were decomposed into a series of MapReduce job flow for distributed parallel processing on Hadoop. The input and output data of every step were stored in the Hadoop distributed file system. Hit ratio was used to compare the parallelizable probabilistic spreading algorithm versus the global ranking method performance. Speedups of the parallelizable algorithm were compared while the amount of data and the number of nodes was different. Experiment results showed that the probabilistic spreading algorithm based on MapReduce had good parallelism and had higher hit ratio than the global ranking method. Data scale that can be handled by the serial algorithm was expanded, and the operation speed of the algorithm was raised.