
sushantmimani/PageRank_MapReduce


PageRank_Hadoop

Goal: Implement PageRank in MapReduce to explore the behavior of an iterative graph algorithm.

Overall Workflow Summary

  1. Pre-processing Job: Turns the input Wikipedia data into a graph represented as adjacency lists.
  2. PageRank Job: Runs 10 iterations of PageRank over the adjacency-list graph.
  3. Top-k Job: From the output of the last PageRank iteration, get the 100 pages with the highest PageRank and output them, along with their ranks, from highest to lowest.
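
The three steps above can be sketched in memory as follows. This is only an illustration of the data flow, not the repository's Hadoop code: the edge-list input, the damping factor of 0.85, the uniform redistribution of rank from pages without out-links, and all function names are assumptions made for this example.

```python
import heapq
from collections import defaultdict


def build_graph(edges):
    """Pre-processing step: turn (source, target) pairs into adjacency lists.

    Every page that appears only as a link target still gets an (empty) entry.
    """
    graph = defaultdict(list)
    for src, dst in edges:
        graph[src].append(dst)
        graph.setdefault(dst, [])
    return dict(graph)


def pagerank_iteration(graph, ranks, damping=0.85):
    """One PageRank iteration, split into a map-like and a reduce-like phase."""
    n = len(graph)
    contributions = defaultdict(float)
    dangling_mass = 0.0

    # "Map": each page sends rank / out_degree to every page it links to.
    for page, links in graph.items():
        if links:
            share = ranks[page] / len(links)
            for target in links:
                contributions[target] += share
        else:
            # Assumption: rank of pages without out-links is spread uniformly.
            dangling_mass += ranks[page]

    # "Reduce": sum incoming contributions per page and apply damping.
    return {
        page: (1 - damping) / n
        + damping * (contributions[page] + dangling_mass / n)
        for page in graph
    }


def top_k(ranks, k=100):
    """Top-k step: the k highest-ranked pages, from highest to lowest."""
    return heapq.nlargest(k, ranks.items(), key=lambda item: item[1])


def run_pagerank(edges, iterations=10, k=100):
    """Chain the three steps: pre-process, iterate, select the top k."""
    graph = build_graph(edges)
    ranks = {page: 1.0 / len(graph) for page in graph}
    for _ in range(iterations):
        ranks = pagerank_iteration(graph, ranks)
    return top_k(ranks, k)


if __name__ == "__main__":
    # Tiny hypothetical link graph, used only to exercise the sketch.
    sample_edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "A"), ("D", "C")]
    for page, rank in run_pagerank(sample_edges, iterations=10, k=3):
        print(f"{page}\t{rank:.4f}")
```

In a MapReduce setting each iteration would typically be its own job, with the adjacency list passed through alongside the rank so that the next iteration can read both from the previous job's output.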
