kamalchaturvedi/multiLevelMapReduce

multiLevelMapReduce

This is a Java-based implementation of a multi-level MapReduce program (conversion, sorting, and filtering).

Input: A large data file from an e-commerce website, with each record given as "SessionId,Timestamp,ItemId"

Use case: Calculate the top-N most-clicked items in the month of April

Output: A part file with each line given as "ItemId ClickCount"

Implementation: The first MapReduce job converts the input file into output of the form "ItemId, Count". To find the top-N items with the highest counts, a second MapReduce job follows, with a sortComparator between its mapper and reducer so that records reach the reducer sorted by count in descending order. Note: Because records arrive in descending order, the combiner and reducer of the second job are optimized to forward only the first N items they receive.
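The two stages described above can be illustrated with a small in-memory sketch. This is not the repository's Hadoop code: it uses plain Java collections in place of mappers, reducers, and the sort comparator, and the class name `TopNClicksSketch`, the method names, and the sample records are all hypothetical. It also assumes that timestamps are ISO-8601 strings in UTC (for example `2014-04-07T10:51:09.277Z`), which the input description does not confirm.

```java
import java.time.Instant;
import java.time.Month;
import java.time.ZoneOffset;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory illustration of the two-stage pipeline (not Hadoop code).
class TopNClicksSketch {

    // Stage 1: keep only April records and count clicks per ItemId,
    // mirroring a mapper that emits (ItemId, 1) and a reducer that sums.
    static Map<String, Long> countAprilClicks(List<String> lines) {
        Map<String, Long> counts = new HashMap<>();
        for (String line : lines) {
            String[] fields = line.split(",");
            if (fields.length < 3) {
                continue; // skip malformed records
            }
            // Assumption: the timestamp is an ISO-8601 instant in UTC.
            Month month = Instant.parse(fields[1].trim())
                    .atZone(ZoneOffset.UTC)
                    .getMonth();
            if (month == Month.APRIL) {
                counts.merge(fields[2].trim(), 1L, Long::sum);
            }
        }
        return counts;
    }

    // Stage 2: sort by count in descending order (the role of the sort
    // comparator) and forward only the first n entries (the role of the
    // optimized combiner and reducer). Ties are broken by ItemId.
    static List<Map.Entry<String, Long>> topN(Map<String, Long> counts, int n) {
        List<Map.Entry<String, Long>> sorted = new ArrayList<>(counts.entrySet());
        sorted.sort((a, b) -> {
            int byCount = Long.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        return new ArrayList<>(sorted.subList(0, Math.min(n, sorted.size())));
    }

    public static void main(String[] args) {
        // Hypothetical sample records in "SessionId,Timestamp,ItemId" form.
        List<String> lines = Arrays.asList(
                "1,2014-04-07T10:51:09.277Z,A",
                "1,2014-04-07T10:54:09.868Z,B",
                "2,2014-04-08T11:00:00.000Z,A",
                "3,2014-05-01T00:00:00.000Z,C",
                "4,2014-04-30T23:59:59.999Z,A",
                "5,2014-04-15T12:00:00.000Z,B");

        // Prints "A 3" and then "B 2" in the "ItemId ClickCount" format.
        for (Map.Entry<String, Long> e : topN(countAprilClicks(lines), 2)) {
            System.out.println(e.getKey() + " " + e.getValue());
        }
    }
}
```

In a real Hadoop job the descending order would typically come from a comparator registered with `Job.setSortComparatorClass` rather than from an in-memory sort, and cutting the stream off after N records is only safe because of that ordering.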