Advanced Database Please solve it using Hadoop
Assume that you have the following relations, each relation represents a dataset oftext files stores on HDFS.1. ratings ( UserID, MovieID, Rating ) // where rating represent the rating between(from 1 to 5) given by the user to the corresponding movieID2. users ( UserID, Gender, Age)3. movies ( MovieID, Title, Genres ) // where genres in the classification of themovie such as comedy, children, action, ….Suppose you have been given a task to find the average rating for each movie inthe form (movieID, Title, avg_rating). Computing the average rating must considerthe following:4. only children and comedy movies5. consider rating values that are above 26. consider ratings from users who’s age is above 25What to submit:• First briefly describe how to implement the above task in MapReduce jobs in anefficient way• specify how many jobs you need• what the purpose of each MapReduce job (what it does)• what the Map and Reduce functions do in each MapReduce Job. write that inpseudo code similar to what we did in the lectures of relational algebra inMapReduce
Already registered? Login
Not Account? Sign up
Enter your email address to reset your password
Back to Login? Click here