Microsoft Word - ProjectDescription2.docx MGS 655 Project 2 Design of a MapReduce Program Objective: Use the song files from the Digital Humanities project uploaded earlier on UBLearns to determine...

1 answer below »
FILE ATTACHED


Microsoft Word - ProjectDescription2.docx MGS 655 Project 2 Design of a MapReduce Program Objective: Use the song files from the Digital Humanities project uploaded earlier on UBLearns to determine the number of non-English words in each song. You can use an English dictionary (such as https://github.com/dwyl/english- words) to figure out whether the word is an English word or not. You are allowed to use your own dictionary of choice, but please provide a link to the dictionary if using a different one than the one mentioned above. a. You are required to write a MapReduce Program / Function as appropriate to attain the above objective. The MapReduce Program should be as detailed as possible. If coding in Java/Python, please provide a well-documented code. If not coding with a programming language of your choice, please provide well documented function, which can be executed by hand. In either case, the grader should be able to determine the sequence of actions suggested to attain the above objective. Also please provide a clear definition of the inputs to the mapper and reducer, contents of the functions and other relevant information. b. You are required to provide at least two examples of the output from your program. If using a programming language, print the output from your execution. If not programming, show the complete execution of the proposed function on at least two songs. In both cases, you should identify the MapReduce tuples produced from any two songs from the archive. c. You will be provided with a link on UBBox. Please upload all code and appendices in a folder named with your first name-last name. As an example, if the name is John Doe, the folder should be named John- Doe. Please ensure that I have access to the materials in this folder. d. You should provide the output from your programs for all the songs in this folder on UBBox. If executing by hand, please select at least five random songs to illustrate how your program works. e. How would you improve your design? f. Can you comment on whether this parallel programming method speeded up your computation time or not? If coding, please provide exact statistics (time for completion of jobs) and if not coding provide an explanation of why there should/should not be any benefit using the method above.
Answered 4 days AfterDec 06, 2021

Answer To: Microsoft Word - ProjectDescription2.docx MGS 655 Project 2 Design of a MapReduce Program Objective:...

Sathishkumar answered on Dec 07 2021
121 Votes
With the effective use of data structures(list and dictionary) in python, we can implement this system
e.How would you improve your design?
We can add some more words in dictionary, while analysing multiple songs.
f. Can you comment on whether this parallel programming method
speeded up your computation time or not? If coding, please provide
exact statistics (time for completion of jobs) and if not coding provide
an explanation of why there should/should not be any benefit using the
method above.
Computation time will remains the same.
For parallelism, it is important to divide the problem into sub-units that do not depend on other sub-units (or less dependent). A problem where the sub-units are totally independent of other...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here