The name of the course is "Big Data Proc & Analytics_2021SP" The course was a course on HDFS and HADOOP using Java. The second half was about using apache SPARK using python. Virtual machines were...




The name of the course is


"Big Data Proc & Analytics_2021SP"






The course was a course on HDFS and HADOOP using Java.




The second half was about using apache SPARK using python.




Virtual machines were provided.


one with hadoop and HDFS ser up on it


that can be found at: goo.gl/9CG2W2




One with apache spark set up on it


that can be found at:
https://goo.gl/5uSTpT




Virtual machines can be loaded in VM Ware Workstation Player that can be found at





https://my.vmware.com/en/web/vmware/downloads/details?downloadGroup=PLAYER-


1610&productId=1039&rPId=55792




I need the assignment in the attached document labeled "BigDatsa&AnalyticsProject" done.




The deliverables are




1. Code


2. Demo Video – record a video using screen recording software such as “screencasify”,


“screencast-o-matic”, “zoom recording”.


3. Project Report. The report should include introduction, related work, approach, results, and


conclusion.






You can choose either HDFS Hadoop and Java to complete the project or Apache Spark and Python to complete the project.




I need you to use one of the virtual machines to do the assignment OR complete the project and have the demo video you must make run in one of the virtual machines.


OR just have the demo video ran in linux. Really the virtual machines aren't necessary if you already have hadoop hdfs setup. It just needs to come from within linux.










I don't have a preference for HDFS Hadoop or Apache spark. which ever one you want to use to complete the assignment for me is fine.

Apr 24, 2021
SOLUTION.PDF

Get Answer To This Question

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here