Find Jobs
Hire Freelancers

hadoop, spark , cloud service, cluster

$10-30 USD

Avslutat
Publicerad ungefär två år sedan

$10-30 USD

Betalning vid leverans
deadline 16th april Project : Hadoop and Spark The purpose of this project is to support your in-class understanding of how data analytics stacks work and get some hands-on experience in using them. You will need to deploy Apache Hadoop as the underlying file system and Apache Spark as the execution engine. You will then develop several small applications based on them. Task 1: Launch a cluster of virtual machines in a cloud environment (e.g., AWS, Azure, or GCP). You will need to have one node as the master and at least two nodes as workers (slaves). Task 2: Deploy the HDFS service on the cluster. Task 3: Download the text version of Pride and Prejudice from Project Gutenberg, and save it to the HDFS cluster. Task 4: Deploy the Spark service on the cluster. Task 5: Use the file in HDFS as input, run a wordcount program in Spark to count the number of occurrences of each word. Sort the words by count, in descending order, and return a list of the (word, count) pairs for the 20 most used words. Task 6: Write a Spark program that uses Monte Carlo methods to estimate the value of $π$. Since the area of a circle of radius r is $A = πr^2$ , one way to estimate π is to estimate the area of the unit circle. A Monte Carlo approach to this problem is to uniformly sample points in the square $[−1, 1] × [−1, 1]$ and then count the percentage of points that land within the unit circle. The percentage of points within the circle approximates the percentage of the area occupied by the circle. Multiplying this percentage by 4 (the area of the square $[−1, 1] × [−1, 1]$) gives an estimate for the area of the circle What to submit: a report on describing the commands you run, code in any file(s), your observations, and output from all the steps in each task. Also explain the purpose of each step in your report.
Project ID: 33456766

Om projektet

5 anbud
Distansprojekt
Senaste aktivitet två år sedan

Ute efter att tjäna lite pengar?

Fördelar med att lägga anbud hos Freelancer

Ange budget och tidsram
Få betalt för ditt arbete
Beskriv ditt förslag
Det är gratis att registrera sig och att lägga anbud på uppdrag
5 frilansar lägger i genomsnitt anbud på $162 USD för detta uppdrag
Använd avatar
I have a hadoop cluster in my organization with 28 nodes. I use this for my research work and student project. I can do this work. I will provide public URL to access the cluster via Ambari as well as ssh.
$100 USD Om 7 dagar
0,0 (0 omdömen)
0,0
0,0
Använd avatar
Hi, I am an experienced Big Data Engineer with hands-on knowledge of HADOOP and SPARK. I can help you with this project, we can setup 1 master node and 2 workers nodes for hadoop and spark services. I can write efficient code to solve WordCount and $π$ value problem with Monte Carlo methods. I will finally develop a report with solution, their result and explanation of each step. Lets discuss details over chat. Regards, Safi
$280 USD Om 3 dagar
0,0 (0 omdömen)
0,0
0,0
Använd avatar
Hello Sir. I just saw your project in my freelancer feeds and I carefully read it's description and I am really interested in completing it in fast deadline and reasonable budget. My name is Ashish Yadav and I am DevOps Engineer as well as Bigdata hadoop expert. experience with following techs: - Containers: Docker, kubernetes - Cloud: AWS, GCP - DevOps Automation: Jenkins, Maven, Gitlab, Terraform, Ansible - Monitoring: ELK - Hadoop - System Administration: Linux, Debian, Redhat, Centos I have done what you have required project in automation that can be setup only one click using ansible in aws instance at any os. Be sure, Sir, that I am very excited to talk further about this project and be awarded delivery quality solution for your needs. But I have few questions to ask you before we start. So please reach me over chat so we can discuss about it. Thank you very much.
$30 USD Om 7 dagar
0,0 (0 omdömen)
0,0
0,0

Om kunden

Flagga för UNITED STATES
woodside, United States
5,0
2
Verifierad betalningsmetod
Medlem sedan maj 24, 2021

Kundverifikation

Tack! Vi har skickat en länk för aktivering av gratis kredit.
Något gick fel med ditt e-postmeddelande. Vänligen försök igen.
Registrerade Användare Totalt antal jobb publicerade
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Laddar förhandsgranskning
Tillstånd beviljat för geolokalisering.
Din inloggningssession har löpt ut och du har blivit utloggad. Logga in igen.