Data Analytics in Big Data

$30-250 USD

Completed
Posted nine months ago

Paid on delivery
Solve the imbalanced classification problem associated with the ECBDL14 problem, using a sub-problem that has 960,000 instances, 90 attributes, and the following class distribution:
• Class 1.0: 19,089 instances.
• Class 0.0: 940,911 instances.

For the test set, 240,000 instances will be used with the following distribution:
• Class 1.0: 4,911 instances.
• Class 0.0: 235,089 instances.

The problem must be solved using the MLlib library and algorithms available in Spark Packages. You must use at least 4 learning algorithms and 4 preprocessing algorithms:
a) Learning: From the MLlib library, use Decision Tree, Random Forest, and another of the student's choice. Additionally, use at least one other algorithm from the Spark Packages repository.
b) Preprocessing: Use the data balancing preprocessing algorithms ROS and RUS1, and at least two additional preprocessing algorithms. It is recommended to use some of the preprocessing algorithms discussed in class.

You must describe in detail in a PDF file the entire algorithmic process used, showing the results of each of the algorithms used for training and testing, analyzing the behavior of the algorithms, and showing the flows/combinations of preprocessing algorithms. An analysis of data redundancy associated with the practice dataset must be performed. Additionally, you must attach the scripts (this practice must be done in Scala).

Performance Measure
The performance evaluation metric used is TPR x TNR on the test set (the product of the classification rates for each class). The objective is to maximize this performance measure in the study to be conducted.

Please remember the execution instructions on the cluster:
• Do NOT use spark-shell. Use spark-submit.
• Limit the number of nodes to 14.
• Limit the memory used to 4 GB.

Data set:
• Header: /user/datasets/ecbdl14/[login to view URL]
• Train: /user/datasets/ecbdl14/[login to view URL]
• Test: /user/datasets/ecbdl14/[login to view URL]
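The TPR x TNR measure in the brief can be illustrated with a minimal sketch. It is written in plain Python purely to show the arithmetic; the assignment itself requires Scala and Spark MLlib. The function name `tpr_tnr`, the all-negative baseline, and the 1:1 balancing target are assumptions made for illustration and do not come from the posting. ROS and RUS are read here as random oversampling and random undersampling.

```python
def tpr_tnr(tp, fn, tn, fp):
    """Return (TPR, TNR, TPR * TNR) from confusion-matrix counts.

    tpr_tnr is a hypothetical helper name, not part of any library.
    """
    tpr = tp / (tp + fn) if (tp + fn) else 0.0  # recall on class 1.0
    tnr = tn / (tn + fp) if (tn + fp) else 0.0  # recall on class 0.0
    return tpr, tnr, tpr * tnr


# Class counts taken from the brief.
TRAIN_POS, TRAIN_NEG = 19_089, 940_911
TEST_POS, TEST_NEG = 4_911, 235_089

# Assumed baseline: a classifier that predicts class 0.0 for every test row.
tp, fn, tn, fp = 0, TEST_POS, TEST_NEG, 0
accuracy = (tp + tn) / (TEST_POS + TEST_NEG)
tpr, tnr, score = tpr_tnr(tp, fn, tn, fp)

# Assumed 1:1 balancing target on the training set:
# RUS keeps this fraction of the majority class,
# ROS replicates the minority class by this factor.
rus_fraction = TRAIN_POS / TRAIN_NEG
ros_factor = TRAIN_NEG / TRAIN_POS

print(f"baseline accuracy = {accuracy:.4f}, TPR x TNR = {score:.4f}")
print(f"RUS majority fraction = {rus_fraction:.4f}, ROS minority factor = {ros_factor:.2f}")
```

Under these assumptions the baseline scores high on accuracy while its TPR x TNR is zero, which is why the brief evaluates on the product of per-class rates rather than on accuracy.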
Project ID: 37143171

About the project

13 bids
Remote project
Last active nine months ago

Awarded to:
Hi, I've read your description carefully. I have extensive experience with Big Data, PySpark, and Machine Learning. You can check my skills in my portfolio. I've also worked on several similar projects, so I can complete your project with high quality and on time. Looking forward to hearing more about the project from you via chat. Thanks & Best regards!
$50 USD in 2 days
5.0 (1 review)
1.1
13 freelancers are bidding on average $130 USD for this job
Hello, I hope you are doing well. I'm an expert in MATLAB/Simulink, Python, Java, JavaScript and C++ programming, and with a strong mathematical and statistical background I have the flexibility to solve your project. I have extensive practical and theoretical experience implementing different algorithms (such as: state estimation and Kalman filters, controller design, closed-loop stability analysis, signals and systems, signal processing, heuristic optimization, fuzzy logic, neural networks, and machine/deep learning). Evidence of this is in my portfolio. I have read your project description and I can help you (without any plagiarism). Please send me the details of your project. Thanks for your attention. 100% Jobs Completed, 100% On Budget, 100% On Time ⭐⭐⭐⭐⭐ 5-star reviews
$250 USD in 7 days
5.0 (24 reviews)
6.6
Hello, my name is Adnan Gohar and I am an experienced and results-driven professional with a strong background in project management, strategic planning, marketing, data analysis and data processing. I have a track record of success in both corporate and entrepreneurial environments, which makes me well suited to solve your classification problem associated with the ECBDL14 problem. I have extensive knowledge of machine learning (specifically MLlib algorithms), which will be crucial for solving the imbalanced classification problem. Additionally, I have experience using the data balancing preprocessing algorithms ROS and RUS1, which will be necessary for data analysis. My expertise in project management has allowed me to successfully lead cross-functional teams and deliver complex projects on time and within budget. My strong organizational skills and attention to detail ensure successful outcomes. I believe that my combination of experience, skills and expertise makes me the best choice for this project. If you would like more information or would like to discuss further, please do not hesitate to contact me directly. Thank you for your consideration!
$140 USD in 7 days
5.0 (3 reviews)
4.0
Hi. Thanks for your posting. I have just read your proposal and I am sure I can complete the project on time. I am an expert in ML/DL with many years of experience in Big Data. Please contact me to discuss the project in more detail. Waiting for your contact now... Thanks. Best Regards.
$100 USD in 2 days
5.0 (3 reviews)
3.8
Hi there
$190 USD in 5 days
2.6 (1 review)
3.0
Hi there! My name is Ehtisham and I'm an Electrical Engineer and Data Scientist. I have the skillset to help you solve your imbalanced classification problem with a sub-problem that has 960,000 instances, 90 attributes, and a class distribution of Class 1.0: 19,089 instances and Class 0.0: 940,911 instances. I understand that you need to use at least 4 learning algorithms and 4 preprocessing algorithms to solve the problem and I am confident that my skillset can help you solve this problem. I have used various algorithms from the MLlib library such as Decision Tree, Random Forest and another of your choice in past projects so I am confident that my skillset can help you solve this problem. Additionally, I would recommend using some of the preprocessing algorithms discussed in class such as data balancing preprocessing algorithms ROS and RUS1 as part of the algorithmic process used to solve the problem.
$30 USD in 7 days
5.0 (1 review)
1.1
Hi. How are you? Thanks for your post. I am an expert in ML/DL (including Big Data processing) using Python. If you cooperate with me, you will get good results. Please open a chat window to contact me. Waiting for you now. Thank you for considering me.
$50 USD in 1 day
0.0 (0 reviews)
0.0
Hello. I read your requirements and I will do that. Please come to chat and we will discuss this further. I will be waiting for your reply.
$160 USD in 2 days
0.0 (0 reviews)
0.0
SOFTWARE ARCHITECTURE EXPERT GOOD IN PHP, PYTHON, JAVASCRIPT, C PROGRAMMING, SQL AND CUDA. I have gone through your project details and requirements carefully. I am confident I can deliver the project within your expected timeline and budget. Most importantly, I will deliver the project to meet your expectations.
$140 USD in 7 days
0.0 (0 reviews)
0.0
Hello there, I'm a big data analytics engineer with 6 years of experience in Spark, PySpark and MLlib. Let's discuss in more detail. I can start right away and I have done similar projects in the past.
$250 USD in 3 days
0.0 (0 reviews)
0.0
I have understood your requirements and I would like to work on this task for a symbolic price. Let's chat for details.
$30 USD in 7 days
0.0 (0 reviews)
0.0

About the client

Cuenca, Ecuador
0.0
0
Payment method verified
Member since Sep 3, 2023

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)