Find Jobs
Hire Freelancers

Data Mining

$30-250 USD

Inställt
Publicerad över fem år sedan

$30-250 USD

Betalning vid leverans
Data Mining (word count: 2000 words) The ‘Hepatitis’ Data set (provided in arff. Format. Will send seperately) contains information about patients affected by the Hepatitis disease. The task is to predict if these patients have or have not hepatitis (Histology: Yes or No). You should use the Weka data mining package, which is installed in the university computers and also available to download from: [login to view URL]~ml/weka/ You should hand in a report covering the following: a) Select a suitable tree building algorithm and build a model. Describe how you split the data for training and testing purposes. Explain the splitting method. (9 points) b) Interpret the output results: - The accuracy rates; - Which attributes were used to make the predictions; - How many nodes and leaves you obtained; - Include a visual tree diagram showing the structure of the model that you built. (12 points, with a greater attention to the accuracy rate interpretation) c) Give a detailed technical description of the classification model: - What tree induction method is utilised; - Which attribute selection criteria is used; - Give an example of how the attributes were selected for growing the tree. (20 points) d) Change the confidence factor to 35%, report any change in the model accuracy, explaining reasons behind the change. (5 points) e) Set the ‘REP’ parameter (Reduced Error Pruning) to ‘TRUE’. Explain the meaning of this operation. Report and explain any change in the model accuracy. (7 points) f) Set the parameter ‘unpruned’ to ‘TRUE’, Report and explain any change in the model accuracy and in the tree structure. Explain which pruning method for this algorithm is used. Carefully explain how pruning was performed. (11 points) g) Report on the model’s comparative ability to other 2 models of your choice (for example, neural networks or SVM or Bayesian network etc.) to predict the class variable. Which model classified data most accurately and what are the possible reasons of its prevalence? (20 points) h) Show a confusion matrix for the model and interpret it. Show a ROC curve and a Lift chart for the decision and interpret them. (6 points) i) Generate a set of rules along the subtree path: Ascites – Class – Spiders – Bilirubin – Sex – Class ‘No’. What would you recommend to reduce the number of rules in the set? Hint: speculate about Support and Confidence. (10 points) Note: the allocated points are given as tentative benchmarks only. The report will be assessed based on overall understanding of the data mining process as well, i.e. relating to the domain of the problem, cross-referencing between the points in the questions, notifying anything interesting etc. The report structure, quality of references will also be evaluated.
Project ID: 18336756

Om projektet

8 anbud
Distansprojekt
Senaste aktivitet fem år sedan

Ute efter att tjäna lite pengar?

Fördelar med att lägga anbud hos Freelancer

Ange budget och tidsram
Få betalt för ditt arbete
Beskriv ditt förslag
Det är gratis att registrera sig och att lägga anbud på uppdrag
8 frilansar lägger i genomsnitt anbud på $167 USD för detta uppdrag
Använd avatar
Hello? How are you? I have good experiences in "Data Mining" as you can see my profile for these (Big Data Sales, Data Mining, Data Processing, Java, Python). I have been working for 7 yrs in this scope. While we contract and work in our jobs, I will get paid once you have confirmed satisfied result. If I do not deliver satisfied result, I will never get paid from you. We can discuss more details to understand more easily if you have other infos. Hope to work with you. Thank you.
$155 USD Om 3 dagar
4,9 (78 omdömen)
6,4
6,4
Använd avatar
Hi, I am a highly trained on Data Entry ,Data Analysis, Copy Typing, Data Mining, Research, Web search, scraping, Product Add, Any Type Of Ecommerce Cart, Expert with great knowledge of Word, Excel. Thanks
$277 USD Om 3 dagar
4,8 (108 omdömen)
6,2
6,2
Använd avatar
Hi there, I have read your project description and i'm confident i can do this project for you perfectly.I still have a few questions. please leave a message on my chat so we can discuss the budget and deadline of the project. Thanks . .
$155 USD Om 3 dagar
5,0 (10 omdömen)
5,2
5,2
Använd avatar
I had a PhD in data mining. I had worked on many data mining projects. I can do your project prefectly. Contact me. Many thanks
$166 USD Om 3 dagar
4,9 (14 omdömen)
5,0
5,0
Använd avatar
Being an experienced academic writer and well researcher. I am 100% confident I can do this project perfectly. I have already written PhD and Masters Level Paper for UK and US Students and I can easily work on it. I am familiar with Harvard, APA, MLA, Chicago and Oxford Reference Style I have been writing all sorts of scholastic and professional documents for many years. Please consider my bid for high quality work. Thanks
$155 USD Om 3 dagar
5,0 (2 omdömen)
3,1
3,1
Använd avatar
Hello? I am a Data scientist with over 5 years experience in using python for data mining, analysis, visualization and modeling. I can provide intuitive python codes using several libraries to produce high quality results. I look forward to your response.
$120 USD Om 2 dagar
2,6 (11 omdömen)
3,6
3,6
Använd avatar
I will use feature tools for feature engineering and compare my results with the software to make sure I understand the software’s algorithms & limitations for a better report & interpretations
$30 USD Om 5 dagar
0,0 (0 omdömen)
0,0
0,0
Använd avatar
I have worked on weka and I know how to build this tree and explain it. Taking 5 days as conservative approach just in case computers give some grief.
$277 USD Om 5 dagar
0,0 (0 omdömen)
0,0
0,0

Om kunden

Flagga för UNITED KINGDOM
Southampton, United Kingdom
5,0
45
Verifierad betalningsmetod
Medlem sedan mars 20, 2017

Kundverifikation

Tack! Vi har skickat en länk för aktivering av gratis kredit.
Något gick fel med ditt e-postmeddelande. Vänligen försök igen.
Registrerade Användare Totalt antal jobb publicerade
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Laddar förhandsgranskning
Tillstånd beviljat för geolokalisering.
Din inloggningssession har löpt ut och du har blivit utloggad. Logga in igen.