Describe in brief operation of the classification algorithms

Assignment Help Database Management System
Reference no: EM131534335 , Length: word count:2500

Data Mining -

A dataset in .ARFF format has been provided for you on Studynet. Analyse this dataset using the WEKA toolkit and tools introduced within this module. Produce a report explaining which tools you used and why, what results you obtained, and what this tells you about the data. Marks will be awarded for: variety of tools used, quality of analysis, and interpretation of the results. An extensive report is not required (at most 4000 words), nor is detailed explanation of the techniques employed, but any graphs or tables produced should be described and analysed in the text. A reasonable report could be achieved by doing a thorough analysis using three techniques. An excellent report would use at least four tools to analyse the dataset, and provide detailed comparisons between the results.

You should perform the following steps:

  • Analyse the attributes in the data, and consider their relative importance with respect to the target class. You should explain what kind of classifier you believe might be most suitable for this task, given the information about the attributes alone.
  • Describe in brief the operation of the classification algorithms you intend to use - these algorithms should be taken from those described in the module. Explain their main characteristics and parameters. Additionally explain any other algorithms you intend to use (such as to modify the original dataset).
  • Describe briefly (not with screenshots) the steps you will use in Weka to prepare the data (if necessary) and run your selected classification algorithms. Construct a table and graph of classification performance against training set size for the classifiers. What can you conclude from your results?
  • Analyse the data structure/representation generated by at least three classifiers when trained on the complete dataset. What does your analysis tell you about the data set?
  • Combine the results from the previous steps and all your classifiers to develop a model of why instances fall into particular classes. (Your answer to this question should be understandable by someone who is not a specialist in data mining; imagine you are making a strategic recommendation to the manager of a company.)

Description of dataset:

The following describe the numeric attributes. All instances are for women aged at least 21. Values of 0 in fields like blood pressure represent missing values.

The output class indicates if the woman had diabetes (1) or not (0).

  • Number of times pregnant
  • Plasma glucose concentration a 2 hours in an oral glucose tolerance test
  • Diastolic blood pressure (mm Hg)
  • Triceps skin fold thickness (mm)
  • 2-Hour serum insulin (mu U/ml)
  • Body mass index (weight in kg/(height in m)^2)
  • Diabetes pedigree function
  • Age (years)
  • Class variable (0 or 1)

Attachment:- Assignment Files.rar

Reference no: EM131534335

Questions Cloud

Discuss the marketing process : Provide a definition of marketing from the American Marketing Association. Define the customer value proposition.
Administrators and machinist : Giant manufacturing is a local Massachusetts employer that employs 75 people in various capacities from management to office administrators and machinist.
Prepare journal entries for the sale of inventory : Quick, Drake, and Sage share income and loss in a 3:2:1 ratio. The partners have decided to liquidate their partnership. On the day of liquidation.
Influence over diversity management decisions : Public opinion has a strong influence over diversity management decisions. Consider this week's readings, videos and your personal experiences.
Describe in brief operation of the classification algorithms : Describe in brief the operation of the classification algorithms you intend to use - these algorithms should be taken from those described in the module
Provide a comprehensive discussion of the products : Provide a comprehensive discussion of the products and/or services provided by your organization.
What actions should an emergency-response crew take : In a another scenario, what actions should an emergency-response crew take to protect lives, property, and the environment at a collision scene?
Prepare journal entries to record ford entry in partnership : Part 1. Goering, Zarcus, and Schmit are partners and share income and loss in a 3:2:5 ratio. The partnership's capital balances are as follows.
African american female employee : An African American female employee is told that she cannot come to work with her hair in decorative braids and if she continues to do so

Reviews

len1534335

6/17/2017 3:56:36 AM

Total 2500 words only. Have to submit an assignment on Datemining using WEKA open source tool. Please find attachment for assignment and dataset. If you need any information or detail, please let me know.

Write a Review

Database Management System Questions & Answers

  Create an erd and relatioal schema

Create an ERD and relatioal schema in third normal form based upon the following business rules. (Hint: ensure that all attributes are FULLY DETERMINED by the primary key.) Don't forget to place your normalization arrows on your relational schema.

  Design and implement a test plan

Design and implement a test plan for the exactly-once service. This includes error/fault injection. The system must be able to handle a server crash and begin running from the point in which it ended.

  Build an uml model in microsoft visio

Neatness of your diagram, please use the concepts of model, package, and sub-system well in the Visio model. Create both your diagrams under the static model from the model explorer in Visio.

  Performance of a distributed database

How can replication help the performance of a distributed database and in what situations can replication hurt the performance of a distributed database?

  Design database by developing a fully attributed data model

Design the database by developing a fully attributed data model. The model should show all tables. Each table should have a primary key and may have foreign keys.

  Write sql statement to retrieve all data sorted in order

Generate the view called RepairSummary which shows only RepairInvoiceNumber, TotalCost, and TotalPaid. Illustrate the SQL statement to retrieve all RepairSummary data sorted by TotalCost.

  Aspect of database or enterprise systems

Find one or more current articles (last six months) describing on aspect of database or enterprise systems. Summarize the article(s) and provide your own perspective, and then browse through the other student posts to learn about other related tec..

  Query evaluation and query optimization

Explain what you understand by the terms Query Evaluation and Query optimization - Discuss the ways or strategies you can apply to improve the query performance

  Use the client table as the source for the mailing labels

Use Avery C2163 labels, and use the default font and color settings. (Hint: Make sure the English option button is selected in the Unit of Measure section.

  Develop new user and new role for assistant dba

You need to develop new user named ASSOCDBA1 and new ROLE named JRDBA1 which can be used for assistant DBA. You wish the new role to contain DBA role that the SYSTEM user ha

  1kate and leopold are thinking about-buying the rockwood

1.kate and leopold are thinking about-buying the rockwood motellocated on interstate 70. before they make up their mind

  Do explain the process of normalization

Explain the context in which Normalization is used?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd