Construct a table of statistics summarizing your clusters

Reference no: EM133005443

Assessment Description

In this assignment, you will perform k-means clustering by programming it in Python, then evaluate the use of clustering in a research article and assess whether that research is correct.

To demonstrate completion of this assignment, create a Word document with your working code, screenshots of program results, and written answers to questions. Writing should be professional and rigorous, and include scientific/mathematical justification, where appropriate, for all conclusions reached. Upload your final Jupyter notebook and Word document to the LMS when complete.

Part 1: Operational Tasks

For the following exercises, work with the Framingham_training and Framingham_test data sets. Use only the Sex and Age fields. Standardize Age.
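The preparation step above can be sketched as follows. This is a minimal illustration, not the required solution: the function name `prepare`, the `Age_z` column, and the demo rows are hypothetical, and the sketch assumes Sex is already stored as a numeric code. It standardizes each data set with its own mean and standard deviation; reusing the training mean and standard deviation for the test set is an equally defensible alternative.

```python
import pandas as pd

def prepare(df: pd.DataFrame) -> pd.DataFrame:
    """Keep only Sex and Age and add a z-scored Age column (Age_z)."""
    out = df[["Sex", "Age"]].copy()
    # Population standard deviation (ddof=0), so Age_z has mean 0 and SD 1.
    out["Age_z"] = (out["Age"] - out["Age"].mean()) / out["Age"].std(ddof=0)
    return out

# Hypothetical rows standing in for the Framingham_training file.
demo = pd.DataFrame({"Sex": [1, 2, 1, 2], "Age": [40, 50, 60, 70], "Other": [0, 0, 0, 0]})
prepared = prepare(demo)
print(prepared)
```

With the real files, the same function would be applied once to the training data and once to the test data after loading them with pandas.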

1. Run k-means clustering on the Framingham_training data set, requesting k = 2 clusters.

2. Construct a table of statistics summarizing your clusters. Describe what these two clusters consist of.

3. Perform k-means clustering on the Framingham_test data set, requesting k = 2 clusters.

4. Report the results from your test set. Are your clusters validated?

5. Again run k-means clustering on the Framingham_training data set, this time specifying k = 3 clusters.

6. Construct a table of statistics summarizing your clusters. Describe which records belong to each cluster.

7. Perform k-means clustering on the Framingham_test data set, specifying k = 3 clusters.

8. Report the results from your test set. Are your clusters validated?

9. Run k-means clustering on the Framingham_training data set. Specify k = 4 clusters.

10. Construct a table of statistics summarizing your four clusters. Clearly describe your four clusters.

11. Perform k-means clustering on the Framingham_test data set, requesting k = 4 clusters.

12. Report the results from your test set. Are your clusters validated?

13. Which of the clustering solutions, k = 2, 3, or 4, do you prefer, and why?
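The run, summarize, and validate cycle in tasks 1 to 12 can be sketched as below. This is a hedged outline under stated assumptions: it uses scikit-learn's `KMeans` rather than a hand-written implementation, the data are randomly generated stand-ins (not the Framingham files), Sex is assumed to be coded 1/2, and the names `make_demo`, `cluster_summary`, and `results` are hypothetical. Validation here means comparing the training and test summary tables for each k by eye.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

def make_demo(n, rng):
    """Hypothetical stand-in for a Framingham file: random Sex (1/2) and Age."""
    df = pd.DataFrame({
        "Sex": rng.integers(1, 3, size=n),
        "Age": rng.integers(30, 71, size=n),
    })
    df["Age_z"] = (df["Age"] - df["Age"].mean()) / df["Age"].std(ddof=0)
    return df

def cluster_summary(df, k, seed=0):
    """Fit k-means on Sex and standardized Age; return one row per cluster."""
    features = df[["Sex", "Age_z"]].to_numpy(dtype=float)
    model = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(features)
    labeled = df.assign(Cluster=model.labels_)
    table = labeled.groupby("Cluster").agg(
        n=("Age", "size"),
        mean_age=("Age", "mean"),
        sd_age=("Age", "std"),
        mean_sex=("Sex", "mean"),
    )
    # Cluster labels are arbitrary, so order rows by their statistics to make
    # the training and test tables easier to compare side by side.
    return table.sort_values(["mean_age", "mean_sex"]).reset_index(drop=True)

rng = np.random.default_rng(0)
train_demo = make_demo(300, rng)
test_demo = make_demo(150, rng)

results = {}
for k in (2, 3, 4):
    results[k] = (cluster_summary(train_demo, k), cluster_summary(test_demo, k))
    print(f"k = {k}\nTraining:\n{results[k][0]}\nTest:\n{results[k][1]}\n")
```

If the training and test tables for a given k show similar cluster sizes (as proportions), mean ages, and sex mixes, that is informal evidence the clusters are validated. Sorting by mean age is only a heuristic for lining up clusters across the two tables.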

Part 2: Mathematical and Statistical Basis

1. Read Liu and Yang (2018). Discuss the clustering issues described in Section 2, including variable versus data clustering, hierarchical clustering, and oblique principal component clustering.

2. Continuing with Liu and Yang (2018), evaluate the self-organizing network discussed in Section 3.2, and in particular the force equations, for its applicability to the clustering issue discussed in the paper. Do these support the experimental design and results outlined in Section 4?

3. Finally, how do these model parameters affect the model-driven predictive model of space-time vectorcardiogram (VCG) signals described in Section 5 of Liu and Yang? Does the multiscale basis function model of VCG signals described in Section 5.1 follow logically from these results? Why or why not?

Include references to all theoretical concepts and works cited. Show all your steps with explanations. Explain major components of complex solutions, code, and any output. Include captions to tables, images, and diagrams. Use formal and detailed mathematical and scientific notation throughout the document.

While APA style is not required for the body of this assignment, solid academic writing is expected, and documentation of sources should be presented using APA formatting guidelines, which can be found in the APA Style Guide, located in the Student Success Center.

Attachment: Topic - Assignment.rar

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd