Construct a table of statistics summarizing your clusters

Assignment Help Other Subject
Reference no: EM133005443

Assessment Description

In this assignment, perform K-Means as a particular type of clustering by programming it in Python, then you will evaluate the use of clustering in a research article, and assess whether or not that research is correct.

To demonstrate completion of this assignment, create a Word document with your working code, screenshots of program results, and written answers to questions. Writing should be professional and rigorous, and include scientific/mathematical justification, where appropriate, for all conclusions reached. Upload your final Jupyter notebook and Word document to the LMS when complete.

Part 1: Operational Tasks

For the following exercises, work with the Framingham_training and Framingham_test data sets. Use only the Sex and Age fields. Standardize Age.

1. Run k-means clustering on the Framingham_training data set, requesting k = 2 clusters.

2. Construct a table of statistics summarizing your clusters. Describe what these two clusters consist of.

3. Perform k-means clustering on the Framingham_test data set, requesting k = 2 clusters.

4. Report the results from your test set. Are your clusters validated?

5. Again run k-means clustering on the Framingham_training data set, this time specifying k = 3 clusters.

6. Construct a table of statistics summarizing your clusters. Describe which records belong to each cluster.

7. Perform k-means clustering on the Framingham_test data set, specifying k = 3 clusters.

8. Report the results from your test set. Are your clusters validated?

9. Run k-means clustering on the Framingham_training data set. Specify k = 4 clusters.

10. Construct a table of statistics summarizing your four clusters. Clearly describe your four clusters.

11. Perform k-means clustering on the Framingham_test data set, requesting k = 4 clusters.

12. Report the results from your test set. Are your clusters validated?

13. Which of the clustering solutions, k = 2, 3, or 4, do you prefer, and why?

Part 2: Mathematical and Statistical Basis

1. Read Liu and Yang (2018). Discuss the clustering issues described in Section 2, including variable versus data clustering, hierarchical clustering, and oblique principal component clustering.

2. Continuing with Liu and Yang (2018), evaluate the self-organizing network discussed in Section 3.2, and in particular the force equations, for its applicability to the clustering issue discussed in the paper. Do these support the experimental design and results outlined in Section 4?

3. Finally, how do these model parameters affect the model-driven predictive model of space-time vectorcardiogram (VCG) signals described in Section 5 of Liu and Yang? Does the multiscale basis function model of VCG signals described in Section 5.1 follow logically from these results? Why or why not?

Include references to all theoretical concepts and works cited. Show all your steps with explanations. Explain major components of complex solutions, code, and any output. Include captions to tables, images, and diagrams. Use formal and detailed mathematical and scientific notation throughout the document.

While APA style is not required for the body of this assignment, solid academic writing is expected, and documentation of sources should be presented using APA formatting guidelines, which can be found in the APA Style Guide, located in the Student Success Center.

Attachment:- Topic - Assignment.rar

Reference no: EM133005443

Questions Cloud

Method and technology used to measure usage : -Ensure you include the following information in your report. Use these as headings in your report to guide you through.
Develop employee complaint systems : The Sarbanes-Oxley Act requires companies to establish ethics codes, develop employee complaint systems, and have antiretaliation policies for employees who act
Discuss the importance of information systems : Discuss the importance of information systems producing expected outputs. Identify real-world examples.
Effect of leadership on australian companies : What is the effect of leadership on the job satisfaction of employees in Australian companies - What is the effect of leadership on employee turnover
Construct a table of statistics summarizing your clusters : Construct a table of statistics summarizing your clusters. Describe what these two clusters consist of and Report the results from your test set
Demonstrate complex knowledge of managing for sustainability : Demonstrate your complex knowledge of Managing for Sustainability, including the concepts that you learn in lessons and the complex relationships
Create a survey using a survey monkey free account : To create a survey using a Survey Monkey free account, Google Forms, or another survey platform that enables you to collect online responses. To write at least
Discuss three strategies local retailers : Besides downsizing, discuss three other strategies local retailers could implement to address labour surplus in light of a temporary decline of sales at their m
What is the best the way manager motivate the staff : What is the best the way manager motivate the staff?

Reviews

Write a Review

Other Subject Questions & Answers

  History of african life in the americas

In Hughes' poem "The Negro Speaks of Rivers," he recounts a history of African life in the Americas

  Develop own professional practice and ethical standards

How you integrate the materials in this subject with what you already know. Sometimes the materials may challenge what you already know

  A dollhouse or death of a salesman

Read Oedipus the King AND one of the following plays: A Dollhouse or Death of a Salesman. Also read the information in the text about tragedy.

  Prepare an essay outlining the proper water flow requirement

Prepare an essay outlining the proper water flow requirements for an NFPA 25 fire protection system (FPS) that is installed within a general purpose assembly.

  Describe the creative employee benefits plan

Using at least three comparison web examples from the industry you used in your "Herzberg's Two-Factor Theory" discussion post this week, delineate a creative.

  Explain the disparity

How women's working impacts their likelihood to file for divorce. Describe the economic outcomes for men and women after divorce and explain the disparity.

  Describe why it is important as professional to familiarize

Describe why it is important as a professional to familiarize yourself with internal and external influences and practices .

  Identify as possibilities in your research approach

Which processes and stressors do you identify as possibilities in your research approach? What specific supports do you have in place that can help you overcome these stressors and complete your research study?

  Changing to different time and place affect film

How may changing the setting to a different time and place affect a film?

  What stage do you think your family members are in

Re-read the theories of racial identity development, as described on pp. 189-194 in Andreatta. In a 2-page paper, discuss your response to the theories outlined. How well (or not) do the models describe you and your own experience of race? How wel..

  Analyze at least two relapse prevention strategies

One of the largest hurdles in recovering from a substance use disorder does not concern getting sober, but rather, staying sober over time.

  Identify and describe the core values of the agency

Identify and describe the core values of the agency. Discuss the degree to which those core values are aligned with advocacy, leadership, or social change.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd