Describe your implementation and experiment setup

Assignment Help Computer Engineering
Reference no: EM131919047

Assignment: Clustering

Your task for this assignment is to implement and evaluate the k-means clustering algorithm.

1. Implement the k-means clustering algorithm.

a. You can use any programming language that you are familiar with.

b. The program should be executable with at least 3 parameters: the name of the dataset file, k, and the name of the output file.

c. The output file should contain numerical class labels (formatted as one number per row) for all the records in the test dataset and report the sum squared error (SSE) in the last row.

d. You only need to handle numerical attributes (categorical attributes are not required).

2. Select two datasets from the UCI repository and evaluate the algorithm using SSE and another metric of your choice (e.g. BCubed precision and recall or Jaccard score if you have the class labels) with varying k. (I intend to run your implementation on the fisher iris dataset without the labels.

3. Write a brief report to:

a. Describe the datasets.

b. Describe your implementation and experiment setup, e.g. any preprocessing you performed on the dataset such as normalizing the attributes, distance metrics you used, etc.

c. Present the experiment results with varying k.

d. Discuss the insights and conclusions from your experiments.

4. This is an individual assignment.

5. Submission. You will upload two items to Canvas: your PDF report and a zip or tar file.

This zip/tar file must contain:

Your source files (include your name(s) in commented form at the top of all source files), the executable, a README file explaining how to compile/run your program, the output files for your test datasets.

Reference no: EM131919047

Questions Cloud

Design and operation of the new management accounting system : Write a report to senior management explaining how you, as the management accountant, may contribute to the design and operation of management accounting system
What is the probability that at least : If 7 people appear at random to give blood, what is the probability that at least one of them is Rh-Negative?
Calculate the expected average flow time : New Time Videos (NTV) is a new online video rental service. In the field, it is trying to compete by offering its customers access to all of the major.
Examine the number of parking spaces needed : Pizza Time Restaurants is building a new pizza place and needs to determine how big to make the various parts of its facility.
Describe your implementation and experiment setup : Describe your implementation and experiment setup, eg. any preprocessing you performed on dataset such as normalizing attributes, distance metrics you used etc.
How many customers on average can mike salon process : Assuming that the waiting area always has at least one customer in it, how many customers on average can Mike's salon process in a day.
Mean life of compact fluorescent light bulbs : If a light bulb manufacturing company wants to? estimate, with 95?% ?confidence, the mean life of compact fluorescent light bulbs to within ±175 hours
Identify and give examples of the steps that can be taken : Identify and give examples of the steps that can be taken at the functional level to improve Post's efficiency, product quality, an ability to innovate.
What was the average miles per hour per trip : What was the average miles per hour per trip? (Show work)

Reviews

Write a Review

Computer Engineering Questions & Answers

  Discuss has facebook solved the mobile ad problem

By allowing instant access to information, online purchases, a camera, and social media, our smartphones have become a part of our daily routines

  What is the definition of the term baud rate

What does it mean when a signal is self-clocking? What is the definition of the term baud rate? How does baud rate differ from bits per second?

  What follows from requiring calls to be a hierarchy

What follows from requiring CALLS to be a hierarchy? Draw the CALLS graph for your sample program, and check whether the graph is a DAG or not.

  Create a new sound with one half of the first sound

Create a new sound with one half of the first sound, then add the two sounds together for the length of two sounds, and then add second half of second sound.

  Describing the hardware and software used to support

describing the hardware and software used to support personal workgroup and enterprise computing within your current

  Distinguish between a file and a directory

What is a file? Distinguish between a file and a directory. Distinguish between a file and a file system. Why is a file a generic concept and not a

  What two basic functions does ip implement

What two basic functions does IP implement, What four key mechanisms does IP use to provide its service

  Security onion vm capture packets from your virtual machine

What is the name of the file from the GET command - briefly describe the significance of the content in the screenshot.

  What is the purpose of the accept field in an http request

Describe the purposes of the five most commonly used HTTP methods. What is the purpose of the Accept field in an HTTP request?

  Define adt money that has operation for arithmetic operation

Define an ADT Money that has operations for arithmetic operations on real numbers having exactly two digits to the right of the decimal point.

  Elaborate on the steps and processes you would have to do

Elaborate on the steps and processes you would have to do to try to insure that your recovery plan would work in the event of a disaster.

  Write down a program for the following decision table

The National Earthquake Information Center has asked you to write down a program for the following decision table to characterize an earthquake based on its Richter scale number.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd