What is the type of the kinds of attributes

Assignment Help Basic Statistics
Reference no: EM133245287

1) What is the type of the following kinds of attributes (a) age (in years), (b) salary, (c) ZIP code, (e) height, and (f) intensity of rain? Classify them as continuous or discrete, and as qualitative (nominal or ordinal) or quantitative (interval or ratio).

2)An analyst sets up a sensor network in order to measure the temperature of different locations over a time period. What is the type of attributes collected (temperature)? What is the type of the dataset?

3) It is desired to partition customers into similar groups on the basis of their demographic profile.

a. What features could we use? Provide 3 examples. Would you describe such data as heterogeneous?

b. Which data mining problem is best suited to this task?

4)Suppose that you had a set of arbitrary objects, each representing different characteristics of gadgets. A domain expert gave you the similarity value between every pair of objects. How would you convert these objects into a multidimensional data set for clustering the gadgets ?

5)Suppose that you had a data set, such that each data point corresponds to sea-surface temperatures over a square mile of resolution 10×10. In other words, each data record contains a 10×10 grid of temperature values with spatial locations. You also have some text associated with each 10×10 grid. How would you convert this data into a multidimensional data set? How many features will each data point have?

6) Compute the cosine similarity, Jaccard coefficient (if possible, for binary vectors), Euclidean distance, correlation coefficient for the following vectors, x, y:

a. x = (0, -1, 1, 2,-2), y = (0, -2, 2, 4, -4)

b. x = (0, 1, 0, 0, 0), y = (0, 1, 0, 0, 1)

c. x = (-1, -1, -1, -1, -1), y = (1, 1, 1, 1, 1)

7) Compute the cosine similarity and the Jaccard coefficient, between the two sets {A, B, C} and {A, C, D, E}. Hint: how will you represent each set?

8) Create three documents, A, B, and C such that the Euclidean distance between A and B is smaller than the Euclidean distance between A and C, even though documents A and B have no common words whereas documents A and C have some common words.

9) Are the following similarity measures good or bad for finding similarity in document-term data? Provide a one-line justification for each answer you provide.

a. correlation

b. cosine

c. Euclidean

Reference no: EM133245287

Questions Cloud

What proportion of scores are higher than : Which z-score has approximately 20% of scores in the tail of the distribution?
What is the impact of globalization on the transmission : What are some of the emerging issues in national and international health inequities, and how are health systems attempting to address them?
Identifying ways to improve fuel efficiency : You have been hired to conduct business research for the purpose of identifying ways to improve fuel efficiency without disturbing consumer preference.
Identify the expected stool consistency for ostomies : Stool consistency ranges from liquid to formed, depending on the location of the ostomy. Identify the expected stool consistency for ostomies.
What is the type of the kinds of attributes : 1) What is the type of the following kinds of attributes (a) age (in years), (b) salary, (c) ZIP code, (e) height, and (f) intensity of rain? Classify them as c
What is the value of the test statistic : In order to know whether there is a significant difference between the average yearly incomes of marketing managers in the East and West of the United States, t
True proportion of orange candies : For Mr. p's birthday, Mr. l bought Mr. p a huge Reese's candy machine filled with Reese's Pieces. Mr. l promised Mr. p that 40% of the candies in the machine we
Relationships between variables or differences between group : Think of some challenges you have faced in your current or previous employment. Summarize the problem, develop a research question, and state the null and alter
Mean body weight of a population : 50 years ago, the mean body weight of a population of penguins was 23 kg. Researchers are concerned that the mean body weight is decreasing, so they took a rand

Reviews

Write a Review

Basic Statistics Questions & Answers

  Make payback method analysis more accurate

1. The CCA formula used in capital budgeting was developed to: a. Make payback method analysis more accurate

  Reference groups and referent systems

What are the differences between reference groups and referent systems?

  People recall pleasant and unpleasant

A social psychologist recently developed a childhood memories test that is intended to measure how people recall pleasant and unpleasant memories from childhood

  By the numbers-border communities

By the Numbers: Border Communities" reported on county taxes for a sample of counties in the same state as Allegheny County, a populous county

  Computer model to help predict the profitability

La Quinta Motor Inns developed a computer model to help predict the profitability of sites that are being considered as locations for new hotels.

  Approximation of the binomial distribution

Use the? P-value method. Use the normal distribution as an approximation of the binomial distribution.

  Why do isomers have different boiling points

Why is there an increase in boiling point with larger alkaline molecules? Why do isomers have different boiling points?

  State the null hypothesis ho and the alternative hypothesis

To test the null hypothesis that the mean waist size for males under 40 years equals 34 inches versus the hypothesis that the mean differs from 34, the following data were collected: 33, 33, 30, 34, 34, 40, 35, 35, 32, 38, 34, 32, 35, 32, 32, 34, 36,..

  How many different ways can she select

Ellen's speaks to her advisor and finds out that her program actually only requires that at least one of her four courses be a math course.

  Computer response time is an important application of the

computer response time is an important application of the gamma and exponential distributions. suppose that a study of

  Determining measurements and conversions

Convert the height of the stack of pennies into meters. Calculate how many pennies would be needed to make a tower 1 billion meters tall. Give answer in both standard calculation and in scientific notation.

  Calculate average age of all current first-time mothers

A May 8, 2008, report on National Public Radio (www.npr.org) noted that the average age of firsttime mothers in the United States is slightly higher.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd