Compute the normalized Euclidean distances

Reference no: EM133001559

Problem #1 (Cluster Analysis): For the Excel file Colleges and Universities Cluster Analysis Worksheet (D2L Content > Datasets by Chapter > Chapter 10 > CollegesandUniversitiesClusterAnalysisWorksheet.xlsx), compute the normalized Euclidean distances between Berkeley, Cal Tech, and UCLA, and illustrate the results in a distance matrix.
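As a rough illustration of the calculation, the sketch below converts each variable to a z-score and then takes the ordinary Euclidean distance between the standardized rows. It assumes that "normalized" means z-scores based on the column mean and sample standard deviation; the school labels and numbers are made-up placeholders, not values from the worksheet, and the function names are hypothetical.

```python
import math
import statistics

def z_normalize(rows):
    """Convert every column to z-scores (column mean, sample std. deviation)."""
    columns = list(zip(*rows))
    means = [statistics.mean(c) for c in columns]
    sds = [statistics.stdev(c) for c in columns]
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in rows]

def distance_matrix(z_rows):
    """Pairwise Euclidean distances between already-normalized rows."""
    return [[math.dist(a, b) for b in z_rows] for a in z_rows]

# Placeholder data: three schools, three numeric variables (NOT worksheet values).
names = ["School A", "School B", "School C"]
data = [
    [1200.0, 30.0, 20000.0],
    [1300.0, 25.0, 30000.0],
    [1400.0, 20.0, 40000.0],
]

z_rows = z_normalize(data)
matrix = distance_matrix(z_rows)
for name, row in zip(names, matrix):
    print(name, [round(d, 3) for d in row])
```

For the actual problem, the z-scores would presumably be computed over every school in the worksheet first, and only then would the three rows of interest be compared.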

Problem #2 (Cluster Analysis): For the three clusters identified in Table 10.3, find the average and standard deviations of each numerical variable for the schools in each cluster and compare them with the average and standard deviation for the entire data set. Does the clustering show distinct differences among these clusters?
This is Table 10.3 (taken from the author's PPT - but matches the textbook)
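One way to organize the comparison is to group each variable by cluster label and compute the mean and sample standard deviation per group alongside the overall figures. The sketch below does this for a single variable; the values and cluster labels are invented placeholders rather than the contents of Table 10.3, and `cluster_summary` is a hypothetical helper name.

```python
import statistics
from collections import defaultdict

def summarize(values):
    """Return (mean, sample std. deviation); std is NaN for a single value."""
    sd = statistics.stdev(values) if len(values) > 1 else float("nan")
    return statistics.mean(values), sd

def cluster_summary(values, labels):
    """Mean and std. deviation per cluster, plus the whole data set under 'all'."""
    groups = defaultdict(list)
    for value, label in zip(values, labels):
        groups[label].append(value)
    summary = {label: summarize(vals) for label, vals in groups.items()}
    summary["all"] = summarize(values)
    return summary

# Placeholder data for one numeric variable and its cluster assignments.
values = [10.0, 12.0, 20.0, 22.0, 30.0, 32.0]
labels = [1, 1, 2, 2, 3, 3]

summary = cluster_summary(values, labels)
for key, (mean, sd) in summary.items():
    print(key, round(mean, 2), round(sd, 2))
```

Clusters whose means sit far apart relative to their within-cluster standard deviations, and whose standard deviations are smaller than the overall one, would suggest the clustering separates the schools on that variable.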

Problem #3 (Classification): Using the approach described in Example 10.6, classify the first record in the worksheet Records to Classify in the Excel file Credit Risk Data (D2L Content > Datasets by Chapter > Chapter 10 > CreditRiskData.xlsx), using the k-NN algorithm for k = 1 to 5. Use only Checking, Savings, Months Customer, and Months Employed.
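A minimal sketch of k-nearest-neighbor classification follows, assuming Euclidean distance on features that have already been normalized and a simple majority vote among the k closest training records. Whether Example 10.6 uses exactly these choices is not stated here, so treat them as assumptions; the four-feature records and the "Low"/"High" labels are placeholders, not rows from the Credit Risk Data file.

```python
import math
from collections import Counter

def knn_classify(train, query, k):
    """Majority vote among the k training records closest to the query.

    With an even k a tie is possible; Counter then returns the tied label
    that was seen first, i.e. the one belonging to the nearer neighbor.
    """
    nearest = sorted(train, key=lambda rec: math.dist(rec[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Placeholder training records: (features, class label). Features are assumed
# to be normalized already so that no single variable dominates the distance.
train = [
    ([0.0, 0.0, 0.0, 0.0], "Low"),
    ([1.0, 0.0, 0.0, 0.0], "Low"),
    ([0.0, 2.0, 0.0, 0.0], "High"),
    ([3.0, 0.0, 0.0, 0.0], "High"),
    ([0.0, 0.0, 4.0, 0.0], "High"),
]
query = [0.1, 0.0, 0.0, 0.0]

results = {k: knn_classify(train, query, k) for k in range(1, 6)}
for k, label in results.items():
    print(f"k={k}: {label}")
```

Running the classifier for each k from 1 to 5 shows whether the predicted class is stable or flips as more neighbors are included.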

Problem #4 (Classification): Use discriminant analysis to classify the new records in the Excel file Credit Approval Decisions Discriminant Analysis (D2L Content > Datasets by Chapter > Chapter 10 > CreditApprovalDecisionsDiscriminantAnalysis.xlsx) using only Credit Score and Years of Credit History as input variables.
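For two classes and two input variables, one textbook form of discriminant analysis is Fisher's linear discriminant with a pooled covariance matrix and equal prior probabilities. The sketch below implements that form; the course's Excel-based procedure may differ, so this is an assumed stand-in rather than the prescribed method. The credit scores, years of history, and function names are illustrative placeholders.

```python
import statistics

def mean_vector(rows):
    return [statistics.mean(col) for col in zip(*rows)]

def scatter(rows, mean):
    """2x2 matrix of summed cross-products of deviations from the class mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for row in rows:
        d = [row[0] - mean[0], row[1] - mean[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def fit_lda(class0, class1):
    """Return discriminant weights and the midpoint cutoff score."""
    m0, m1 = mean_vector(class0), mean_vector(class1)
    s0, s1 = scatter(class0, m0), scatter(class1, m1)
    dof = len(class0) + len(class1) - 2
    p = [[(s0[i][j] + s1[i][j]) / dof for j in range(2)] for i in range(2)]
    det = p[0][0] * p[1][1] - p[0][1] * p[1][0]
    inv = [[p[1][1] / det, -p[0][1] / det], [-p[1][0] / det, p[0][0] / det]]
    diff = [m1[0] - m0[0], m1[1] - m0[1]]
    w = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
         inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    mid = [(m0[0] + m1[0]) / 2, (m0[1] + m1[1]) / 2]
    return w, w[0] * mid[0] + w[1] * mid[1]

def classify(x, w, cutoff):
    """1 if the discriminant score exceeds the cutoff (class1), else 0."""
    return 1 if w[0] * x[0] + w[1] * x[1] > cutoff else 0

# Placeholder records: (credit score, years of credit history).
rejected = [(600.0, 2.0), (620.0, 5.0), (640.0, 3.0)]
approved = [(700.0, 10.0), (720.0, 12.0), (740.0, 14.0)]
new_records = [(730.0, 11.0), (610.0, 3.0)]

w, cutoff = fit_lda(rejected, approved)
new_labels = [classify(x, w, cutoff) for x in new_records]
print(new_labels)
```

A new record is assigned to the approved group when its weighted score lies on the approved side of the midpoint between the two class means.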

Problem #5 (Residual Analysis and Regression Assumptions): Use the results from Problem #11 (D2L Content > Datasets by Chapter > Chapter 8 > National Football League.xlsx) to analyze the residuals to determine if the assumptions underlying the regression analysis are valid. In addition, use the standard residuals to determine if any possible outliers exist.
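The outlier check can be sketched as follows: fit the regression, divide each residual by the standard deviation of the residuals, and flag observations whose standardized residual falls outside roughly plus or minus 2. The example assumes a single predictor and a cutoff of 2 (some texts use 3); the data are invented, with one deliberately unusual point, and are not taken from the National Football League file.

```python
import statistics

def fit_line(x, y):
    """Least-squares intercept and slope for one predictor."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope

def standardized_residuals(x, y):
    """Residuals divided by their sample standard deviation."""
    intercept, slope = fit_line(x, y)
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    sd = statistics.stdev(residuals)
    return [r / sd for r in residuals]

# Placeholder data: y is roughly 2x except for one unusual observation.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
y = [2.0, 4.0, 6.0, 8.0, 10.0, 30.0, 14.0, 16.0, 18.0, 20.0]

z = standardized_residuals(x, y)
flagged = [i for i, value in enumerate(z) if abs(value) > 2]
print("possible outliers at positions:", flagged)
```

Plotting the residuals against the predictor and against the fitted values would complement this check when judging linearity and constant variance.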

Problem #6 (Building Good Regression Models): The State of Ohio Department of Education has a mandated ninth-grade proficiency test that covers writing, reading, mathematics, citizenship (social studies), and science. The Excel file Ohio Education Performance (D2L Content > Datasets by Chapter > Chapter 8 > OhioEducationPerformance.xlsx) provides data on success rates (defined as the percent of students passing) in school districts in the greater Cincinnati metropolitan area with state averages.

Part A: Suggest the best regression model to predict math success as a function of success in the other subjects by examining the correlation matrix; then run the Regression tool for this set of variables. (Note that "All" is not an academic subject!)

Part B: Develop a multiple regression model to predict math success as a function of success in all other subjects using the systematic approach described in the chapter. Is multicollinearity a problem?

Part C: Compare models in parts A and B. Are they the same? Why or why not?
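A correlation matrix among the candidate predictors is a common first screen for multicollinearity. The sketch below computes Pearson correlations and lists predictor pairs whose absolute correlation exceeds 0.7, a frequently quoted rule of thumb that is assumed here rather than taken from the problem. The subject scores are placeholders, not the Ohio data, and the helper names are hypothetical.

```python
import math
import statistics

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def correlation_matrix(columns):
    """Nested dict: corr[a][b] is the correlation between columns a and b."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}

def high_pairs(corr, threshold=0.7):
    """Pairs of distinct variables whose |correlation| exceeds the threshold."""
    names = list(corr)
    return [(a, b) for i, a in enumerate(names) for b in names[i + 1:]
            if abs(corr[a][b]) > threshold]

# Placeholder success rates for three predictor subjects in four districts.
predictors = {
    "writing": [70.0, 80.0, 90.0, 60.0],
    "reading": [72.0, 79.0, 91.0, 62.0],
    "science": [50.0, 40.0, 60.0, 55.0],
}

corr = correlation_matrix(predictors)
flagged = high_pairs(corr)
print(flagged)
```

Highly correlated predictor pairs are candidates for dropping one variable, which is one reason the model found by inspecting correlations in Part A may differ from the one reached by systematic elimination in Part B.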

Problem #7 (Regression with Categorical Independent Variables): A national homebuilder builds single-family homes and condominium-style townhouses. The Excel file House Sales (D2L Content > Datasets by Chapter > Chapter 8 > HouseSales.xlsx) provides information on the selling price, lot cost, type of home, and region of the country (M=Midwest, S=South) for closings during one month.

Part A: Develop a multiple regression model for sales price as a function of lot cost and type of home without any interaction term.

Part B: Determine if an interaction exists between lot cost and type of home and find the best model.
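To show how the two models differ in structure, the sketch below codes type of home as a 0/1 dummy variable and fits ordinary least squares with and without a lot cost × type product term. The 0/1 coding (1 for townhouse), the noise-free synthetic prices, and the small normal-equations solver are all assumptions made for illustration; none of the numbers come from the House Sales file.

```python
def solve(a, b):
    """Solve a linear system by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def ols(design, y):
    """Least-squares coefficients from the normal equations X'X b = X'y."""
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(design, y)) for i in range(k)]
    return solve(xtx, xty)

# Synthetic homes: (lot cost, type dummy), where 1 = townhouse (assumed coding).
homes = [(10.0, 0), (20.0, 0), (30.0, 0), (40.0, 0),
         (15.0, 1), (25.0, 1), (35.0, 1), (45.0, 1)]
# Prices generated from known coefficients so the fit can be checked.
prices = [100 + 2 * lot + 10 * t + 0.5 * lot * t for lot, t in homes]

design_a = [[1.0, lot, t] for lot, t in homes]            # no interaction
design_b = [[1.0, lot, t, lot * t] for lot, t in homes]   # with interaction

coefs_a = ols(design_a, prices)
coefs_b = ols(design_b, prices)
print([round(c, 4) for c in coefs_a])
print([round(c, 4) for c in coefs_b])
```

In practice the decision about keeping the interaction term would rest on its p-value and on adjusted R-squared from the regression output, which this sketch does not compute.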

Attachment:- Credit Risk Data.rar

Verified Expert

This task provides a clear working procedure for discriminant analysis and multiple regression models. Discriminant analysis was used for the loan approval process. Multiple regression was used to determine the factors that influenced math success. The best model was selected using the adjusted R-squared value. The two models compared are the ones with and without an interaction term.

Reviews

len3001559

9/30/2021 12:44:35 AM

Answer on the same sheet and use Excel if needed. More files are being emailed for the rest of the questions.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd