What is the overall error rate on the validation data?

Reference no: EM131668737

Question: Each year, the American Academy of Motion Picture Arts and Sciences recognizes excellence in the film industry by honoring directors, actors, and writers with awards (called the Oscars) in different categories. The most notable of these awards is the Oscar for Best Picture.

The Data worksheet in the file Oscars contains data on a sample of movies nominated for the Best Picture Oscar. The variables include the total number of Oscar nominations across all award categories, the number of Golden Globe awards won (the Golden Globe award show precedes the Oscars), whether the movie is a comedy, and whether the movie won the Best Picture Oscar.

There is also a variable called Chrono Partition that specifies how to partition the data into training, validation, and test sets. The value t identifies observations that belong to the training set, the value v identifies observations that belong to the validation set, and the value s identifies observations that belong to the test set. Partition the data using XLMiner's Standard Partition procedure by selecting Use partition variable in the Partitioning options area and specifying the variable Chrono Partition.

Construct a logistic regression model to classify winners of the Best Picture Oscar. Use Winner as the output variable and Oscar Nominations, Golden Globe Wins, and Comedy as input variables. Perform an exhaustive-search best subset selection with the number of best subsets equal to 2.

a. From the generated set of logistic regression models, select one that you believe is a good fit. Express the model as a mathematical equation relating the output variable to the input variables. Do the relationships suggested by the model make sense? Try to explain the relationships.
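As a hedged sketch of what the equation in part (a) looks like, the general form of a logistic regression on the three candidate inputs is shown below. The coefficients b_0 through b_3 are placeholders, not fitted values; they must be read from the XLMiner output, and a model chosen by best subset selection may drop one or more inputs, in which case the corresponding term is omitted. Here p denotes the estimated probability that Winner = 1.

```latex
\ln\!\left(\frac{p}{1-p}\right)
  = b_0 + b_1\,\text{OscarNominations} + b_2\,\text{GoldenGlobeWins} + b_3\,\text{Comedy}

p = \frac{1}{1 + e^{-\left(b_0 + b_1\,\text{OscarNominations} + b_2\,\text{GoldenGlobeWins} + b_3\,\text{Comedy}\right)}}
```

In this form, a positive coefficient b_j means that a one-unit increase in that input multiplies the odds of winning by e^{b_j}, and a negative coefficient lowers the odds. That reading is the basis for judging whether each relationship makes sense.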

b. Using the default cutoff value of 0.5 for your logistic regression model, what is the overall error rate on the validation data?
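The calculation in part (b) can be sketched in Python as follows. This is a minimal illustration, not XLMiner's implementation: the function name `overall_error_rate` is hypothetical, the sample probabilities and outcomes are made up rather than taken from the Oscars file, and the sketch assumes that a predicted probability at or above the cutoff is classified as a winner.

```python
def overall_error_rate(probabilities, actuals, cutoff=0.5):
    """Return the fraction of observations misclassified at the given cutoff."""
    if len(probabilities) != len(actuals):
        raise ValueError("probabilities and actuals must have the same length")
    if not probabilities:
        raise ValueError("at least one observation is required")
    # Assumption: a probability at or above the cutoff is classified as a winner (1).
    predictions = [1 if p >= cutoff else 0 for p in probabilities]
    # Count observations whose predicted class differs from the actual class.
    errors = sum(1 for pred, act in zip(predictions, actuals) if pred != act)
    return errors / len(actuals)


# Made-up illustration values, NOT taken from the Oscars file.
example_probabilities = [0.90, 0.20, 0.60, 0.40]
example_actuals = [1, 0, 0, 1]
example_rate = overall_error_rate(example_probabilities, example_actuals)
print(example_rate)
```

With the real data, the inputs would be the predicted probabilities and the Winner values for the observations marked v in Chrono Partition.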

c. Note that each year there is only one winner of the Best Picture Oscar. Knowing this, what is wrong with classifying a movie as a winner or not using a cutoff value? What is the best way to use the model to predict the annual winner? Out of the six years in the validation data, in how many years does the model correctly identify the winner?
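One way to approach part (c) is to pick, within each year, the nominee with the highest predicted probability instead of applying a cutoff. The sketch below illustrates that idea under stated assumptions: the function names, the dictionary keys (`year`, `movie`, `probability`, `winner`), and all sample rows are hypothetical and do not come from the Oscars file.

```python
def predict_annual_winners(rows):
    """Map each year to the nominee row with the highest predicted probability."""
    best = {}
    for row in rows:
        year = row["year"]
        # Keep the first nominee seen for a year, then replace it only when
        # a later nominee has a strictly higher predicted probability.
        if year not in best or row["probability"] > best[year]["probability"]:
            best[year] = row
    return best


def count_correct_years(rows):
    """Count the years in which the top-ranked nominee actually won."""
    picks = predict_annual_winners(rows)
    return sum(1 for pick in picks.values() if pick["winner"] == 1)


# Made-up illustration rows, NOT taken from the Oscars file.
example_rows = [
    {"year": 1, "movie": "Film A", "probability": 0.30, "winner": 0},
    {"year": 1, "movie": "Film B", "probability": 0.45, "winner": 1},
    {"year": 2, "movie": "Film C", "probability": 0.70, "winner": 0},
    {"year": 2, "movie": "Film D", "probability": 0.55, "winner": 1},
]
example_picks = predict_annual_winners(example_rows)
example_correct = count_correct_years(example_rows)
print(example_picks[1]["movie"], example_picks[2]["movie"], example_correct)
```

The made-up rows also show the problem with a fixed cutoff: at 0.5, year 1 would have no predicted winner and year 2 would have two, while ranking within the year always yields exactly one pick.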

d. Use your model to classify the 2011 nominees for Best Picture. In Step 3 of XLMiner's Logistic Regression procedure, check the box next to In worksheet in the Score new data area. In the Match variable in the new range dialog box, (1) specify New Data in the Worksheet: field, (2) enter the cell range A1:E9 in the Data range: field, and (3) click Match variable(s) with same name(s). Completing the procedure produces an LR_New Score worksheet that contains the predicted probability that each 2011 nominee will win the Best Picture Oscar. Which film did the model consider most likely to win the 2011 Best Picture Oscar? Was the model correct?

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd