Estimate a multiple linear regression

Assignment Help Econometrics
Reference no: EM132136265

Basic Econometrics Research Report Group Assignment -

This assignment uses data from the BUPA health insurance call centre. Each observation includes data from one call to the call centre. The variables describe several characteristics of the call (eg the length of the call, the amount of silence in the call), characteristics of the customer (eg state of residence, family type, number of adults and children), and measures of performance (eg net promoter score, sentiment score of the customer). In this assignment we are interested in predicting the net promoter score and the length of the call.

Please use the dataset CallCentre.dta and associated information file CC_DEFINITIONS_.XLSX to answer these questions. Use the software program STATA 15 available through RMIT MyDesktop for all data analysis. This is a group assignment where you can work alone or with up to three other students (a maximum group size of four). All group members will receive the same marks for the assignment. You must submit an electronic copy of your assignment in Canvas in pdf, doc or docx format. Hard copies will not be accepted. Show your tables and calculations as well as answering the questions in full sentences. Please make sure your tables of results are neatly formatted, not just copied and pasted from STATA, and that you write your answers in clear sentences. You should write no more than 1000 words (not including tables/calculations) in total for this assignment. The number of words, tables, graphs, calculations given in parentheses after each question are a guide.

1. Calculate descriptive statistics using the 'summarize' command for the variables net_promoter_score, total_silence, total_silence_weighted, agent_to_cust_index and agent_crosstalk_weighted and present the results in a table. Comment on what we learn about these variables from the descriptives. Graph a scatter plot of net_promoter_score against agent_crosstalk_weighted and describe the relationship between these two variables. (100 words, 1 table, 1 graph)

2. Estimate a multiple linear regression with net_promoter_score as the dependent variable and total_silence_weighted, agent_to_cust_index and agent_crosstalk_weighted as the explanatory (independent) variables. Predict the change in net_promoter_score associated with a 0.1 increase in total_silence_weighted and a 0.01 increase in agent_crosstalk_weighted. Assuming this is the correct model specification, are we sure that total_silence_weighted has a negative effect? [Hint: consider the t-statistic and p-value] (50 words, 1 table, 2 calculations)

3. Add dummy variables to the regression to control for all of the potential effects of State and Package. Make sure the base category is customers with the "HOSPITAL AND EXTRAS" package in NSW. Carefully interpret the estimated coefficient on the package1 dummy variable you have included. Why is this NOT a very important result? [Hint: Use the variable labels to include and interpret the correct variables, consider the descriptive statistics of the dummy variables to interpret their importance] (50 words, 1 table)

4. Include a quadratic specification of the variable "sentiment_score_cust" in the model along with the existing explanatory variables. Calculate and interpret the marginal effect of a 1 point change in "sentiment_score_cust" when sentiment_score_cust = 1 and when sentiment_score_cust=4. (50 words, 1 table, 2 calculations)

5. Explain the conditional mean independence assumption and assess its relevance with respect to the explanatory variable "sentiment_score_cust". [Hint: Think about factors that may be included in the error term of the regression: the customer's experience with the company (positive or negative), the general attitude of the customer towards call centre conversations (positive or negative) and whether these may be correlated with sentiment_score_cust] (100 words)

6. As agent time is a cost to their business, BUPA may also be interested in predicting lcall_duration (the natural log of call_duration). Design a regression model to predict lcall_duration. Choose the explanatory variables to include, and whether to include them as dummies/ logs/ polynomials/ interactions as you feel appropriate. Present the results of the descriptive statistics and your final regression model in tables. Discuss the statistical significance of the explanatory variables in your model. Discuss how you have designed your model with reference to the "Gauss Markov" assumptions and whether these assumptions are likely to be met. Interpret the results of THREE of your explanatory variables, which you consider to be the key drivers of lcall_duration (ie the length of the call). Do NOT include the variables net_promoter_score, nps_group3, sentiment_score_cust, call_duration or call_durationsq in your model. (400 words, 2 tables, 3 calculations).

Reference no: EM132136265

Questions Cloud

Compensation and benefits package they want : What do millennials need to consider to get the compensation and benefits package they want?
Reporting of quality performance : Discuss the organizations involved in public reporting of quality performance data for healthcare organizations.
Why do we tend to blame others : How much do you know about the social world? There are 10 statements. Two of the 10 statements are false, the rest are true. Which two are false?
Decentralized methods of control : Compare and contrast the hierarchical and decentralized methods of control.
Estimate a multiple linear regression : Basic Econometrics Research Report Group Assignment - Estimate a multiple linear regression with net_promoter_score as the dependent variable
Explain the contributions that teams : Explain the contributions that teams make and how managers can help teams be more effective.
Leadership and management development : Individual differences in leadership and management development: why not clone managers?
Enterprise systems for the organization : Explain why integrating organizational functions using enterprise systems for the organization is preferable/necessary.
Examples of employment or employee laws : Please assist with giving two examples of employment or employee laws that you believe were vital in changing or creating today's workplace

Reviews

len2136265

10/9/2018 10:05:16 PM

Requires STATA software. Use the software program STATA 15 available through RMIT MyDesktop for all data analysis. This is a group assignment where you can work alone or with up to three other students (a maximum group size of four). All group members will receive the same marks for the assignment.

len2136265

10/9/2018 10:05:09 PM

You must submit an electronic copy of your assignment in Canvas in pdf, doc or docx format. Hard copies will not be accepted. Show your tables and calculations as well as answering the questions in full sentences. Please make sure your tables of results are neatly formatted, not just copied and pasted from STATA, and that you write your answers in clear sentences. You should write no more than 1000 words (not including tables/calculations) in total for this assignment. The number of words, tables, graphs, calculations given in parentheses after each question are a guide.

len2136265

10/9/2018 10:05:03 PM

Rubric for marking - 1. Descriptive statistics A) Present descriptive statistics table, B) comment on descriptives, C) present and comment on graph. 2. Multiple linear regression A) Estimate regression model, B) present table, C) two predictions, D) comment on total_silence_weighted effect 3. Dummy variables A) Include dummy variables correctly, B) Comment on package1 coefficient C) Why not an important result 4. Quadratic Specification A) Include quadratic specification correctly and present results in table. B) Calculate marginal effect when sentiment_score_cust=1 C) Calculate marginal effect when sentiment_score_cust=4

len2136265

10/9/2018 10:04:56 PM

5. Conditional mean independence A) Explain conditional mean independence assumption. B) Discuss with reference to the variable "sentiment_score_cust". 6. Design model 1 A) Present tables of preliminary regressions/descriptive statistics B) Present tables of final regression results C) Discuss appropriate specification (logs/polynomials) D) Discuss appropriate specification (dummies) E) Discuss statistical significance of coefficients in model. 6. Design model 2 A) Discuss Gauss_Markov assumptions 1-3 B) Discuss Gauss_Markov assumptions 4-5 C) Prediction 1 D) Prediction 2 E) Prediction 3. 7. Neat formatting of tables. 8. Clear expression of answers in full sentences. There will be up to 5 additional marks awarded for presentation of your answers (neat formatting of tables and clear expression of answers in full sentences).

Write a Review

Econometrics Questions & Answers

  What is the interpretation of the various coefficients

Estimate the preceding regression. What is the interpretation of the various coefficients? Give a logical reason for why the results are this way

  Can you think of a reason why this might be so

It is interesting to note that technical improvements in agriculture or manufacturing have generally been slow to arise in countries that have relied on slave labor. Can you think of a reason why this might be so?

  For the perfectly competitive firm find total revenue

Discuss the differences you observe in your answers above between the monopoly and perfectly competitive firm. x-axis 0 8 18 21 30 price and cost per unit y-axis 0 20 33 35 40 quanity.

  Difference between contractionary fiscal policies

How might contractionary and expansionary fiscal policies affect your organization?

  Formulate the maximization problem of the entrepreneur

Prove that if the entrepreneur has turned down production with some technique a' at date t, he will never accept technique a' at date t + s, for s > 0 (i.e., he will not accept it for any possible realization of events between dates t and t + s).

  Does the ability to move first give the employeran advantage

If so, how? As the employee, is there anything you could do to realize a higher payoff?

  Would certain types of industry be more sensitive

Would certain types of industry be more sensitive to taxes on land as opposed to buildings?

  What happens to output if govt purchases are increased

This question considers a closed economy Keynesian model that is augmented to include transfers payments to consumers (Tr = Transfers) that increase consumers' disposable incomes and lower government savings.Derive and calculate the Keynesian mult..

  How many drivers cross the bridge

If the city wishes to raise as much revenue as possible from the tolls, where will the city decide to charge a toll: in the inelastic portion of the demand curve, the elastic portion of the demand curve, or the unit elastic portion? Explain.

  What price will theatre charge for daytime tickets

Cinema Theater has estimated the following demand functions for its movies: Daytime demand, QD = 400 - 50 PD Nighttime demand, QN = 200 - 20 PN The marginal cost of serving another customer is $5 and its fixed costs are $100.

  How does the deal impact the consumers opportunity set

A recent newspaper circular advertised the following special on tires: "Buy three, get the forth tire for free-limit one free tire per customer." If a consumer has $500 to spend on tires and other goods and each tire usually sells for $50.

  Is winsome widgets in long-run equilibrium

Consider the following table of costs for the Winsome Widget Factory, which operates in a perfectly competitive market. The market price faced by this firm is $6.00 per widget. a. Fill in the formula for AFC, AVC, ATC, MC, TR, MR, and Total Profit.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd