How much time people spend on a website

Assignment Help Applied Statistics
Reference no: EM131918477

- This assignment will analyze the data (HotelClickStream.xls) and interpret the results. This dataset includes clickstream data of online transactions for hotel booking in year 2011. Appendix includes the detailed description for the variables.

- Please follow the instructions very carefully to do this assignment! Please do the following analyses and answer the corresponding questions. Please copy/summarize your key results for each question to a word file along with your answers to produce the final report for submission.

1. Please first create the following 2 additional variables into your data

1) REF_D (create a dummy variable indicating whether the transaction was referenced from other website, if not, the final booking website was directly accessed. If no information provided for the variable REF_DOMAIN_NAME, REF_D = 0; otherwise REF_D = 1)

2) LOG_PRICE (take the log transformation of the variable PROD_TOTPRICE using the LOG function in excel)

a) Please provide a summary table showing the top 10 domain names (DOMAIN_NAME) that generated the most volume of transactions the report should look like the following Table (Hint: one way to do this is to use the COUNTIF function in excel). Please summarize briefly your observations from the results

Rank

Domain Names

# of Transactions

1

marriott

524

b) Please provide a summary table showing the top 10 reference domain names (REF_DOMAIN_NAME) that generated the most volume of transactions the report should look like the following Table. Please summarize briefly your observations from the results.

  Rank               

Reference Domain Names          

# of Transactions

1

google

620

c) Please provide summary statistics (N, Max, Min, Mean, and Std.) for variables: DIRECTP_D; REF_D; DURATION; PAGES_VIEWED; LOG_PRICE; and TRANS_FREQ. Please report your summary statistics table and provide short descriptions (a few bullet points) of your observations.

2. Please use the Binary Outcome (Logistic/Logit) regression technique to answer the question on "what are the factors that influence people's decision on whether to book directly on a hotel website or from other third party website?" Please use DIRECT_D as your Dependent Variable (DV); and REF_D, LOG_PRICE, TRANS_FREQ, DURATION, HOUSEHOLD_SIZE, CHILDREN_D, and CONNECTIONSPEED_D as your Independent Variables (IV). Please report and interpret your regression results, which should include the interpretation of the regression coefficients.

3. a) Please use the Count Data (Poisson) regression model to answer the question on "what are the factors that influence people's booking frequencies?" Please use TRANS_FREQ as your DV; and REF_D, LOG_PRICE, PAGES_VIEWED, HOUSEHOLD_SIZE, CHILDREN_D, and CONNECTIONSPEED_D as your IVs. Please report and interpret your regression results, which should include the interpretation of the regression coefficients.

b) Please repeat the analysis in question a) using the Negative Binomial Regression model. Please report and interpret your regression results and coefficients.

c) Please summarize your observations by comparing the results from a) and b).

4. a) Please use the linear regression technique to answer the question on "what are the factors that influence how much time people spend on a website?" Please use DURATION as your DV; and you may decide on the IVs by conducting the similar exercises in Assignment #1. Please ONLY report and interpret your final regression results.

b) Please use the linear regression technique to answer the question on "what are the factors that influence how many pages people views when visiting a website?" Please use PAGES_VIEWED as your DV; and you may decide on the IVs by conducting the similar exercises in Assignment #1. Please ONLY report and interpret your final regression results.

c) Alternatively, you can also use count data model (Poisson or Negarive Binomial) since PAGES_VIEWED is a variable with discrete and non-negative integers. Using the similar set of IVs, do you see significantly different results by using linear regression vs. count data models?

d) Please summarize your observations by comparing the results from a), b), and c).

Attachment:- HotelClickStream.rar

Attachment:- Appendix.rar

Verified Expert

The file solved all 5 problems including subproblems using excel and spss. Spss was used to calculate logit, linear, negative binomial and poisson's regression. All data )(3470) sample was subjected to analysis

Reference no: EM131918477

Questions Cloud

Discuss the subject matter of the photographic exhibition : Discuss the subject matter/theme of the photographic exhibition. How has the photographer visualized the theme through the photographs on display?
What are some accounting changes that a firm should make : ACT 5733 - Advanced Managerial Accounting Mid-Term Exam. What are some accounting changes that a firm should make
What does two percent annual inflation rate mean : What does a 2 percent annual inflation rate mean? What would be its new stock price per share?
Functional and innovative product : How does this difference relate to the supply system that should be used to provide these items to retail outlets?
How much time people spend on a website : WEB ANALYTICS ASSIGNMENT
Statistical quality control : Why does statistical quality control lead to improvements in many businesses?
Determine the expected rate of return for the stock : Determine the expected rate of return for the stock. Should you purchase this stock? Why?
Operations and supply chain management : As it relates to Operations and Supply Chain Management, Design of Products and Services
Use the equation to find hartman unlevered beta : Hartman Motors has $13 million in assets, Use the Hamada equation to find Hartman's unlevered beta,

Reviews

len1918477

3/28/2018 3:21:32 AM

Hi, I have an assignment to complete and I need your help in completing it in 4 days. Please let me know the cost and delivery date. Also every question has to be answered. GROUP (UP TO 3 PEOPLE) WEB ANALYTICS ASSIGNMENT #2 (50 POINTS) DUE DATE: Thursday • Please follow the instructions very carefully to do this assignment! Please do the following analyses and answer the corresponding questions. Please copy/summarize your key results for each question to a word file along with your answers to produce the final report for submission.

Write a Review

Applied Statistics Questions & Answers

  What sampling method is used to select your sample data

Organise your sample data in a spreadsheet as per the instructions in the Excel sheet. What sampling method is used to select your sample data

  Run a hypothesis test to test whether the population

In a simple random sample of 36 families in a certain country, you find that sample mean income to be $9,500 with a sample standard deviation of $2,500.  Run a hypothesis test (α=.05) to test whether the population's mean exceeds $9,000.  What is the..

  A joint probability density functio

A Joint probability density function is given by the following function:                 c * y-2 x-3   if x > 1, y > 1, and x > y f(x,y) =   c * x-2 y-3    if x > 1, y > 1, and y > x                      0                      otherwise

  Convert the rectangular coordinate into polar coordinates

Convert the rectangular coordinate (-8,i (square root of 3) into polar coordinates.

  A cream with alpha hydroxy acid,

Use Skin Cream as your header. Select Stat > Basic Statistics > 1- Proportion... Complete the dialog in order to obtain a 97 percent confidence interval. b) Based on the 97% confidence interval for women exhibiting no improvement, do you have suffici..

  Find the minimum percentage of all possible daily demand

Find the expected demand. Interpret this value, and label it on the graph of part a. Using Chebyshev's Theorem, find the minimum percentage of all possible daily demand values that will fall in the interval [mx ± 2sx].

  Distribution of scores for physical functioning

Distribution of scores for Physical Functioning in women is normal, where are 99% of the women's scores around the mean in this distribution? Round your answer to two decimal places.

  Prepare a table of population numbers by sex and age

Then using Excel, prepare a table of population numbers (not percentages) by sex (in the columns) and age (in the rows)

  Quantitative analysis for management

Formulate a nonlinear program representing the profit maximization problem for the bakery - Formulate the goal programming representation of this problem, with the other three goals having priorities P2, P3, and P4, respectively.

  A large manufacturing company investigated the service

However, the company recently installed a just-in-time system in which suppliers are linked more closely to the manufacturing process. A random sample of 118 deliveries since the just-in-time system was installed reveals that 22 deliveries were late...

  What would be resulting cost equation for maintenance costs

Using the High-Low method of cost estimation and Number of Photocopies as the cost driver, what would be the resulting cost equation for Maintenance Costs

  The alternative hypothesis h1 in symbolic form

The proportion of people aged 18-25 who currently use illicit drugs is equal to 0.20 (or 20%). Express the null hypothesis H0 and the alternative hypothesis H1 in symbolic form. Be sure to use the correct symbols - μ, p, and σ-for the indicated

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd