Understand and apply key statistical theories

Assignment Help Management Information Sys
Reference no: EM132983038 , Length: word count:2000

MIS772 Predictive Analytics - Deakin University

Data Analysis and Report

Learning Outcome 1: Understand and apply key statistical theories and data mining concepts

Assignment Objectives

This assignment aims for students to learn how to ...
- Articulate problems and solutions in business terms
- Gain insights from data
- Prepare data for different models
- Develop classification models
- Assess and report model performance.

Case Study Description

SkytraxAI approached you to develop a RapidMiner process to assess travellers' experience with major airlines around the world, through which practical implications can be offered to airline managers to improve their services and traveller's experience in terms of their overall ratings.

SkytraxAI provided you with a sample of more than 41,000 customer reviews of 362 airlines, which were generated during the period 2002-2015. The data set includes the following information:
• airline_name: Name of the airline
• link: relative URL path to the airline webpage on Skytrax.
• title: review title
• author: user id of the reviewer on Skytrax.
• author_country: the country of origin where the traveller comes from.
• date: date when the review was posted to Skytrax
• cabin_flow: type of cabin flown by the traveller
• overall_rating: a number to reflex the satisfactory level of traveller (lowest 1 to highest 10).
• seat_comfort_rating: rating for seat comfortability (lowest 1 to highest 5).
• cabin_staff_rating: rating for cabin staff services (lowest 1 to highest 5).
• food_beverage_rating: rating for food and drink served onboard (lowest 1 to highest 5).
• inflight_entertainment_rating: rating for inflight entertainment system (lowest 1 to highest 5).
• value_money_rating: rating for value for money (lowest 1 to highest 5).
SkytraxAI are interested in generating some insights about the airlines, especially answering the below questions:
A. What are the (top 10) most popular airlines in this data set? Among them, which airline received the best visitors' experience in term of overall rating? Which airlines offer the best value for money?
B. What are the (top 5) most popular airlines used by travellers from the top two countries with the most travellers?
C. What are the (top 2) most popular airlines for business travel? Market competitors are defined as businesses that target similar groups of customers. Based on this definition, identify the market (author_country), where these two identified airlines are competitors for business travellers.
D. How to reliably predict if the travellers are satisfied with the airlines (overall ratings ≥ 4)? Define appropriate measures of traveller satisfaction and compare the performance of different classifiers to predict the traveller satisfaction with airlines.
SkytraxAI wants you to use RapidMiner to process and explore the provided data, and then develop and evaluate classifiers to predict the traveller satisfaction, and to minimise misclassifications. The data set is available on Cloud Deakin site, named MIS771 A1 data.zip. you will need to unzip the file before importing into RapidMiner.

Task and Deliverables:
• Executive Summary: Define your problem and solution in business terms, in doing so answer questions A, B, C and D, cross-reference with other report sections for support.
• Data Exploration, Pattern Discovery, and Preparation: Visualise the selected attribute characteristics. Use the visualisations to support answering questions A, B and C.
Deal with duplicates or missing values to prepare data for predictive modelling. Transform attributes or create new ones as needed. Use appropriate analysis and data visualisation to investigate relationships between attributes (predictors and label). Interpret results.
• Predictive Modelling: Create and explain two classification models, e.g., k-NN and Decision Tree, to address part of question D. Explain and justify your model's properties. Investigate and deal with any class imbalance.
• Model evaluation and improvement: Use hold-out and cross-validation of the model. Utilize honest testing. Compare the performance of different models and select the best. Qualify how much we can trust the answer to question D.

Attachment:- Predictive Analytics.rar

Reference no: EM132983038

Questions Cloud

Identify two different career strategies unemployed adult : a) Identify two different career strategies an unemployed adult can use to manage through the turbulence of uncertain times at the workplace
Advantages and disadvantages of leaving at the time : When it is most appropriate for employees who are laid off to be expected to leave?
Technology policies for the development of country : How could use ancient philosophy thoughts to design science and technology policies for the development of own country?
Phrases mean to a health care organization : Have you ever heard the phrases "sales are up" or "profits are up"? Think about what these phrases mean to a health care organization.
Understand and apply key statistical theories : Understand and apply key statistical theories and data mining concepts - Define your problem and solution in business terms
Understand about the problem of e-waste : Explain what you understand about the problem of e-waste and its many causes, and then propose at least one solution for this issue.
What is the employee benefit expense for the current year : The expected rate of return on plan assets was 10%, the weighted average rate of return for plan assets was 9.5%, What is the employee benefit expense
Explain the five models of corporate entrepreneurship : As a 21st Century Entrepreneurial Leader, explain the five models of corporate entrepreneurship and why they are important to understand and comprehend.
What is the accounts payable balance as at March : Please complete a cash disbursement budget (for direct materials) for the first quarter of the year. What is the accounts payable balance as at March 31st

Reviews

Write a Review

Management Information Sys Questions & Answers

  Information technology and the changing fabric

Illustrations of concepts from organizational structure, organizational power and politics and organizational culture.

  Case study: software-as-a-service goes mainstream

Explain the questions based on case study. case study - salesforce.com: software-as-a-service goes mainstream

  Research proposal on cloud computing

The usage and influence of outsourcing and cloud computing on Management Information Systems is the proposed topic of the research project.

  Host an e-commerce site for a small start-up company

This paper will help develop internet skills in commercial services for hosting an e-commerce site for a small start-up company.

  How are internet technologies affecting the structure

How are Internet technologies affecting the structure and work roles of modern organizations?

  Segregation of duties in the personal computing environment

Why is inadequate segregation of duties a problem in the personal computing environment?

  Social media strategy implementation and evaluation

Social media strategy implementation and evaluation

  Problems in the personal computing environment

What is the basic purpose behind segregation of duties a problem in the personal computing environment?

  Role of it/is in an organisation

Prepare a presentation on Information Systems and Organizational changes

  Perky pies

Information systems to adequately manage supply both up and down stream.

  Mark the equilibrium price and quantity

The demand schedule for computer chips.

  Visit and analyze the company-specific web-site

Visit and analyze the Company-specific web-site with respect to E-Commerce issues

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd