Describe the proportion of phone apps which are free

Reference no: EM132356350

Statistics and Data Analysis Assignment

OVERVIEW OF THE ASSIGNMENT - This assignment will test your ability to collect, summarise and present data using Microsoft Excel and/or other approved tools. It will also test your ability to interpret the output produced by the software to solve business problems.

You will need to use the dataset provided, collect your own dataset, and produce numerical and graphical summaries. You will need to submit an Excel file that follows the requirements explained below.

TASK DESCRIPTION - There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset specifically allocated to you. This dataset is adapted from the Google Play Store Apps dataset provided by Lavanya Gupta, which can be obtained from Kaggle under the Creative Commons Attribution 3.0 Unported License. The number of cases has been reduced from the original dataset and all NaN values have been removed.

Dataset 2: You will need to collect a dataset via a survey to answer the question given in Section 6 below. You will need to collect data from international students from 3 to 4 different countries of origin, with at least 5 students per country.

Both datasets should be saved in an Excel file (see Submission Requirement below). All data processing should be performed in Excel or StatKey. Specific instructions on which tool to use for each section will be given during tutorials.

Your tasks are to describe each dataset in Section 1, and to answer the research questions given in Sections 2 to 6 using Dataset 1 or Dataset 2 as indicated in each section.

1. Section 1: Description about Data

a. Dataset 1: Give a short but clear description of this dataset. Is this primary or secondary data? What are the cases? What are the variables and their types?

b. Dataset 2: Explain how you collected the data and discuss its limitations (e.g. whether your sample is biased). Is this primary or secondary data? What are the variables and their types?

2. Section 2: Are most Google Play apps free?

Using Dataset 1, describe the proportion of phone apps which are free. You need to provide both a numerical summary and a graphical display that clearly shows the proportion of free apps.
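
The assignment itself must be completed in Excel or StatKey, but the numerical summary for this section can be cross-checked with a few lines of Python. The following is a minimal sketch, assuming the dataset has a column that labels each app as "Free" or "Paid"; the column contents and the sample values below are hypothetical, not taken from the allocated dataset.

```python
from collections import Counter


def free_app_summary(types):
    """Return {label: (count, proportion)} for each app type."""
    counts = Counter(types)
    total = sum(counts.values())
    return {label: (n, n / total) for label, n in counts.items()}


# Hypothetical values standing in for the Free/Paid column of Dataset 1.
sample_types = ["Free", "Free", "Paid", "Free", "Paid", "Free", "Free", "Free"]

summary = free_app_summary(sample_types)
for label, (n, share) in summary.items():
    print(f"{label}: {n} apps ({share:.1%})")
```

The counts and proportions produced this way are the same figures a pie chart or bar chart of the Free/Paid column would display.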

3. Section 3: What is the price distribution of paid apps after an iteration of outlier removal?

Using Dataset 1, perform one iteration of outlier detection on the price of paid apps using the method described in the lecture notes. After removing those outliers, describe the price distribution of paid apps using both a numerical and a graphical summary that shows the remaining outliers, if any.
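
The outlier method from the lecture notes is not reproduced here. As an illustration only, the sketch below assumes the common 1.5 x IQR rule: a price is flagged if it lies more than 1.5 interquartile ranges below the first quartile or above the third quartile. The prices are hypothetical, and the `inclusive` quantile method is chosen because it should match the interpolation used by Excel's QUARTILE.INC; check both assumptions against the lecture notes before relying on the result.

```python
import statistics


def remove_outliers_once(prices):
    """Apply ONE pass of the 1.5 x IQR rule; return (kept, removed)."""
    q1, _, q3 = statistics.quantiles(prices, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    kept = [p for p in prices if low <= p <= high]
    removed = [p for p in prices if p < low or p > high]
    return kept, removed


# Hypothetical prices of paid apps (assumption, not real data).
prices = [0.99, 0.99, 1.49, 1.99, 2.49, 2.99, 3.99, 4.99, 399.99]

kept, removed = remove_outliers_once(prices)
print("removed:", removed)
print("kept:", kept)
```

Because the fences are computed only once, the remaining prices can still contain values that would be flagged if the rule were applied again to the reduced data. Those are the "remaining outliers" a box plot of the kept prices would show.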

4. Section 4: Is there a difference in prices among paid apps from the categories Communication, Games, and Tools?

Using Dataset 1, describe the price distribution of paid apps from the categories Communication, Games and Tools. You need to provide both a numerical summary and a graphical display that shows the outliers, if any.
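
A side-by-side numerical summary for the three categories can be sketched as follows. This is an illustration under stated assumptions: the category labels and prices are hypothetical (the labels in the allocated dataset may be spelled differently), and the five-number summary is one reasonable choice of numerical summary rather than a requirement of the brief.

```python
import statistics
from collections import defaultdict


def price_summary_by_category(rows, categories):
    """Group (category, price) rows and return a five-number summary each."""
    groups = defaultdict(list)
    for category, price in rows:
        if category in categories:
            groups[category].append(price)

    summary = {}
    for category, prices in groups.items():
        if len(prices) >= 2:
            q1, median, q3 = statistics.quantiles(prices, n=4, method="inclusive")
        else:
            # A single value is its own quartiles.
            q1 = median = q3 = prices[0]
        summary[category] = {
            "n": len(prices), "min": min(prices), "q1": q1,
            "median": median, "q3": q3, "max": max(prices),
        }
    return summary


# Hypothetical (category, price) rows for paid apps.
rows = [
    ("Communication", 0.99), ("Communication", 2.99), ("Communication", 4.99),
    ("Games", 0.99), ("Games", 1.99), ("Games", 2.99), ("Games", 6.99),
    ("Tools", 1.49), ("Tools", 2.49), ("Tools", 3.49),
    ("Finance", 9.99),  # ignored: not one of the three categories
]

summary = price_summary_by_category(rows, {"Communication", "Games", "Tools"})
for category, stats in summary.items():
    print(category, stats)
```

The same five numbers per category are what a set of side-by-side box plots would draw.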

5. Section 5: Is there any relationship between Rating and Review?

Using Dataset 1, describe the relationship between the rating of an app and the number of reviews it receives. You need to provide both a numerical summary and a graphical display.
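
One common numerical summary of the relationship between two quantitative variables is the Pearson correlation coefficient, which is also what Excel's CORREL function computes. The brief does not name a specific measure, so treat this as a sketch of one option; the ratings and review counts below are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical ratings and review counts for a handful of apps.
ratings = [3.9, 4.1, 4.3, 4.4, 4.6]
reviews = [120, 950, 4300, 15000, 87000]

r = pearson_r(ratings, reviews)
print(f"r = {r:.3f}")
```

A scatter plot of the two variables is the matching graphical display. Review counts are often strongly right-skewed, so the correlation should be read alongside the plot rather than on its own.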

6. Section 6: Do international students from different countries tend to use different communication apps?

Using Dataset 2, describe the relationship between a student's country of origin and the main communication app the student uses (e.g. WhatsApp, Facebook Messenger, WeChat, LINE, Viber, etc.). You need to provide both a numerical summary and a graphical display.
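
For two categorical variables, the usual numerical summary is a two-way table of counts, often with row proportions so that countries with different sample sizes can be compared. The sketch below shows the idea; the country names and survey responses are placeholders, not collected data.

```python
from collections import Counter, defaultdict


def crosstab(pairs):
    """Build a two-way table: country -> Counter of main apps."""
    table = defaultdict(Counter)
    for country, app in pairs:
        table[country][app] += 1
    return table


def row_proportions(table):
    """Convert each country's counts into proportions that sum to 1."""
    return {
        country: {app: n / sum(counts.values()) for app, n in counts.items()}
        for country, counts in table.items()
    }


# Placeholder survey responses: (country of origin, main communication app).
responses = [
    ("Country A", "WhatsApp"), ("Country A", "WhatsApp"), ("Country A", "WeChat"),
    ("Country B", "LINE"), ("Country B", "LINE"), ("Country B", "LINE"),
    ("Country B", "WhatsApp"),
]

table = crosstab(responses)
props = row_proportions(table)
for country, shares in props.items():
    print(country, {app: f"{share:.0%}" for app, share in shares.items()})
```

The row proportions map directly onto a segmented or side-by-side bar chart with one bar group per country.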





SUBMISSION REQUIREMENT - You need to submit an Excel file to Turnitin which consists of:

1. Dataset 1 and Dataset 2, each in a separate worksheet with an appropriate sheet name.

2. A numerical and graphical summary for each section. Each section should be answered in a separate worksheet with an appropriate sheet name (e.g. “Section 1”, “Section 2”, etc.).

Arrange the worksheets starting with Dataset 1, Dataset 2, Section 1, Section 2, etc.

MARKING CRITERIA - Students are advised to read the marking rubric provided on Moodle. Detailed marking criteria based on this rubric will be provided during tutorial week 6.


DEDUCTION, LATE SUBMISSION AND EXTENSION - Late submission penalty: 5% of the total available marks per calendar day unless an extension is approved. This means 0.75 marks (out of 15 marks) per day. For the extension application procedure, please refer to Section 3.3 of the Subject Outline. Please do NOT email the lecturer or tutor to seek an extension; you need to follow the procedure described in the Subject Outline.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd