Calculate the number of work hours per function

Reference no: EM132212564

Assignment:

Comparing Software Development Workloads

Estimating the cost of developing software in terms of work load is difficult since it is a challenge to quantify the size and complexity of a software system.

The article Analysis of Size Metrics and Effort Performance Criterion in Software Cost Estimation provides an overview of different metrics used to assess size and complexity (Malathi & Sridhar, 2012).

The metrics include counts of lines of code, function point counts, and operation counts. Function point counts are often utilized because they can be estimated based on project design specifications. The dataset pointworkload.csv contains data collected from 104 programming projects at AT&T between 1986 and 1991 (Matson & Huguenard, 2005).

This dataset includes the number of work hours for each project, the function point count for each project, and identifiers for the operating system, data management system, and programming language utilized. In this application, you will investigate whether operating system, data management system, and programming language impact the number of work hours per function point for a project. Open the dataset pointworkload.csv in Excel.

Create a new column that calculates the number of work hours per function point for each project. Save the file with this new data column. Next, you will want to look at the distribution of work hours per function point in a frequency diagram. Doing so in Excel requires either binning and counting the data yourself or installing the Data Analysis Toolpak Add-On.

However, even with the add-on, simply getting a histogram requires multiple steps. Excel is designed for data presentation, not for significant statistical analysis. It is capable of statistical analysis, but only with add-ons, macros, or programming. Instead of taking these steps, you will now switch to a software tool designed for statistical analysis, SPSS.
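For readers who prefer a script to a spreadsheet, the derived column and the manual binning described above can be sketched in Python. This is only an illustration under stated assumptions: the column names WorkHours and FunctionPoints and the five data rows are made up, because the actual header and contents of pointworkload.csv are not shown here.

```python
import csv
import io
import math
from collections import Counter

# Made-up rows standing in for pointworkload.csv (hypothetical column names).
SAMPLE_CSV = """WorkHours,FunctionPoints
1200,100
450,30
900,60
2000,80
300,50
"""

def hours_per_function_point(rows, hours_col="WorkHours", fp_col="FunctionPoints"):
    """Return work hours divided by function points for each project row."""
    return [float(r[hours_col]) / float(r[fp_col]) for r in rows]

def bin_counts(values, width):
    """Count values in bins [k*width, (k+1)*width), keyed by lower bin edge."""
    counts = Counter(math.floor(v / width) * width for v in values)
    return dict(sorted(counts.items()))

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
ratios = hours_per_function_point(rows)
for lower_edge, count in bin_counts(ratios, 10).items():
    print(f"{lower_edge}-{lower_edge + 10}: {'*' * count}")
```

To use a real file, the io.StringIO(SAMPLE_CSV) argument would be replaced by an open file handle, and the two column names by whatever the file's header row actually contains.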

Go to the Resources section for Unit 4, and download the document IBM_SPSS_Installation_and_Registration_Instructions. This will guide you through the process of installing the statistical analysis platform SPSS which you will utilize for the remainder of this assignment.

Import the file you revised in Excel, which now includes work hours per function point, into SPSS (be sure to indicate that variable names are included at the top of your file), and take a screenshot showing your successful installation and import.

This screenshot should be pasted into your overall document. In the top toolbar, select Analyze, Descriptive Statistics, Frequencies. Put the work hours per function point variable you created in the Variable(s) column.

Click Charts and select Histogram. Then, click Continue and OK. SPSS will now run the requested analysis. In the Output, scroll down to the histogram and copy-paste it into your overall document. Describe the distribution of the data.
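The numbers needed to describe the distribution (mean, standard deviation, possible outliers) can be cross-checked outside SPSS. The sketch below uses Python's standard statistics module and the common 1.5 x IQR rule for flagging outliers; that rule is an assumption chosen for illustration, not something the assignment prescribes, and the ten values are invented. Quartile definitions differ between tools, so the fences may not match SPSS exactly.

```python
import statistics

def describe(values):
    """Return mean, sample SD, and values outside the 1.5 x IQR fences."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample SD, n - 1 in the denominator
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    outliers = [v for v in values if v < low_fence or v > high_fence]
    return {"mean": mean, "sd": sd, "outliers": outliers}

# Invented work-hours-per-function-point values, for illustration only.
sample = [6.0, 10.0, 12.0, 14.0, 15.0, 15.0, 16.0, 18.0, 20.0, 80.0]
summary = describe(sample)
print(summary)
```

A single large value pulls the mean well above the bulk of the data, which is one reason to check for skew and outliers before relying on tests that assume approximate normality.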

Does it appear to be normally distributed? What are the average and standard deviation? Are there any outliers? Now, you are ready to determine whether operating system, data management system, or language impact the work hours per function point. To do this, you will utilize two different statistical tools.

These are the t-test for the difference in means between two independent samples and the analysis of variance. There are two different operating systems utilized. A 0 indicates UNIX, and a 1 indicates MVS. The t-test will allow you to assess the null hypothesis that the two operating systems give the same average work load per function point.

Select Analyze, Compare Means, Independent-Samples T-Test. Your test variable is work hours per function point. Your grouping variable is OS. You will need to click Define Groups and make Group 1 = 0 (UNIX) and Group 2 = 1 (MVS). With these defined, click Continue and OK to get both the group statistics and the t-test results.

Use the group statistics to calculate the t-value. Show all of your work for the calculation. For α = 0.05, what is the p-value for the hypothesis? Based on this result, draw a conclusion as to whether or not the different operating systems result in a significant difference in work load per function point.
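The hand calculation of the t-value from group statistics can be checked with a short script. The sketch below computes both the pooled-variance (equal variances assumed) statistic and the Welch (equal variances not assumed) statistic from each group's mean, standard deviation, and count. The function names and the example numbers are invented for illustration; they are not taken from the dataset.

```python
import math

def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """t statistic and df for two independent samples, equal variances assumed."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df
    std_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / std_error, df

def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """t statistic and approximate df when equal variances are not assumed."""
    a = sd1**2 / n1
    b = sd2**2 / n2
    t = (mean1 - mean2) / math.sqrt(a + b)
    df = (a + b) ** 2 / (a**2 / (n1 - 1) + b**2 / (n2 - 1))
    return t, df

# Invented group statistics: mean, SD, n for group 1 and group 2.
t_value, dof = pooled_t(12.0, 2.0, 10, 10.0, 2.0, 10)
print(f"pooled t = {t_value:.3f}, df = {dof}")
```

The p-value is then the two-tailed tail area of the t distribution with the computed degrees of freedom, and the null hypothesis of equal means is rejected when that p-value falls below α = 0.05.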

By examining the t-test results from the previous question, you can see that both the t-statistic and the p-value are calculated there. You will be running several tests to determine if programming language impacts work load per function point, and you should draw your data from these charts rather than calculating by hand.

Go back to your Independent-Samples T-Test and change the Grouping Variable to Language. Define the groups as 1 (Cobol) and 2 (PLI). Copy the t-test results to your overall document. Repeat this process for groups 1 (Cobol) and 3 (C), groups 1 (Cobol) and 4 (Other), groups 2 (PLI) and 3 (C), groups 2 (PLI) and 4 (Other), and groups 3 (C) and 4 (Other). Copy all six t-test results to your overall document. Based on these results, draw a conclusion as to whether or not the different programming languages result in a significant difference in work load per function point.
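The six pairings listed above are simply every way of choosing two of the four language groups. A small sketch, using the language codes given in the assignment, enumerates them together with the null hypothesis each t-test addresses; the function name is made up for illustration.

```python
from itertools import combinations

# Language codes as given in the assignment.
LANGUAGES = {1: "Cobol", 2: "PLI", 3: "C", 4: "Other"}

def language_pairs(codes=LANGUAGES):
    """Return every unordered pair of language codes."""
    return list(combinations(sorted(codes), 2))

for first, second in language_pairs():
    print(
        f"Groups {first} ({LANGUAGES[first]}) and {second} ({LANGUAGES[second]}): "
        "H0 is that the mean work hours per function point are equal"
    )
```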

Be sure to state the different null hypotheses considered and which are rejected and which are not rejected at α = 0.05. Running six different t-tests certainly answers the question of whether or not programming language affects work load per function point, but it is relatively time-consuming to run and assess each of these results separately. Analysis of variance (ANOVA) allows this multiple-group comparison in a single test. Go to Analyze, Compare Means, One-Way ANOVA.

Select work hours per function point as your dependent variable and Language as the factor, then click OK. Copy the ANOVA table to your overall document. Explain what the ANOVA table tells you and what conclusions can be drawn. ANOVA has the downside that it only tells you whether some group is significantly different from some other group; it does not identify those groups.
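To see where the entries of a one-way ANOVA table come from, the sums of squares, degrees of freedom, mean squares, and F ratio can be computed directly. The following is a minimal sketch with three tiny invented groups, not the assignment's data; the function name and dictionary keys are chosen here for illustration.

```python
import statistics

def one_way_anova(groups):
    """Return the between/within sums of squares, df, mean squares, and F."""
    all_values = [v for group in groups for v in group]
    grand_mean = statistics.mean(all_values)
    group_means = [statistics.mean(group) for group in groups]

    # Between-groups: how far each group mean sits from the grand mean.
    ss_between = sum(
        len(group) * (m - grand_mean) ** 2 for group, m in zip(groups, group_means)
    )
    # Within-groups: spread of each observation around its own group mean.
    ss_within = sum(
        (v - m) ** 2 for group, m in zip(groups, group_means) for v in group
    )
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "ss_between": ss_between, "df_between": df_between,
        "ss_within": ss_within, "df_within": df_within,
        "F": ms_between / ms_within,
    }

# Invented observations for three groups, for illustration only.
table = one_way_anova([[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]])
print(table)
```

A large F means the group means differ by more than the within-group scatter would explain, but, as noted above, F alone does not say which groups differ.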

You can obtain that information by adding a post hoc test to compare means. Go back to the One-Way ANOVA and click on Post Hoc. You will see numerous options. These are all different methods for comparing the groups.

Each approaches the comparison differently. You will utilize the Tukey comparison here. Select Tukey, then click Continue and OK. You will see both a comparison table and a table of homogeneous subsets. From this data you should be able to conclude that there is a significant difference between 1 (Cobol) and 2 (PLI).

Copy these charts to your overall document and explain how that conclusion may be drawn. How does this compare to your t-test conclusions? Utilize t-test and/or ANOVA to determine the impact of database management system on work load per function point.

The values are 1 (IDMS), 2 (IMS), 3 (INFORMIX), 4 (INGRESS), and 5 (Other). You should present your data, draw conclusions, and explain those conclusions.


Descriptive statistics uses the data to provide descriptions of the population, either through numerical calculations or through graphs or tables. Inferential statistics makes inferences and predictions about a population based on a sample of data taken from the population in question.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd