Evaluate and apply key steps and issues involved in data

Reference no: EM132487568

Learning Outcome 1: Critically analyse and evaluate various statistical and computational techniques for analysing datasets and determine the most appropriate technique for a business problem;

Learning Outcome 2: Critically evaluate, develop and implement solutions for processing datasets and solving complex problems in various environments using relevant programming paradigms;

Learning Outcome 3: Evaluate and apply key steps and issues involved in data preparation, cleaning, exploring, creating, optimizing and evaluating models;

Learning Outcome 4: Evaluate and apply aspects of data science applications and their use.

Assessment Requirements

This assignment will use employment data of Wales from the StatsWales data source. This dataset provides workplace employment estimates, or estimates of total jobs, for Wales and its NUTS2 areas, along with comparable UK data disaggregated by industry section.

For this assignment, students will use data analysis and machine learning to reveal the workplace employment landscape of Wales.

Part 1. Data processing
1.1. Download the dataset for the period 2009 - 2018 and create a dataframe that concatenates the Wales (total) employment values only.
1.2. Check for any null values or outliers. If any are found, replace them with the mean value.
1.3. Change the names of the industries as below.
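
Steps 1.1 to 1.3 could be sketched as follows. This is a minimal illustration, not the expected solution: it assumes each year's StatsWales extract has already been loaded into a DataFrame with "Industry", "Wales" and "UK" columns, and every figure in it is made up. The 1.5 x IQR outlier rule, the choice of the per-industry mean as the replacement value, and the short names in RENAME_MAP are all assumptions; the brief's own rename list is not reproduced here.

```python
import numpy as np
import pandas as pd

# Toy stand-in for the yearly StatsWales extracts; every figure is made up.
# Assumed layout: one frame per year with "Industry", "Wales" and "UK" columns.
industries = ["Agriculture, forestry and fishing", "Construction",
              "Information and communication"]
wales_values = {
    2009: [100.0, 50.0, 20.0],
    2010: [102.0, 51.0, 21.0],
    2011: [np.nan, 49.0, 22.0],   # a missing value
    2012: [101.0, 500.0, 23.0],   # 500 is an injected outlier
    2013: [99.0, 50.0, 24.0],
}
yearly = {
    year: pd.DataFrame({"Industry": industries, "Wales": vals,
                        "UK": [v * 20 for v in vals]})
    for year, vals in wales_values.items()
}


def build_wales_frame(frames):
    """Step 1.1: keep only the Wales column of each year and concatenate."""
    columns = [
        frame.set_index("Industry")["Wales"].rename(year)
        for year, frame in frames.items()
    ]
    return pd.concat(columns, axis=1)


def replace_nulls_and_outliers(wide, k=1.5):
    """Step 1.2: replace NaNs and IQR outliers with the industry's mean."""
    cleaned = wide.astype(float)
    q1 = cleaned.quantile(0.25, axis=1)
    q3 = cleaned.quantile(0.75, axis=1)
    iqr = q3 - q1
    is_outlier = (cleaned.lt(q1 - k * iqr, axis=0)
                  | cleaned.gt(q3 + k * iqr, axis=0))
    cleaned = cleaned.mask(is_outlier)      # outliers become NaN
    row_means = cleaned.mean(axis=1)        # mean of the remaining values
    return cleaned.apply(lambda row: row.fillna(row_means[row.name]), axis=1)


# Step 1.3: hypothetical short names; the brief's own list is not shown here.
RENAME_MAP = {
    "Agriculture, forestry and fishing": "Agriculture",
    "Information and communication": "ICT",
}

wide = build_wales_frame(yearly)
cleaned = replace_nulls_and_outliers(wide)
renamed = cleaned.rename(index=RENAME_MAP)
print(renamed)
```

The result is one row per industry and one column per year, which is the layout the later sketches assume.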

Part 2. Data analysis
For each question, provide a graph or chart along with your own interpretation (~50 words).
2.1. Which industries employed the most and the fewest workers over the period?
2.2. Which industries had the highest and lowest overall growth over the period?
2.3. Which years were the best and worst performing in terms of employment (highest and lowest employment)?
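
One way to compute the quantities behind questions 2.1 to 2.3 is sketched here. It assumes a cleaned frame with one row per industry and one column per year; the figures and industry names are toy values. Two interpretive assumptions are made: "employed highest and lowest" is read as the mean number of jobs over the period, and "overall growth" as the percentage change from the first year to the last.

```python
import pandas as pd

# Toy frame (made-up numbers): rows are industries, columns are years.
employment = pd.DataFrame(
    {2009: [30.0, 90.0, 25.0], 2010: [31.0, 88.0, 27.0], 2011: [29.0, 92.0, 30.0]},
    index=["Agriculture", "Construction", "ICT"],
)


def summarise(wide):
    """Return the headline answers for questions 2.1, 2.2 and 2.3."""
    mean_jobs = wide.mean(axis=1)                        # 2.1: average per industry
    first, last = wide.columns[0], wide.columns[-1]
    growth_pct = (wide[last] - wide[first]) / wide[first] * 100   # 2.2
    year_totals = wide.sum(axis=0)                       # 2.3: total jobs per year
    return {
        "highest_employer": mean_jobs.idxmax(),
        "lowest_employer": mean_jobs.idxmin(),
        "highest_growth": growth_pct.idxmax(),
        "lowest_growth": growth_pct.idxmin(),
        "best_year": year_totals.idxmax(),
        "worst_year": year_totals.idxmin(),
    }


summary = summarise(employment)
for question, answer in summary.items():
    print(f"{question}: {answer}")
```

The intermediate Series (mean_jobs, growth_pct, year_totals) are what the required charts would be drawn from, for example as bar charts.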

Part 3. Visual analysis
Create a dynamic scatter/bubble plot showing the change in workforce numbers over the period using Plotly Express.
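
A possible shape for this plot is sketched below, assuming the same industries-by-years frame of toy values. Plotly Express expects long-form data, so the frame is melted first; the column names "Industry", "Year" and "Employment" and the helper names are illustrative choices. The Plotly import is deferred into the plotting function only so that the reshaping step can run on its own.

```python
import pandas as pd

# Toy frame (made-up numbers): rows are industries, columns are years.
employment = pd.DataFrame(
    {2009: [30.0, 90.0, 25.0], 2010: [31.0, 88.0, 27.0], 2011: [29.0, 92.0, 30.0]},
    index=["Agriculture", "Construction", "ICT"],
)


def to_long(wide):
    """Reshape to one row per (industry, year) pair."""
    return (
        wide.rename_axis("Industry")
        .reset_index()
        .melt(id_vars="Industry", var_name="Year", value_name="Employment")
    )


def animated_bubble(long_form):
    """Build a bubble chart with one animation frame per year."""
    import plotly.express as px  # deferred so the reshaping runs without Plotly

    return px.scatter(
        long_form,
        x="Industry",
        y="Employment",
        size="Employment",          # bubble area follows the job count
        color="Industry",
        animation_frame="Year",     # adds the year slider and play button
        range_y=[0, long_form["Employment"].max() * 1.1],
        title="Workplace employment by industry (toy data)",
    )


long_form = to_long(employment)
print(long_form.head())
```

In a notebook, `animated_bubble(long_form).show()` would render the figure. Fixing `range_y` keeps the axis stable between frames so that changes in bubble height are comparable across years.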

Part 4. Correlation
4.1. Taking the average employment number for each industry over the period, show and identify the most and least correlated industries.
4.2. Compute a year-wise correlation for each industry. Are the aforementioned industries also correlated in each year? Explain your answer.
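
The correlation questions leave some room for interpretation, so the following is only one hedged reading. It assumes an industries-by-years frame of toy values, computes Pearson correlations between the industries' yearly series (a single average per industry cannot be correlated pairwise), and picks out the most and least correlated pair. For 4.2 it correlates year-on-year changes instead of levels, which is just one possible way to look at co-movement within each year step.

```python
import numpy as np
import pandas as pd

# Toy frame (made-up numbers): rows are industries, columns are years.
employment = pd.DataFrame(
    {2009: [10.0, 20.0, 30.0], 2010: [12.0, 23.0, 28.0],
     2011: [13.0, 29.0, 25.0], 2012: [16.0, 31.0, 20.0]},
    index=["Agriculture", "Construction", "ICT"],
)


def industry_correlation(wide):
    """Pearson correlation between industries across the years."""
    return wide.T.corr()


def extreme_pairs(corr):
    """Return the (row, column) labels of the highest and lowest correlation."""
    # Keep the upper triangle only, so each pair appears once and the
    # diagonal of 1.0 values is ignored.
    upper = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    pairs = corr.where(upper).stack().dropna()
    return pairs.idxmax(), pairs.idxmin()


corr = industry_correlation(employment)
highest_pair, lowest_pair = extreme_pairs(corr)

# One possible reading of 4.2: correlate year-on-year changes, not levels.
change_corr = employment.T.diff().dropna().corr()

print(corr.round(3))
print("highest:", highest_pair, "lowest:", lowest_pair)
print(change_corr.round(3))
```

Comparing `corr` with `change_corr` for the same pair shows whether two industries that trend together over the whole period also move together from one year to the next.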

Part 5. Clustering (k-means & hierarchical)
5.1. Using the employment data from the best and worst performing years (2.3), undertake a k-means clustering analysis (K=2 and K=3) and identify which industries cluster together. Write your own interpretation (~100 words).
5.2. Using the same dataset (best and worst performing years), create a hierarchical clustering. Compare its clusters with the k-means clusters.
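
Both clustering steps could be sketched as follows, assuming the best and worst years have already been identified and their two columns extracted. The industry labels and figures are toy values. Ward linkage, the use of scikit-learn and SciPy, and the adjusted Rand index as the comparison measure are choices made for this illustration; the brief does not prescribe them.

```python
import pandas as pd
from scipy.cluster.hierarchy import fcluster, linkage
from sklearn.cluster import KMeans
from sklearn.metrics import adjusted_rand_score

# Toy features (made-up numbers): employment in the best and worst year.
features = pd.DataFrame(
    {"best_year": [10.0, 12.0, 11.0, 100.0, 105.0, 98.0],
     "worst_year": [9.0, 11.0, 10.0, 95.0, 98.0, 92.0]},
    index=["Ind A", "Ind B", "Ind C", "Ind D", "Ind E", "Ind F"],
)


def kmeans_labels(frame, k):
    """Step 5.1: k-means cluster label for each industry."""
    model = KMeans(n_clusters=k, n_init=10, random_state=0)
    return model.fit_predict(frame.to_numpy())


def hierarchical_labels(frame, k):
    """Step 5.2: cut a Ward-linkage tree into k flat clusters."""
    tree = linkage(frame.to_numpy(), method="ward")
    return fcluster(tree, t=k, criterion="maxclust")


clusters = pd.DataFrame(
    {
        "kmeans_2": kmeans_labels(features, 2),
        "kmeans_3": kmeans_labels(features, 3),
        "hier_2": hierarchical_labels(features, 2),
    },
    index=features.index,
)

# 1.0 means the two methods produce the same grouping (label names may differ).
agreement = adjusted_rand_score(clusters["kmeans_2"], clusters["hier_2"])
print(clusters)
print("k-means vs hierarchical agreement (K=2):", agreement)
```

The cluster numbers themselves are arbitrary, so the comparison should look at which industries share a label rather than at the label values. If industries differ greatly in size, standardising the two columns before clustering may be worth considering.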

Attachment:- Python assesment.rar

Reviews

len2487568

4/7/2020 3:10:40 AM

Please find the attachment for the assignment brief. The instructions are below.
1. I would like a Jupyter notebook that includes all the text and code.
2. Once you are okay with the requirement, I will send some documents in which the tutor taught me some coding. You should build on the code from his tutorials; nothing out-of-the-box or high-level will be accepted.
3. I will also send a video recording that gives you a glance at how to work on the requirement.
5. The deadline for this is.
6. Once you are done with the requirement, you should be able to connect with me to explain the code you have written.
If you have any questions or need clarity, don't hesitate to reach out to me.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd