DALT7002 Data Science Foundations Assignment

Assignment Help PL-SQL Programming
Reference no: EM132495631

DALT7002 Data Science Foundations - Oxford Brookes University

Learning Outcome 1. Demonstrate the ability to identify and integrate data of various types from traditional and alternative sources, and make informed judgements about their use in data science research
Learning Outcome 2. Critically evaluate the methodologies applied in data collection, data processing, data analysis & dissemination of research findings
Learning Outcome 3. Critically assess methods and data strengths and limitations combined to application of R and/or Python

Introduction

In this coursework you will prepare a data model that combines a range of data sets. We are primarily interested in the processes you take to achieve your data model, though you will need to produce a final data set and model.

Scenario

Oxford Brookes University would like to offer a new service to staff to encourage the brightest and best staff to join us, and in recognition of the fact that Oxford itself, can be a very expensive place to live.

This new service is a town advice service that recommends towns in Oxfordshire based on a certain key characteristics, these being:

• House prices
• Broadband speed
• Crime in the area over the last month

They would also like to consider other factors such as:

• Nearby rights of way
• Distance from Oxford vs size of the road
• Availability of Allotments

There may be other factors. So you should also gather more information from a member of Oxford Brookes academic staff to find about any other key issues that might affect a person's choice of location.

Tasks

You must use datasets that are published on by the UK government, either centrally or through a public body that would be available to a member of the UK public. You should prepare a brief questionnaire about the knowledge acquisition and send it to a domain expert (in this case Dr. Younas) to gain an insight into any other data sources you may wish to query. Dr. Younas's email


Using this information, you should produce a unified data set and model that could be used to drive a recommendation system, documenting and explaining all the processes that you undertake to achieve this data set and model. You must ensure that -

• All data used is normalised to at least 3NF
• You must use the MySQL server on SOTS to store the data or another MySQL server. You should include your tables as part of the report
• Your model must use the three key characteristics
• Your model may use the additional characteristic(s) suggested above or that arise from the knowledge acquisition session
• The combined data set must be stored in a MySQL server
• You should demonstrate that you can query the data set in R
• You should have a simple recommendation system, written in R, that allows the user to specify a value in the range 0-10, for each of the three key characteristics and then produces a score for a town and displays the top 3 towns in order
• The towns used are in Oxfordshire.
• You may restrict the number of towns you look at to main towns, but you must justify your selection in your report

You should produce a report detailing

• The stages you took to identify, obtain, clean, and use the data sets associated with the three key characteristics
• The stages you took to identify, obtain, clean and use any additional data sets that you needed to either combine or fully utilise the three key characteristics
• A justification of the approaches used in identifying, cleaning, and using the datasets
• How you might obtain, clean and use any one data set associated with the optional characteristics (Note: you do not have to do the actual work, just say what the issues are with this type of data and how you might incorporate it into your system)
• The results of your knowledge acquisition questionnaire with your domain expert
• How you might obtain, clean and use any one additional data set based on your knowledge acquisition questionnaire (Note: you do not have to do the actual work, just say what the issues are with this type of data and how you might incorporate it into your system)
• A discussion of any legal or ethical issues with the proposed system and the data used
• An overview/design of your R code
• Your R code
• Names and descriptions of the MySQL database tables
• Testing of your system

Attachment:- Data Science Foundations.rar

Reference no: EM132495631

Questions Cloud

Lower average mileage than mark b : A gives a lower average mileage than mark B? Find the value of p, interpret the result. What assumptions should you take to work on this problem?
What amount of working capital is currently maintained : Your preference is to have a quick ratio of at least 0.80 and a current ratio of at least 2.00. How do the existing ratios compare with your criteria?
Discuss Slow Food Movement in relation to food consumption : Critically discuss the Slow Food Movement in relation to food consumption in the 21st century, integrating and giving examples of the concepts of provenance
Normal variable with mean : Given that x is a normal variable with mean µ = 44 and standard deviation s = 6.7, find the following probabilities
DALT7002 Data Science Foundations Assignment : DALT7002 Data Science Foundations Assignment help and solution, Oxford Brookes University - assessment writing service - Demonstrate the ability to identify
Prepare separate entries for each transaction for lima : Prepare separate entries for each transaction for Lima. The merchandise purchased by Maw on June 10 cost Lima $3000 and the goods returned cost.
Customer buying an air conditioner : A heating and cooling company advertises that any customer buying an air conditioner during the first 16 days of July will receive a 25 percent discount
Prepare separate entries for transaction on books of maw co : Prepare separate entries for each transaction on the books of Maw Co. On June 10, Maw Co. purchased $6000 of merchandise from Lima Co
What would be the net book value on january : An estimated residual value of $1,200. The company uses double-declining-balance depreciation. The net book value on January 1, 2021 would be

Reviews

Write a Review

PL-SQL Programming Questions & Answers

  Create simple reports in an oracle database

laboratory provides practice in the use of SQL commands to create simple reports in an Oracle database

  Describe conceptually how an sql retrieval query

Question 1: Describe conceptually how an SQL retrieval query will be executed by specifying the conceptual order of executing each of the six clauses?

  Write single query that retrieves information for management

If a customer has no rentals, or did not rent any movies multiple times, management does not want to see them in the list. Write a single query that retrieves this information for management.

  Write a select statement that joins the customers table

Write a SELECT statement that joins the Customers table to the Addresses table and returns these columns: FirstName, LastName, Line1, City, State, ZipCode.

  Create a calculator application.

You must center the form on the screen.

  What happens when a new account is opened

What happens when a new account is opened? Write SQL statement(s) to add data to the tables for a new account. (Go ahead, give yourself a million dollars!)

  Assignment on aggregate functions

After reviewing and completing the Unit 1 Guided Practice 2, I suggest that you review all tables using the Object Browser area of the SQL Workshop associated with the scenarios below, as well as field data types and data (case sensitivity) before..

  Retrieve the title of the course along with the number di

Retrieve the title of the course along with the number DI students who registered in this course in order of the student registration number.

  Analyze how sql differs from a programming language

In addition to this, analyze how SQL differs from a programming language with which you are familiar. Explain your general opinions of SQL thus far, and classify it as easy or not, and as useful or not.

  Find the most classes taken by students

Find the students by student ID who have taken the most class (counted by enrolled students) and not counting where an ‘F' was the grade - Find the facility by facility id who teach math class.

  Determine resonant frequency in series rlc resonant circuit

Given the series RLC resonant circuit in the figure, operating at variable frequency, determine: The resonant frequency ω o ,  The circuit’s quality factor Q , The cut-off frequencies, f 1  & f 2  and the bandwidth BW

  Important considerations when selecting network hardware

As a network administrator, what do you think are the most important considerations when selecting network hardware? Why?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd