Reference no: EM132639459
MITS5512 Methods of Data - Victorian Institute of Technology
Assessment 1: Case Study
Task
Suppose you are working for a company/organisation. Your manager gives you a data and ask you what you can do with the data in terms of adding some values to the company goals and future operational research plans.
Three tasks you need to do: 1) choose a proper data related to a business case study, 2) choose a proper software to open the data and do data visualisation and exploratory analytics and 3) write a clear and accurate report and put all findings in the report.
Your report should have 1200-1500 words addressing the following: information on the data, type of features, literature review on the data and methodology you are going to apply, what you are going to solve and how, plots and recommendations. The report should have some plots (4-6 screenshots) from your findings with explanations.
1. Choose a data
Choose a data from Kaggle website, or a government open source data. The data should be related to a business case study, such as house marketing, climate change, patients records and banking data. You need to add information on data in your report, including reference where you downloaded the data, information of data type and features.
2. Visualisation and Exploratory Analysis
Select any data science tools to open the data. Look at the data and find out how you can improve quality of the data. You must provide some data visualisation using selected software. Do an exploratory data analysis on the data that you have gathered. Exploratory data analysis is an approach for analysing data sets to summarize their main characteristics, often with visual methods. These analytics should be in your report.
Assessment 2: Research Report
Introduction
This assessment item relates to the unit learning outcomes as in the unit descriptor. This assessment is designed to improve student research skills and to give students experience in researching a topic and writing a report relevant to the Unit of Study subject matter.
Task
For this component you will write a report or critique on a recent academic paper related to Data Science or Data Science Methodologies. Some possible topic areas include but are not limited to:
• Supervised Learning
• Unsupervised Learning
• Semi-Supervised Learning
• Anomaly Detection
• Association Analysis
• Regression Analysis
• Classification Analysis
• Pattern Recognition
• Feature Selection - (aka Dimensionality Reduction)
• Ensemble Methods
• Neural Nets and Deep Learning
• Transfer Learning
• Reinforcement Learning
• Natural Language Processing
• Applications of Data Science
The paper you select must be directly relevant to one of the above topics or another topic and be related to Data Science. The paper must be approved by your lecturer and be related to what we are studying this semester in Data Science Course. The paper can be from any academic conference or other relevant Journal or online sources such as Google Scholar, or Academic department repositories. All students must select a different paper. Thus, the paper must be approved by your lecturer before proceeding. In case two students are wanting to present on the same paper, the first who emails the lecturer with their choice will be allocated that paper. Please note that popular magazine or web-site articles are not academic papers. The paper you chose should be published in the last 5 years.
The report should be limited to approx. 1500 words (not including references). Use 1.5 spacing with a 12-point Times New Roman font. Though your paper will largely be based on the chosen article, you should use other sources to support your discussion or the chosen papers premises. Citation of sources is mandatory and must be in the IEEE style.
Assessment 3: Major Assignment
Introduction
This assessment item relates to the unit learning outcomes as in the unit descriptor. This assessment is designed to improve student analytic skills and to give students experience in problem solving, decision-making and presentation skills in data science methods and tools.
Task
For this assignment, you are required to work in a group of maximum 4 students and two files are required to be uploaded in the Moodle (provided links) by one of the group members. The first file is a report containing maximum 2000 words with 6-10 screenshots from your findings. The second file is a presentation file of your analytics and findings. Note that both files need to be uploaded by only one of the group members.
1. Data
The required dataset is available in the Moodle. You need to add a section in your report and talk about the data and challenges there. What kind of issues available there? What are the features? Add some information on the data in this section of your report.
2. Data Analytics & Visualization
Identify what kind of data it is and what you can do with this data if this data passes to you by your company. To apply such method, you need to explore the data and apply data processing, such as data cleaning and feature engineering, if it is required. Then choose a proper data science method/s to analysis the data. Suppose this is a company data that you are working for them. What are the issues available there and what you can recommend for your manager in company to enhance their objectives and to the benefits of the company?
3. Report (Weightage 20%)
Your report should have 1500-2000 words, excluding references, addressing the business questions, challenges, analytics, recommendation and visualisation related to the data. It should cover what are the issues in the data, you are going to solve and how, plots and recommendations. The report should have important plots (6-10 screenshots) from your findings. Note that plots need to be labelled and explained inside the report.
All coding, including data uploading, cleaning, analytics and visualisation should be coded in Python. The python code should be included at the end of your report in a section called Appendix.
Note: Structure and font of your report should follow the word file template provided in the Moodle. Your report should be a single word or pdf document containing your report and need to be submitted through Moodle. One submission per group and make sure all group members participate and add their names in the report. Your report should have a contribution table at the end of the report.
4. Presentation
The presentation should be a maximum of 10 minutes for the whole team. Each member must participate in video presentation file and talk at least 2 minutes in the video related to the methodology used, findings, contribution or recommendation.
Note: Need only Assessment 2: Research Report
Attachment:- Methods of Data.rar