Reference no: EM133775475 , Length: word count:6500
Big Data
Assessment 1: Case Study Analysis Report
Objective(s)
This assessment item relates to the unit learning outcomes as in the unit descriptor. This assessment is designed to improve student research and writing skills and to give students experience in researching literature on a specific topic relevant to the Unit of Study subject matter. Students
will be expected to complete a literature review to discuss a contemporary Big Data, access, design or Information design issue which an IS professional may experience. Students will critically analyze current academic papers then present their work in a detailed literature review and analysis.
Assignment Description
This assessment will be completed individually. All students must have a different topic. Students can choose to write about the same technology, but the approach and the thrust of each paper must be different. To ensure this uniqueness, each student must decide on a topic and email their topic and title to their tutor within the first 2 weeks. Your tutor will respond with an approval or with a message that you will either need to choose a different topic or to change the thrust of your paper. You tutor may decide to do this for you. Once it has been approved you should begin by working towards the first deliverable.
The deadline for submitting the draft is in week 4, and it will be structured into the following sections:
Abstract
Introduction
At least 5 sections which are relevant to the topic you have been allocated
Conclusion
Reference
The final submission of your paper is due in week 4
The final submission should be no less than 1500 words. Your literature review should be presenting the state of current knowledge in the specific area of your topic, and as such, should have a narrative that flows from one paragraph to another. You cannot achieve this with bullet points and small disjoint sections.
Tableau & Splunk Project
The Assignment consists of a research report of 1500 words Due Week 8 30%.
Part 1: Business Data Analysis Problems to be addressed in the report
The report should provide an analysis and evaluation of Global Superstore financial performance over a five-year period from 2014 to 2018 on its global reach of 165 countries. Conduct a comprehensive analysis of revenue, profit, discounts and customer behaviour. Analyse the Key Performance Indicators (KPI) in monitoring, assessing and managing a firm's performance in terms of:
Diverse product range generating higher sales revenue and translation into superior profit
The influence discounts and promotions have on sales and customer loyalty.
Predicting sale trends in product sub-categories in different regions
Investigate which categories and products are the most popular and the most profitable. Should the Global Superstore consider streamlining their products offering, investing more heavily in products which are performing well and removing which are not?
Examine the relationship between purchasing patterns and the percentage of discount applied to determine the impact the discounts and promotions have on sales.
Customer service is another key factor that businesses use to differentiate themselves from their competitors. Determine the volume of repeat customers. Assess the level of customer service that they provide and make recommendations on how they can improve their service and customer retention.
Examine which countries and products have highest return rates. Determine if there are any products that the Global Superstore should consider removing from their product range, as well as aiding them in developing strategies to combat or reduce the rate of return in countries with a high frequency of returns. Perform a return analysis across categories and subcategories.
Evaluate the average shipping cost, most popular mode of shipping and shipping latency to establish if there is a connection between these variables and purchases made.
Predicting future revenue streams is an essential component when determining merchandising decisions.
This report should build a predictive model to determine sales revenue from a given product in each region. This should be limited to the top five sub-categories.
The research report must have the format:
Cover sheet (make sure each team member's name and student no are on the coversheet)
Executive Summary
Table of contents
Company Information
Problem Identification
Data Collection
Analysis and Discussion
Recommendations
References
Part 2: Network Data Analysis
In this part of the assessment, you must analyse the log details of the global super store in Australia with Splunk.
Perform a basic search for errors and any type of failures in all data sources. Provide a screenshot of the entire page once completed.
Perform a new search for password and any type of failures on port 22 in all data sources. Provide a screenshot of the entire page once completed.
Do you see trends over time? If so, what were they?
Use the output of your search to refine the results by adding a new field to the search
Take a screenshot of your ‘Activity Jobs Menu' detailing the current job saved with expiration date.
Take a screenshot of your search history. Set a filter to narrow down your search results.
Show where are selected field, interesting fields and all fields located. How do you use fields to perform a search.
How do you add time range when performing search.
Objective(s): Evaluate and compare various distributed big data computing frameworks, focusing on their architecture, performance, scalability, ease of use, and application areas.
Structure:
Introduction (10%)
Define distributed big data computing.
Importance of distributed computing frameworks in handling big data.
Overview of the report.
Framework Analysis (40%)
Apache Hadoop
Architecture (HDFS, MapReduce, YARN)
Performance and scalability
Pros and cons
Use cases
Apache Spark
Architecture (RDD, DAG, Spark SQL, MLlib)
Performance and scalability
Pros and cons
Use cases
Apache Flink
Architecture (DataStream API, Batch Processing, CEP)
Performance and scalability
Pros and cons
Use cases
Other Relevant Frameworks (e.g., Apache Storm, Apache Samza)
Brief overview
Comparison with the above frameworks
Comparative Analysis (30%)
Comparative table highlighting key features, advantages, and disadvantages.
Discussion on the best framework for different use cases (real-time processing, batch processing, machine learning, etc.).
Case Study (10%)
Detailed analysis of a real-world application using one of the discussed frameworks.
Evaluation of the chosen framework's performance and impact on the application.
Conclusion (10%)
Summary of findings.
Recommendations based on the comparative analysis.
References (not graded but mandatory)
Cite all sources in a consistent format (APA/MLA/Harvard).
Presentation: 10%
Objective: Present the key findings from the report in a clear, engaging, and concise manner.
Structure:
Introduction (10%)
Brief overview of the topic and purpose of the presentation.
Key Findings (50%)
Highlight major points from the framework analysis.
Use visuals (charts, tables, diagrams) to illustrate comparisons.
Case Study Summary (20%)
Summarize the case study, focusing on the application and impact of the chosen framework.
Conclusion (10%)
Summarize the overall findings and recommendations.
References (10%)
Citation and listing of references as per IEEE format.
Students need to demonstrate to the tutor that each team member has made a significant contribution to the report. It is suggested the group use a collaborative environment such as google drive to store documents and work on the assignment. You will also create a document that lists each task and the name of the team member/s responsible for the task.
The task allocation must be approved by the tutor before commencing other work on the report.The group is also required to discuss their progress with the tutor on a weekly basis.
Additional information regarding this Assessment:
Report document standards
Normal font is Calibri, size 11 point for the body of all documents with the text fullyjustified.
Headings should not exceed 14 points in size except on a title page where larger fontsare appropriate for the title of a report.
Documents should use 1.15 spacing within a paragraph and have an 8-point spacebetween paragraphs.
Footers should be created on the report that includes a page number.
Up to 15% of the Report contents may be quoted or paraphrased from
other sources provided you acknowledge and cite the original source of the material you use.
Use IEEE referencing on all quoted or paraphrased material.