Analyse the population biology of abalone in tasmania

Assignment Help Software Engineering
Reference no: EM132305801

TASMANIA ABALONE RING PREDICTION

Description

In this assignment we will apply machine learning techniques learned in the lectures and tutorials to analyse the population biology of abalone in Tasmania. In particular, we will predict the age of abalone from physical measurements. By observation, the age of abalone can be estimated by the number of rings which can be seen by a microscope. In this assignment you will predict the number of rings given other attributes.

Name

Data Type

Meas.

Description

----

---------

-----

-----------

sex

nominal

 

M, F, and I (infant)

length

continuous

mm

Longest shell measurement

diameter

continuous

mm

perpendicular to length

height

continuous

mm

with meat in shell

whole weight

continuous

grams

whole abalone

shucked weight

continuous

grams

weight of meat

viscera weight

continuous

grams

gut weight (after bleeding)

shell weight

continuous

grams

after being dried

edible

boolean.

 

True, and False

rings

integer

 

+1.5 gives the age in years

Task 1: Data Collection
Identify irrelevant information from the data and remove it to clean the data. Hint: Use Weka or Excel.

Task 2: Data Pre-processing
There are some missing values for height attribute and rings. Decide the way you handle this issue and explain why.
Hint: Use Weka or Excel.

Task 3: Data Transformation
We need to create a new attribute called volume from other attributes as: volume = length * diameter * height.
Normalise the data into [0-1] range.
Hint: Use excel or write program (if you know how to do it).

Task 4: Data Mining & Pattern Evaluation
Prepare your data from the to have:
- A training set of the first 2500 samples
- A validation set of the next 633 samples
- A test set of the last 1044 samples
Run 15 machine learning algorithms and report their accuracy on the validation set to a table.
Select the algorithm that give the highest accuracy in validation set and run the algorithm using the training set and test set. Report this result in test set.
There will be a table of top test result. Email me the results of test set at any time (screenshot of your results in Weka) to put your name to the rank.
Explain how the best algorithms work (in the report) Tips: How to improve performance?
- Handle the missing data issue effectively, use data normalisation, ury different techniques learned from lectures and select the ones that give top accuracy in validation

Task 5: Write a report
Write a report using the following template.

MAJOR ASSIGNMENT

Synopsis of the task and its context
This is an individual assignment making up 20% of the overall unit assessment. The assessment criteria for this task are:
1) Apply machine learning pipeline to solve a real-world problem (Biology of Tasmania Abalone).
a) Identify relevant data
b) Process and clean data
c) Transform data (making new attribute and normalise data)
d) Apply machine learning techniques to predict abalones' rings.
2) Writing a scientific report (1.5-2 pages A4, double column)
a) Understand the impact of this work.
b) Analysis of the results.
c) Identify the best technique for this problem and understand how it works.

Unit learning outcomes
On successful completion of this unit...

1. understand the local and global impact of AI on individuals, organizations, and society

2. adapt and apply techniques for acquiring, representing, and reasoning with data, information, and knowledge

3. select and effectively apply techniques to develop simple AI solutions

4. analyze a problem, apply knowledge of AI principles, and use ICT technical skills to develop potential solutions

5. evaluate strengths and weaknesses of potential AI solutions

Attachment:- ARTIFICIAL INTELLIGENCE.rar

Verified Expert

This paper involves machine learning using weka software and excel. The original idea was to split the provided data into training,testing and validation sets, 15 machine learning models were trained and their performance was compared on the testing dataset,The best model was then picked and discussed.

Reference no: EM132305801

Questions Cloud

How would switching to a real-time database solution help : How would switching to a real-time database solution help Falcon Security remain competitive?
Why businesses must spend time and money to educate employee : It is important to understand that humans and technology interact in all information systems. Why do you feel businesses must spend time and money to educate.
Connection between role ambiguity and task demands : What's the connection between role ambiguity and task demands.
What is the osi security architecture : What is the OSI security architecture? What is the difference between passive and active security threats? List and briefly define categories of security.
Analyse the population biology of abalone in tasmania : KIT108 - Artificial Intelligence - University of Tasmania - Identify irrelevant information from the data and remove it to clean the data. Hint: Use Weka
What must be considered in the expansion process : You are the Business Manager for TRE Aviation, Inc., an aircraft equipment and parts manufacturing company.
What is the difference between security and safety : What is the relationship between risk management and vulnerability assessment? What is the difference between security and safety?
Concepts of societal marketing and sustainability : Put yourself in the role of a marketing manager. From this perspective, do you agree with the concepts of societal marketing and sustainability?
Explain the relevant issues in auditing : Investigate and explain the relevant issues in Auditing and Accounting raised by the case Ryan Wealth Holdings Pty Ltd v Baumgartner [2018] NSW

Reviews

len2305801

5/14/2019 11:30:11 PM

2. Scientific Report(30%) a) Understand the impact of the work Good introduction. Explain clearly how machine learning can help in the case of biology of Tasmania Abalone. Provided a dataset or use cases for training evaluation that has the potential to establish the likely success or failure of the proposed method Provided a dataset or use cases for training evaluation Provided limited data for training/testing b) Analysis of the results Report the results in a table. Provide analysis over the results, i.e. explain the effect of data selection, data processing, and data transformation step on the results. c) Explain the best ML technique. Show his/her understanding of the selected ML technique which give the best results. Explain the advantages of this ML technique which contribute the best performance.

len2305801

5/14/2019 11:29:50 PM

Criteria HD (High Distinction) 1. Machine learning pipeline(70%) a) Data collection Excellent choice of irrelevant attribute(s) and be able to perform removal of that attribute(s) from the original dataset. b) Data processing (10%) Identify and apply relevant data processing techniques for both height and ring attributes and do not lose any data samples. c) Data transformation Can create new attribute volume and normalise data attributes to [0-1] range (except rings). d) Data Mining and Pattern evaluation Can divide the data, create ARFF files and run all 15 different machine learning techniques and report the accuracy. The best accuracy is not less than 10% of the best one from all students.

len2305801

5/14/2019 11:29:36 PM

Synopsis of the task and its context This is an individual assignment making up 20% of the overall unit assessment. The assessment criteria for this task are: 1) Apply machine learning pipeline to solve a real-world problem (Biology of Tasmania Abalone). a) Identify relevant data b) Process and clean data c) Transform data (making new attribute and normalise data) d) Apply machine learning techniques to predict abalones’ rings. 2) Writing a scientific report (1.5-2 pages A4, double column) a) Understand the impact of this work. b) Analysis of the results. c) Identify the best technique for this problem and understand how it works.

Write a Review

Software Engineering Questions & Answers

  Research report on software design

Write a Research Report on software design and answer diffrent type of questions related to design. Report contain diffrent basic questions related to software design.

  A case study in c to java conversion and extensibility

A Case Study in C to Java Conversion and Extensibility

  Create a structural model

Structural modeling is a different view of the same system that you analyzed from a functional perspective. This model shows how data is organized within the system.

  Write an report on a significant software security

Write an report on a significant software security

  Development of a small software system

Analysis, design and development of a small software system.

  Systems analysis and design requirements

Systems Analysis and Design requirements

  Create a complete limited entry decision table

Create a complete limited entry decision table

  Explain flow boundaries map

Explain flow boundaries map the dfd into a software architecture using transform mapping.

  Frame diagrams

Prepare a frame diagram for the software systems.

  Identified systems and elements of the sap system

Identify computing devices, which could be used to support Your Improved Process

  Design a wireframe prototype

Design a wireframe prototype to meet the needs of the personas and requirements.

  Explain the characteristics of visual studio 2005

Explain the characteristics of Visual Studio 2005.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd