Analyse the population biology of abalone in tasmania

Assignment Help Software Engineering
Reference no: EM132305801

TASMANIA ABALONE RING PREDICTION

Description

In this assignment we will apply machine learning techniques learned in the lectures and tutorials to analyse the population biology of abalone in Tasmania. In particular, we will predict the age of abalone from physical measurements. By observation, the age of abalone can be estimated by the number of rings which can be seen by a microscope. In this assignment you will predict the number of rings given other attributes.

Name

Data Type

Meas.

Description

----

---------

-----

-----------

sex

nominal

 

M, F, and I (infant)

length

continuous

mm

Longest shell measurement

diameter

continuous

mm

perpendicular to length

height

continuous

mm

with meat in shell

whole weight

continuous

grams

whole abalone

shucked weight

continuous

grams

weight of meat

viscera weight

continuous

grams

gut weight (after bleeding)

shell weight

continuous

grams

after being dried

edible

boolean.

 

True, and False

rings

integer

 

+1.5 gives the age in years

Task 1: Data Collection
Identify irrelevant information from the data and remove it to clean the data. Hint: Use Weka or Excel.

Task 2: Data Pre-processing
There are some missing values for height attribute and rings. Decide the way you handle this issue and explain why.
Hint: Use Weka or Excel.

Task 3: Data Transformation
We need to create a new attribute called volume from other attributes as: volume = length * diameter * height.
Normalise the data into [0-1] range.
Hint: Use excel or write program (if you know how to do it).

Task 4: Data Mining & Pattern Evaluation
Prepare your data from the to have:
- A training set of the first 2500 samples
- A validation set of the next 633 samples
- A test set of the last 1044 samples
Run 15 machine learning algorithms and report their accuracy on the validation set to a table.
Select the algorithm that give the highest accuracy in validation set and run the algorithm using the training set and test set. Report this result in test set.
There will be a table of top test result. Email me the results of test set at any time (screenshot of your results in Weka) to put your name to the rank.
Explain how the best algorithms work (in the report) Tips: How to improve performance?
- Handle the missing data issue effectively, use data normalisation, ury different techniques learned from lectures and select the ones that give top accuracy in validation

Task 5: Write a report
Write a report using the following template.

MAJOR ASSIGNMENT

Synopsis of the task and its context
This is an individual assignment making up 20% of the overall unit assessment. The assessment criteria for this task are:
1) Apply machine learning pipeline to solve a real-world problem (Biology of Tasmania Abalone).
a) Identify relevant data
b) Process and clean data
c) Transform data (making new attribute and normalise data)
d) Apply machine learning techniques to predict abalones' rings.
2) Writing a scientific report (1.5-2 pages A4, double column)
a) Understand the impact of this work.
b) Analysis of the results.
c) Identify the best technique for this problem and understand how it works.

Unit learning outcomes
On successful completion of this unit...

1. understand the local and global impact of AI on individuals, organizations, and society

2. adapt and apply techniques for acquiring, representing, and reasoning with data, information, and knowledge

3. select and effectively apply techniques to develop simple AI solutions

4. analyze a problem, apply knowledge of AI principles, and use ICT technical skills to develop potential solutions

5. evaluate strengths and weaknesses of potential AI solutions

Attachment:- ARTIFICIAL INTELLIGENCE.rar

Verified Expert

This paper involves machine learning using weka software and excel. The original idea was to split the provided data into training,testing and validation sets, 15 machine learning models were trained and their performance was compared on the testing dataset,The best model was then picked and discussed.

Reference no: EM132305801

Questions Cloud

How would switching to a real-time database solution help : How would switching to a real-time database solution help Falcon Security remain competitive?
Why businesses must spend time and money to educate employee : It is important to understand that humans and technology interact in all information systems. Why do you feel businesses must spend time and money to educate.
Connection between role ambiguity and task demands : What's the connection between role ambiguity and task demands.
What is the osi security architecture : What is the OSI security architecture? What is the difference between passive and active security threats? List and briefly define categories of security.
Analyse the population biology of abalone in tasmania : KIT108 - Artificial Intelligence - University of Tasmania - Identify irrelevant information from the data and remove it to clean the data. Hint: Use Weka
What must be considered in the expansion process : You are the Business Manager for TRE Aviation, Inc., an aircraft equipment and parts manufacturing company.
What is the difference between security and safety : What is the relationship between risk management and vulnerability assessment? What is the difference between security and safety?
Concepts of societal marketing and sustainability : Put yourself in the role of a marketing manager. From this perspective, do you agree with the concepts of societal marketing and sustainability?
Explain the relevant issues in auditing : Investigate and explain the relevant issues in Auditing and Accounting raised by the case Ryan Wealth Holdings Pty Ltd v Baumgartner [2018] NSW

Reviews

len2305801

5/14/2019 11:30:11 PM

2. Scientific Report(30%) a) Understand the impact of the work Good introduction. Explain clearly how machine learning can help in the case of biology of Tasmania Abalone. Provided a dataset or use cases for training evaluation that has the potential to establish the likely success or failure of the proposed method Provided a dataset or use cases for training evaluation Provided limited data for training/testing b) Analysis of the results Report the results in a table. Provide analysis over the results, i.e. explain the effect of data selection, data processing, and data transformation step on the results. c) Explain the best ML technique. Show his/her understanding of the selected ML technique which give the best results. Explain the advantages of this ML technique which contribute the best performance.

len2305801

5/14/2019 11:29:50 PM

Criteria HD (High Distinction) 1. Machine learning pipeline(70%) a) Data collection Excellent choice of irrelevant attribute(s) and be able to perform removal of that attribute(s) from the original dataset. b) Data processing (10%) Identify and apply relevant data processing techniques for both height and ring attributes and do not lose any data samples. c) Data transformation Can create new attribute volume and normalise data attributes to [0-1] range (except rings). d) Data Mining and Pattern evaluation Can divide the data, create ARFF files and run all 15 different machine learning techniques and report the accuracy. The best accuracy is not less than 10% of the best one from all students.

len2305801

5/14/2019 11:29:36 PM

Synopsis of the task and its context This is an individual assignment making up 20% of the overall unit assessment. The assessment criteria for this task are: 1) Apply machine learning pipeline to solve a real-world problem (Biology of Tasmania Abalone). a) Identify relevant data b) Process and clean data c) Transform data (making new attribute and normalise data) d) Apply machine learning techniques to predict abalones’ rings. 2) Writing a scientific report (1.5-2 pages A4, double column) a) Understand the impact of this work. b) Analysis of the results. c) Identify the best technique for this problem and understand how it works.

Write a Review

Software Engineering Questions & Answers

  How does software engineering differ from other engineering

How does software engineering differ from other, more traditional fields of engineering such as electrical and mechanical engineering?

  Consist of a series of phases

The software life cycle can be said to consist of a series of phases. The classical model is referred to as the waterfall model. Which phase may be defined as "The concept is explored and refined, and the client's requirements are elicited?"

  The balanced scorecard

Project managers can use one of several approaches to measuring the quality and success of an IS/IT project. For example, balanced scorecards are a simple and effective way of gauging the level of user adoption and success of an IS/IT project.

  Explain the basic architechture of a knowledge based system

Describe a design of your chosen system and explain the basic architechture of a knowledge based system.

  Responsive web design and development principles

COIT20268 - Responsive Web Design (RWD) - CQ university - you are to provide a formal report documenting your critical evaluation of the website

  Draw erd for student entity

Draw and ERD for this situation. What attribute or attributes did you designate as the identifier for the STUDENT entity? Why?

  Describe the ordering system to be designed

Describe the ordering system to be designed. Material can be taken from your approved proposal that is submitted to your instructor.

  Explanations on spreadsheet concepts

A workbook can contain more than one worksheet. Provide an example of a spreadsheet application where it would be necessary to have more than one worksheet, can you describe how that works?

  Which of these does not account for software failure

Which of these does not account for software failure ?

  Illustrate the use cases for the dentist office system

Create a use case diagram that would illustrate the use cases for the following dentist office system: Whenever new patients are seen for the first time, they complete a patient information form that asks their name.

  Different software development methods

Compare and contrast at least three (3) different software development methods. Explain why the testing of software products is of great importance

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd