Implement multiple algorithms

Assignment Help Python Programming

Reference no: EM132734330

Project

Task
You have been provided with a dataset, which you are to use to predict crop yield given climate data, i.e. a regression problem. The task is to perform a "typical" machine learning project:
• Prepare the data
• Implement multiple algorithms, specifically:
? Linear regression (As the baseline; you may reuse your earlier code)
? Regression forest (Use technique in lecture; do not use a histogram)
? Gaussian process
? One of your own choosing (One we taught or something else)
• Evaluate the algorithms on the dataset and tune them to maximise performance
• Critically analyse the results

1. What exploration of the data set was conducted?
2. How was the dataset prepared?
3. How does a regression forest work?
4. How does a Gaussian process work?
5. Which additional algorithm did you choose and why?
6. What are the pros and cons of the algorithms?
7. Describe the toy problem used to validate the algorithms, and explain its design?
8. What evidence of correct, or incorrect, implementation did the toy problem provide?
9. How were the hyperparameters optimised?
10. What results are obtained by the algorithms?
11. How fast do the algorithms run and how fast could they run?
12. Which algorithm would you deploy and why?
13. How could the best algorithm be improved further?
14. If you were to try another algorithm then which one and why?
15. Are the results good enough for real world use?
16. How could this solution fail?
17. What improvements could be made to the data set?

Data set
The task is to predict crop yield (tonnes per hectare) for the major maize crop of farms distributed across the entire planet (the major crop is the main crop of the year; farms may also grow a second crop with a lower yield). For each farm climate data is provided, specifically three features for each month:
• Total rainfall, in mm.
• Mean minimum temperature, in degrees celsius (minimum temperature for each day, averaged over the entire month).
• Mean maximum temperature. Likewise to above.
Year has also been included to account for improvements in farming techniques; the data covers three decades

The data is provided as one csv file (it includes a header) - you are responsible for splitting it for train/test and hyperparameter learning. Each row is an exemplar, and each column a feature.
Your task is to predict the last column (#38) given the first 37 columns. There are 31744 exemplars.

This data set was created by merging and subsampling
• "The global dataset of historical yields for major crops 1981-2016" by Iizumi & Sakai.
• USA NOAA World Weather Records (WWR).

Attachment:- Project.rar

Reference no: EM132734330

Questions Cloud

Diversity and universality : Based on Leininger's Theory of Cultural Care: Diversity and Universality, the goal was to provide culturally congruent holistic care

How should the supervisor deal with an employee : How should the supervisor deal with an employee who is consistently late or absent? What verifiable facts are available? What are the decisions be made?

Calculate the company cost of equity capital : The company's tax rate is 25%. Ortiz's CFO has calculated the company's WACC as 9.9%. Calculate the company's cost of equity capital

Should the company implement a dress code : Should the company/ office manager implement a dress code? What might be the causes of the problem-the reason this change is being considered?

Implement multiple algorithms : Predict crop yield given climate data, i.e. a regression problem. The task is to perform a "typical" machine learning project

Discuss your thoughts on the innovation plan : Discuss your thoughts on the innovation plan and if you feel it will be a success. In your responses among your peers, expand the discussion by providing.

Cultural competency self-assessment : Use the link provided to complete a self-assessment. After completing it, Describing your thoughts about your performance on the assessment.

What is the estimated cost of common equity using the CAPM : The return on an average stock in the market last year was 15%. What is the estimated cost of common equity using the CAPM

Outcome-based behavioral healthcare : Quality of care and services are hallmarks of modern-day healthcare industry. What are challenges faced while measuring outcomes for psychotherapy

User Account

All Pages