What portion of data use for training validation and testing

Assignment Help Computer Engineering
Reference no: EM133659291

Assignment: Machine Learning & Artificial Intelligence for Bioinformatics

This assignment needs to be completed on the Google Collaboratory, and the results submitted as screenshots in a .doc or .pdf. Include the completed run of the corresponding code the question refers too along with your written answer (you can include additional code if you want). You will need to

Also you are welcome to run Tensorflow code outside of the Collaboratory, if you have such a setup, please note though that the submission need to follow the same format, meaning code cells-> output as shown on the Collaboratory (for example do not submit Python interactive command terminal code)

In preparation for the assignment, you can review again the Google Collaboratory posted in the last lecture. Watch the YouTube videos "Getting Started with Google CoLab | How to use Google Colab" and "Google Colab Tutorial for Beginners | Get Started with Google Colab" in order to become familiar with the Collaboratory (feel free to watch any additional on Youtube):

Note: You need to run cells from top to bottom (since top code cells generate dependencies for the lower cells), so you have to copy-paste and run the code cells in your own Google Collaboratory, in the same order shown in the code each question points you too. Then as the questions request you to do (for example, adjusting the number of epochs), you have to edit the code in the corresponding cells and re-run each cell. If you are still confused on how this works, re-watch the above videos with tutorials on the Google Collaboratory and also additional videos.

Question A

NOTE: Use instead of "from keras.layers.normalization import BatchNormalization" the "from keras.layers import BatchNormalization".

Run the following code on the Collaboratory. Tip: If you are logged in your Google account and click the "Copy to Drive" button on the top. This will make a full copy of this Google Collaboratory sheet under your own account, and save you a lot of typing and copy-pasting compared to starting a new sheet and transferring everything over manually.

I. How many different types of neural networks (and what kind of networks) are being used to classify the digits - show the corresponding part of the code where these networks are implemented.

II. Run the code with both types of neural networks that are in it, based on the metrics, which one does it classify the digits better? Please explain your answer by also defining the metrics (so you understand what each metric means).

III. Could you try a different activation function instead of softmax in the final layer and see what happens with the model predictions and its metrics?

Question B

Run the following code on the Collaboratory (you can skip the part showing the images if you wish). You will need to copy this code in a new, clean sheet of the Google Collaboratory.

I. Modify the number of Convolutional and Max Pooling layers, for example add a pair or two, and remove a layer or two:

model = tf.keras.Sequential([
tf.keras.layers.experimental.preprocessing.Rescaling(1./255),
tf.keras.layers.Conv2D(32, 3, activation='relu'),
tf.keras.layers.MaxPooling2D(),
tf.keras.layers.Conv2D(32, 3, activation='relu'),
....

Then rerun the training with the modifications

model.compile (
...
and also
model.fit (
train_ds,
..

What do you observe changing in the metrics? (Just run it for 3 epochs as it is)

II. Modify the number of epochs increasing them gradually (you might reach a point where it gets too slow in the Google Collaboratory). What do you observe in the metrics as you increase the epochs, is there a point where the metrics plateau?

III. In which part of the code we split the dataset in training / validations and what portions? What is the purpose of doing this?

IV. Look at the structure of the Convolutional Neural Network as specified in the code for this image classification example. What are the differences? Make those adjustments to modify the code you just made on a - c above, and re-run the model (use 5 epochs or so). What do you observe in the model metrics?

Question C

Run the following code on Deep Learning for genomics on the Google Collaboratory:

I. Describe in a couple of sentences the overall function of this neural network for bioinformatics predictions - what the predictions taking place, what are the data used, and what type of neural network we are using? From which parts of the code you can find the answers to each of these points ?

II. How many prediction classes this neural network has, and describe what are these classes. In addition to finding this from the text cells in the code, also point the parts of the actual code that would demonstrate the number of prediction classes (it should be one of the final layers in the network).

III. What portion of the data we use for training, validation and testing? Where do you see that in the code?

IV. Run the code in your Google Collaboratory up to the point where we have the model lost / accuracy graphs (including printing these graphs). What do you observe in these graphs if you modify the testing and validation portions of the datasets? You would need to re-run the cells from all the way up (where we define the training / validation portions) up and including the cells generating the graphs. Similarly if you reduce significantly the number of epochs, what do you observe in those graphs?

Reference no: EM133659291

Questions Cloud

Develop programs using java programming concepts : ICT502 Object-oriented programming, Victorian Institute of Technology - Develop programs using Java programming concepts and OO principles based on problem
Decisions after someone has been convicted with crime : You are a Detroit judge tasked with making sentencing decisions after someone has been convicted with a crime.
The Federal Trade Commission Act : The Federal Trade Commission Act...? Violating an anti-trust law could mean which of the following?The Civil Rights Act of 1968 is commonly referred to as?
Provide two specific questions andr topics for your evp : Provide two specific questions and/or topics for your EVP, including relevant details. Traditional text-based format is recommended for EVP consultations.
What portion of data use for training validation and testing : What the predictions taking place? What portion of the data we use for training, validation and testing? Where do you see that in the code?
Compare and contrast the recommended practices : GDECE 101 Foundations of early childhood education - Compare and contrast the recommended practices with some of the historical approaches examined in class
Partly responsible for the injuries he sustained : Sullivan alleges that the tavern owner is partly responsible for the injuries he sustained.
Discuss the purpose of the analysis of group differences : Discuss the purpose of the analysis of group differences and note what, if any differences were found. What is the purpose of the analysis of group differences?
Essential framework for resolving complex interactions : An essential framework for resolving the complex interactions between individual rights and more general society issues is provided by the Oakes Test

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd