Reference no: EM131114153
1. A dataset has 1000 records and 50 variables with 5% of the values missing, spread randomly throughout the records and variables. An analysis decides to remove records that have missing values. About how many records would you expect would be removed?
2. Given a database table containing weather data as follows:
Outlook
|
Temperature
|
Humidity
|
Windy
|
Class: Play
|
Sunny
|
Hot
|
High
|
False
|
No
|
Sunny
|
Hot
|
High
|
True
|
No
|
Overcast
|
Hot
|
High
|
False
|
Yes
|
Rainy
|
Mild
|
High
|
False
|
Yes
|
Rainy
|
Cool
|
Normal
|
False
|
Yes
|
Rainy
|
Cool
|
Normal
|
True
|
No
|
Overcast
|
Cool
|
Normal
|
True
|
Yes
|
Sunny
|
Mild
|
High
|
False
|
No
|
Sunny
|
Cool
|
Normal
|
False
|
Yes
|
Rainy
|
Mild
|
Normal
|
False
|
Yes
|
Sunny
|
Mild
|
Normal
|
True
|
Yes
|
Overcast
|
Mild
|
High
|
True
|
Yes
|
Overcast
|
Hot
|
Normal
|
False
|
Yes
|
Rainy
|
Mild
|
High
|
True
|
No
|
Where Outlook, Temperature, Humidity, and Windy are the input variables (predictors), and Play is the output variable (response).
a. Compute the prior probability
P(PLAY='Yes') =
P(PLAY='No') =
b. Compute the conditional probability
P(Outlook='Sunny'|PLAY='Yes') =
P(Outlook='Sunny'|PLAY='No') =
P(Temperature = ‘Mild'|PLAY='Yes') =
P(Temperature = ‘Mild'|PLAY='No') =
P(Humidity = ‘High'| PLAY='Yes') =
P(Humidity = ‘High'| PLAY='No') =
P(Windy = ‘False'| PLAY='Yes') =
P(Windy = ‘False'| PLAY='No')=
3. Using naïve Bayes classification method to classify the following unknown record and to indicate whether to play or not.
(Outlook = ‘Sunny', Temperature = ‘Mild' , Humidity = ‘High' , Windy = ‘False')
4. Association Rule Mining:
Given a transaction database for mining association rule as follows:
Database D
TID
|
Items
|
100
|
A C D
|
200
|
B C E
|
300
|
A B C E
|
400
|
B E
|
Please useApriorialgorithm to mine association rules with minimum support count = 2.
(Please show the derivation process step by step with candidate itemsets.)
How these elements contribute to the central ideas of play
: Review the stage directions and, in your discussion post, identify the most important aspects of the setting.Then, consider how these elements contribute to the central ideas of the play
|
How many gates would such a system require
: Develop a two-dimensional addressing system using a 6-to-64 decoder, a 64-word×128- bit matrix, and 16-input multiplexers. How many gates would such a system require?
|
How would the results be used to make a diagnosis
: Explain what physical exams and diagnostic tests would be appropriate and how the results would be used to make a diagnosis. List five different possible conditions for the patient's differential diagnosis, and justify why you selected each.
|
Determine the value of the company shares
: The average growth of dividends for the past five years is expected to persist in the foreseeable future. You are required to determine the value of the company's shares after payment of the dividend of 2004.
|
How many records would you expect would be removed
: A dataset has 1000 records and 50 variables with 5% of the values missing, spread randomly throughout the records and variables. About how many records would you expect would be removed?
|
Explain the implied volatility
: Find the price of a six month european call option on a non-dividend paying stock with a strike price of 20 when the current stock price is 18, the risk free rate is 6% per annum and the volatility is 30 per annum. Use the Black scholes merton mod..
|
Describe the two families in the film
: Describe the two families in the film (ie the names of the family, people in household, jobs held, current financial situation,etc) - Did race impact the families lives? Explain
|
Minimum average collection period
: The minimum average collection period required to approve the cash discount plan is _________days?
|
Show a block diagram of an srff connected to store 1 bit
: Using 4 SRFFs obtain the block diagram for an SISO shift register.
|