Designing and Building a Prediction Model

Reference no: EM133254129

Assignment

Design and build a prediction model for the Bike Buyer data with a classifier. Choose any two classifiers and apply them to your Bike Buyer data set.

Language to be used: Python

Plan your experiment with:

1. Determine the data preprocessing methods required for each of your classifiers.
2. For each classifier:
2-1. Compare the accuracy of the classifier with two different sets of input parameters, if applicable.
2-2. Compare the accuracy of the classifier with two different data preprocessing methods.
2-3. Experiment with feature selection using PCA tools or your own experiment (see below for an example).

3. Compare the accuracy of the classifiers across all tests.

4. Discuss your results:
- Why the induced model differs for the same training data when you change the parameter values or the classifier.
- Why a certain parameter setting or classifier shows better accuracy than the others you tried.
- Anything else you observed.

Dataset to be used:
- Use your VTargetMail data.

Phases:
Phase 1. Determine the data preprocessing methods to apply for each of your classifiers. For example:
- Discretization for Decision Tree
- Vectorization of a record for SVM
- Normalization for Neural Network
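
Two of the preprocessing steps listed above, normalization and vectorization of a record, can be sketched in plain Python. This is only an illustration under stated assumptions: the column names and values are hypothetical and are not taken from VTargetMail, and min-max scaling is just one of several possible normalization choices.

```python
def min_max_scale(values):
    """Rescale a numeric column linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no information; map it to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def one_hot(value, categories):
    """Encode one categorical value as a 0/1 list, one slot per category."""
    return [1.0 if value == c else 0.0 for c in categories]


# Hypothetical records; names and values are illustrative only.
records = [
    {"YearlyIncome": 30000.0, "Gender": "M"},
    {"YearlyIncome": 60000.0, "Gender": "F"},
    {"YearlyIncome": 90000.0, "Gender": "F"},
]

genders = sorted({r["Gender"] for r in records})  # ['F', 'M']
incomes = min_max_scale([r["YearlyIncome"] for r in records])

# Each record becomes a flat numeric vector: [scaled income, is_F, is_M].
vectors = [[inc] + one_hot(r["Gender"], genders)
           for inc, r in zip(incomes, records)]
```

Distance-based and gradient-based classifiers (SVM, Neural Network, K Nearest Neighbor) are usually sensitive to feature scale, which is why scaling is paired with them here.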
Phase 2. Design your data analytics experiment with two different classifiers of your choice. Choose any two classifiers covered in class (for example, Decision Tree, Naïve Bayes, SVM, Neural Network, or K Nearest Neighbor) or any other classifier, and compare the accuracy of their results.
Phase 2-1. Experiment to Find the Best Parameter Setting for your Classifier. For Example:

Example 1: Decision Tree Classifier: C5 for GainRatioSplit, CART for GiniSplit, on the same set of data with different parameter settings as follows:
- Measure: Entropy, GINI
- Different Minimum Support Thresholds
- Different Complexity Penalty Degrees on the Number of Splits

Example 2: Neural Network: Test with two different topologies (the number of units in a hidden layer, the number of hidden layers).
SVM: Test with different kernel functions.
K Nearest Neighbor: Test with two different K values and distance metrics.

Alternatively, carry out the Phase 2-2 experiment that follows.
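
The two split measures named in Example 1, entropy and Gini, can be compared on a single candidate split with a short sketch. The labels below are hypothetical BikeBuyer values (1 = buyer, 0 = non-buyer). The sketch computes plain information gain, not the gain ratio that C5-style splitting uses, so treat it as an illustration of the measures only.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def weighted_impurity(groups, measure):
    """Impurity after a split: size-weighted average over the child groups."""
    total = sum(len(g) for g in groups)
    return sum(len(g) / total * measure(g) for g in groups if g)


# Hypothetical labels before and after one candidate split.
parent = [1, 1, 1, 0, 0, 0, 0, 0]
left, right = [1, 1, 1, 0], [0, 0, 0, 0]

info_gain = entropy(parent) - weighted_impurity([left, right], entropy)
gini_gain = gini(parent) - weighted_impurity([left, right], gini)
```

Both measures reward the same split here, but they can rank competing splits differently, which is one reason the induced tree may change when the measure changes.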

Phase 2-2. For Naïve Bayes, NN, or SVM, experiment with two different data transformation methods for continuous and numeric attributes:
1) Data set as floating point without Discretization and Binarization
2) Data set with Discretization and Binarization
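
The two representations can be built side by side as in the sketch below. It assumes equal-width binning with three bins and hypothetical age values; the bin count and the binning strategy are illustrative choices, not requirements of the assignment.

```python
def equal_width_bin(value, lo, hi, n_bins):
    """Return the 0-based index of the equal-width bin that contains value."""
    if hi == lo:
        return 0
    idx = int((value - lo) / (hi - lo) * n_bins)
    # Clamp so that value == hi falls into the last bin.
    return min(max(idx, 0), n_bins - 1)


def binarize(bin_index, n_bins):
    """Turn a bin index into a 0/1 indicator list with one slot per bin."""
    return [1 if i == bin_index else 0 for i in range(n_bins)]


# Hypothetical ages; not taken from VTargetMail.
ages = [20.0, 35.0, 50.0, 80.0]
lo, hi, n_bins = min(ages), max(ages), 3

# 1) Floating-point representation, no discretization or binarization.
as_float = [[a] for a in ages]

# 2) Discretized into equal-width bins, then binarized.
as_binary = [binarize(equal_width_bin(a, lo, hi, n_bins), n_bins) for a in ages]
```

Training the same classifier once on `as_float` and once on `as_binary` gives the two accuracy figures this phase asks you to compare.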

Phase 2-3. Experiment for feature selection with either:
1) Feature Significance Analysis with PCA tools
2) Your own experiment, as follows:
Simple Experiment for Feature Selection Methodology
2-3-1. Pick the best parameter setting and data transformation from Phase 2.
2-3-2. Apply your classifier with the best parameter set to each different feature set from your input file to see whether there is any significant difference in the result for each iteration (see below for an example).
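
For the PCA option, the idea behind a feature significance analysis can be sketched without any library: compute the covariance matrix, find the first principal component, and inspect how strongly each feature loads on it. The sketch below uses power iteration and made-up data, so it is an assumption-laden illustration and not a replacement for a full PCA tool, which would also return the remaining components. Features would normally be scaled first.

```python
from math import sqrt


def covariance_matrix(rows):
    """Sample covariance matrix of a list of equal-length feature rows."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def first_component(cov, iterations=200):
    """Approximate the leading eigenvector and eigenvalue by power iteration."""
    d = len(cov)
    v = [1.0 / (i + 1) for i in range(d)]  # arbitrary non-zero start vector
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = sqrt(sum(x * x for x in w))
        if norm == 0:
            break  # zero covariance: every feature is constant
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * cov[i][j] * v[j]
                     for i in range(d) for j in range(d))
    return v, eigenvalue


# Hypothetical data: features 0 and 1 move together, feature 2 is constant.
rows = [[1.0, 1.0, 5.0], [2.0, 2.0, 5.0], [3.0, 3.0, 5.0], [4.0, 4.0, 5.0]]

cov = covariance_matrix(rows)
component, eigenvalue = first_component(cov)
total_variance = sum(cov[i][i] for i in range(len(cov)))
explained = eigenvalue / total_variance  # share of variance on component 1
```

Here the constant feature gets a zero loading and the two correlated features share the component equally. Note that PCA ranks features by variance, not by how well they predict the class label, so its suggestions still need to be checked against classifier accuracy.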

3. Validate your results with your test set to compare the accuracy of your models for each classifier under different parameter settings or different transformation methods.
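
A validation run of this kind can be sketched with a small K Nearest Neighbor classifier evaluated on a held-out test set for two K values and two distance metrics. The data points are hypothetical, already scaled to [0, 1], and include one deliberately mislabeled training point so that the settings produce different accuracies.

```python
from collections import Counter


def euclidean(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train_X, train_y, query, k, distance):
    """Majority vote among the k training points closest to query."""
    neighbors = sorted(zip(train_X, train_y),
                       key=lambda pair: distance(pair[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of test records whose predicted label matches the true one."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical training set; the point [0.3, 0.3] is intentionally noisy.
train_X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [0.3, 0.3],
           [1.0, 1.0], [0.9, 0.8], [0.8, 0.9]]
train_y = [0, 0, 0, 1, 1, 1, 1]

# Held-out test set, never used for fitting.
test_X = [[0.15, 0.15], [0.28, 0.3], [0.85, 0.85], [0.7, 0.7]]
test_y = [0, 0, 1, 1]

metrics = {"euclidean": euclidean, "manhattan": manhattan}
results = {}
for k in (1, 3):
    for name, distance in metrics.items():
        predictions = [knn_predict(train_X, train_y, x, k, distance)
                       for x in test_X]
        results[(k, name)] = accuracy(test_y, predictions)
```

On this toy data K = 1 is misled by the noisy neighbor while K = 3 outvotes it, which is the kind of difference the discussion step asks you to explain.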

4. Discuss your results:
- Why the induced model differs for the same training data when you change the parameter values or the classifier.
- Why a classifier shows better accuracy than the others for a certain parameter setting or with a different transformation method.
- Any observations you made.

Feature Significance Analysis with PCA tools

Simple Experiment for Feature Selection Methodology

1. Simple Experiment for Feature Selection Methodology to choose the best feature set:

1-1. Pick the best model with the best parameter setting from Phases 1 and 2.

1-2. Apply your model with the best parameters to different input sets (created with different combinations of feature sets from your VTargetMail input file) to see if there are any significant differences in accuracy between feature sets.
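
Steps 1-1 and 1-2 can be sketched as a loop over feature combinations. The example below assumes a nearest-centroid classifier as a stand-in for "your best model", and three hypothetical feature columns (the third is noise). Replace both with your own model and the real VTargetMail columns.

```python
from itertools import combinations


def project(rows, columns):
    """Keep only the chosen column indices of every row."""
    return [[row[c] for c in columns] for row in rows]


def nearest_centroid_accuracy(train_X, train_y, test_X, test_y):
    """Fit one centroid per class, then score predictions on the test set."""
    centroids = {}
    for label in sorted(set(train_y)):
        members = [x for x, y in zip(train_X, train_y) if y == label]
        centroids[label] = [sum(col) / len(members) for col in zip(*members)]

    def predict(x):
        return min(centroids, key=lambda lab: sum(
            (a - b) ** 2 for a, b in zip(x, centroids[lab])))

    hits = sum(predict(x) == y for x, y in zip(test_X, test_y))
    return hits / len(test_y)


# Hypothetical, already scaled data; feature names are illustrative only.
feature_names = ["Age", "Income", "Noise"]
train_X = [[0.2, 0.1, 0.9], [0.3, 0.2, 0.7],
           [0.8, 0.9, 0.2], [0.7, 0.8, 0.4]]
train_y = [0, 0, 1, 1]
test_X = [[0.25, 0.15, 0.85], [0.75, 0.85, 0.9]]
test_y = [0, 1]

# Evaluate every non-empty combination of features with the same model.
results = {}
for size in range(1, len(feature_names) + 1):
    for cols in combinations(range(len(feature_names)), size):
        acc = nearest_centroid_accuracy(
            project(train_X, cols), train_y, project(test_X, cols), test_y)
        results[tuple(feature_names[c] for c in cols)] = acc
```

With many features the number of combinations grows exponentially, so in practice you would probably test a handful of hand-picked feature sets instead of all of them.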

Attachment:- Building a Prediction Model.rar
