Explain why when a model is fit to training data

Assignment Help Database Management System
Reference no: EM131923592

Problem

1. Using the concept of overfitting, explain why when a model is fit to training data, zero error with those data is not necessarily good.

2. In fitting a model to classify prospects as purchasers or non-purchasers, a certain company drew the training data from internal data that include demographic and purchase information. Future data to be classified will be lists purchased from other sources, with demographic (but not purchase) data included. It was found that "refund issued" was a useful predictor in the training data. Why is this not an appropriate variable to include in the model?

Reference no: EM131923592

Questions Cloud

Compute the gross margin ratio : Compute the gross margin ratio (both with and without services revenue) and net profit margin ratio. Compute the current ratio and acid-test ratio
Explore the data using data visualization capabilities of r : Explore the data using the data visualization capabilities of R. Which of the pairs among the variables seem to be correlated?
Which model are you more likely to consider for deployment : Two models are applied to a dataset that has been partitioned. Which model are you more likely to consider for final deployment?
How many records would you expect to be removed : A dataset has 1000 records and 50 variables with 5% of the values missing, spread randomly throughout. About how many records would you expect to be removed?
Explain why when a model is fit to training data : Using the concept of overfitting, explain why when a model is fit to training data, zero error with those data is not necessarily good.
Examine the data carefully and indicate what your next step : Consider the sample from bank database shown in Table 2.16; it was selected randomly from. Examine the data carefully and indicate what your next step would be.
Describe the roles assumed by validation and test partition : Describe the difference in roles assumed by the validation partition and the test partition. Comment on the likelihood that it was sampled randomly.
Identify whether the task required is supervised learning : Assuming that data mining techniques are to be used in the following cases, identify whether the task required is supervised or unsupervised learning.
Which of the following statements are true : Consider a new scenario (independent of Part 3). The Global Packaging Co. offers now an incremental discount of 0.5% off for those items ordered

Reviews

Write a Review

Database Management System Questions & Answers

  What is the selectivity of an equality predicate on key

CSC 553 Advanced Database Topics Assignment. Consider the relation r (Key, Name, Address). The relation takes 200 blocks on disk and holds 10000 tuples. What is the selectivity of an equality predicate on Key

  Provide example that is relevant to a college environment

Provide example that is relevant to a college environment that illustrates reasons for converting database tables to the First, Second, and Third Normal Forms.

  Discuss degree to which you believe visio diagram reflects

Discuss the degree to which you believe the Visio diagram reflects the database design.  Create the appropriate relationships between each entity within the diagram.

  Create a database to keep track of all the courses

The chairperson in the ISOM department at GMU needs to create a database to keep track of all the courses offered by the department.

  Implement a good information systems plan

Implementation of business analytics, an organization will also need to implement a good information systems plan in order to collect, manage, and organize all of the data.

  Discuss the relationship between colossal and core patterns

Discuss the relationship between colossal and core patterns. What is boosting? State why it may improve the accuracy of decision tree induction? Ensemble methods improve classification accuracy. How?

  Apply and consolidate skills acquired in the requirement

Develop a domain model for the car park system. Express your model with a class diagram, showing any inheritance and compositional relationships.

  Design a suitable database system

Design a suitable database system with a suitable web based front end, which should include the following details :The web based interface should hab=ve necessary forms and fields to update student attendence,marks ,faculty profile,faculty workshops ..

  Provide an efficient application development structure

Using specialization hierarchies can provide an efficient application development structure. Justify the use of surrogate primary keys for a database design.

  Er diagram of cardinality and modality

ER Diagram of cardinality and modality bank management system and discription of bank management system

  Explain what is meant by wear-leveling in flash drives

What is the capacity of a hard drive (in GB) consisting of 120,000 tracks, 4,000 sectors, and 4 surfaces? Assume each block has 512 bytes.

  Write an essay describing the use of an olap data cube

Write a 2 to 3 page essay describing the use of an OLAP Data Cube. Your essay should also describe the operations of Drill Down, Roll Up, Slice, and Dice.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd