Dimensionality reduction and feature selection

Reference no: EM133052109

1. Provide a brief description of what Principal Components Analysis (PCA) does. [Hint: See Appendix A and your lecture notes.] State what the input and the output of PCA are.

2. What's the difference between dimensionality reduction and feature selection?

3. Describe in detail 2 different techniques for feature selection.

4. Given a sample dataset (represented by a set of attributes, a correlation matrix, a covariance matrix, ...), apply feature selection techniques to select the best attributes to keep (or equivalently, the best attributes to remove).

5. What's the difference between feature selection and feature extraction?

6. Give two examples of data in which feature extraction would be useful.

7. Given a sample dataset, apply feature extraction.

8. What's data discretization and when is it needed?

9. What's the difference between supervised and unsupervised discretization?

10. Given a sample dataset, apply unsupervised (e.g., equal width, equal frequency) discretization, or supervised discretization (e.g., using entropy).

11. Describe 2 approaches to handle nominal attributes with too many values.

12. Given a dataset, apply variable transformation: either a simple given function, normalization, or standardization.

13. Define correlation and covariance, and explain how to use them in data pre-processing.
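
As a rough illustration of what questions 10 and 12 ask for, the sketch below applies equal-width and equal-frequency discretization, min-max normalization, and standardization to a small made-up list of ages. The function names and the sample values are assumptions chosen for illustration, not part of the assignment, and supervised (entropy-based) discretization is not covered here.

```python
from statistics import mean, pstdev


def equal_width_bins(values, k):
    """Assign each value to one of k bins of equal width over [min, max]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    if width == 0:
        # All values are identical, so everything falls into a single bin.
        return [0 for _ in values]
    # The maximum value would compute to index k, so clamp it into bin k - 1.
    return [min(int((v - lo) / width), k - 1) for v in values]


def equal_frequency_bins(values, k):
    """Assign each value to one of k bins holding roughly equal counts.

    Bins are assigned by rank, so tied values may end up in different bins.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * k // len(values)
    return bins


def min_max_normalize(values):
    """Rescale values linearly to [0, 1]; assumes max(values) > min(values)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def standardize(values):
    """Shift to mean 0 and scale to unit (population) standard deviation.

    Assumes the values are not all identical.
    """
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


# Hypothetical sample attribute, with one large value to show the contrast.
ages = [18, 22, 25, 31, 40, 47, 58, 90]

width_bins = equal_width_bins(ages, 3)
freq_bins = equal_frequency_bins(ages, 4)
normalized = min_max_normalize(ages)
z_scores = standardize(ages)

print(width_bins)  # [0, 0, 0, 0, 0, 1, 1, 2]
print(freq_bins)   # [0, 0, 1, 1, 2, 2, 3, 3]
```

With these sample values, the single large value (90) stretches the equal-width bins so that most points land in the first bin, while equal-frequency binning keeps two points per bin. That contrast is one reason to prefer equal-frequency binning for skewed attributes.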






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd