Reference no: EM13786135
Data Mining by Evolutionary Computation and Genetic Learning
Problem: Problem solving by evolutionary algorithm
Inductive learning is one of the most commonly used learning approaches that simulate human learning process, e.g., learning by examples or mistakes. The process of inductive learning in general requires two steps training and testing (or verification). During the training period, examples are provided to the learning system and let the system to build a model (or patterns) for the given examples. During the testing period, the model (or patterns) built from the training period are tested or verified for accuracy. This training and testing steps can be repeated until it reaches a satisfactory accuracy level before real application. During those steps, parameters and models can be adjusted or modified as necessary.
In order to build an accurate learning system with predictive power, researchers have proposed many different approaches in the past. In this assignment, we will use evolutionary approach as a learning system that can learn a valid mathematical expression for a given data set, which is also known as symbolic regression problem briefly discussed in class.
To successfully complete this assignment, perform the following activities:
(a) Research existing systems that use evolutionary computation approaches such as genetic programming, genetic algorithm, or others and has learning or symbolic regression capability, select one, and learn how to use the system or write your own system if you wish.
(b) For a given data set that consists of value-pairs, (xi, yi) in a text file called "train.txt", perform a symbolic regression utilizing the system's learning capability to produce or learn a model in the form of mathematical function, f(x) that represents the data set. The function set may consist of Fset = {+, -, ∗, /, sin(x)}. All constants are in the range of [0, 1] and the range of x is in [0, 100]. Use the following error function for evaluating a function f, with respect to a particular training case pi: Error(pi) = Σ|pi - oi|, where pi is the output from a learned program p on the ith case and oi is the output of the ith case in the test data set.
(c) Once you have your system running and get a result for a model for the training data set. Test the model with test data set, "test.txt" for accuracy using the error function specified in (b). Both data sets, "train.txt" and "test.txt" will be posted later when you are ready.
(d) If you didn't find a perfect model during the training process for the given test data set, "test.txt", improve its performance by modifying various system parameters such as cross over rates, mutation rates, population sizes, improving/modifying how new individuals are created in the initial population, or making other necessary modifications that you believe it to be useful.
(e) Write a brief report that summarizes your activities and results including at least (1) name and source of the evolutionary learning system used, and a brief description about the system, (2) parameter settings for the system, (3) your strategies to reduce errors, (4) the best function learned in standard form of math expression with error information, e.g., (+ x 1 (* y 3)) is NOT considered as a standard math expression, (5) a brief justification on why you think this is the best function, optionally (6) retrospective comments about the system used, evolutionary approaches in general, etc.
What is the firms labor hours productivity
: What is the firms labor hours productivity after the changes - what is the percent change in the multifactor productivity before and after the changes?
|
The diagram below shows three islands in flordia bay
: The diagram below shows three islands in Flordia Bay. YOu rent a boat and a plan to visit each of these remot islands. If you are on island B, on what bearing should you navigate to go to island C?
|
Somatosensory cortex in the perception of pleasure and pain
: A diagram and description of the cutaneous system. A diagram and description of the function of the somatosensory cortex
|
Research suggests
: Research suggests that (1) individuals differ in their level of values development (2) individuals hold different sets of instrumental values at different stages of development, and (3) peoples value priorities do not change once they become adults. ..
|
Data mining by evolutionary computation
: Data Mining by Evolutionary Computation and Genetic Learning, Write a brief report that summarizes your activities and results including at least (1) name and source of the evolutionary learning system used, and a brief description about the syste..
|
How are powers balanced in the us government
: What are the three branches of government and their functions? How are powers balanced in the U.S. government? How does each branch of government make laws? Provide examples
|
What is the total area of the sidewalk
: Suppose you have a square yard that is x feet on each side. Suppose you put a 2 1/2 foot sidewalk around the edge of this yard, reducing the area of the yard. What is the new area of the yard? What is the total area of the sidewalk?
|
Effective tearms-
: Effective tearms (1) function so well they create their own magnetism, (2) are interested in others success as well as their own, and (3) devalue members who don't work cohesively with the rest of the team. Which statements are correct?
|
The basis for the christian ritual of communion
: In the opening chapter of The Gospel of John, what term (or title) refers to Christ?
|