I would expect "crossover" to average out the parent function weightings, but how does the variation arise?

In DE there's a "blend" of mutation and crossover.

(Figure not shown: a view of DE's mutation scheme. In the picture Agent_j = Xa, Agent_k = Xb, Agent_i = Xc; crossover not shown.)

Traditional Evolutionary Strategies mutated vectors by adding zero-mean Gaussian noise to them but, to remain efficient, the mutation scheme had to adjust the standard deviation of the Gaussian noise to suit each parameter (this is also a function of time).

DE uses a vector differential Xa - Xb, in place of Gaussian noise, to "perturb" another vector Xc:

Xc_ = Xc + F * (Xa - Xb)

F (differential weight) is a user-supplied scaling factor in the [0.1, 1.2] range.

This form of mutation (with population-derived noise) ensures that the search space is efficiently searched in each dimension.

For example, if the population becomes compact in one dimension, but remains widely dispersed along another, the differentials sampled from it will be small in one dimension yet large in the other. Whatever the case, step sizes will be comparable to the breadth of the region they are being used to search, even as these regions expand and contract during the course of evolution.

(from Dr. Dobb's Journal #264, April 1997, pg. 22, "Differential Evolution" by K. Price and R. Storn)
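This scaling behaviour can be checked numerically. A minimal Python sketch, assuming a hypothetical 2-D population that is compact along dimension 0 and widely dispersed along dimension 1:

```python
import random

random.seed(42)

# Hypothetical population: tight along dimension 0, spread along dimension 1.
population = [(random.gauss(0.0, 0.01), random.gauss(0.0, 10.0))
              for _ in range(100)]

def sample_differential(pop):
    """Return the difference vector Xa - Xb of two distinct random members."""
    xa, xb = random.sample(pop, 2)
    return (xa[0] - xb[0], xa[1] - xb[1])

diffs = [sample_differential(population) for _ in range(1000)]
mean_step_0 = sum(abs(d[0]) for d in diffs) / len(diffs)
mean_step_1 = sum(abs(d[1]) for d in diffs) / len(diffs)

# Sampled steps are small where the population is compact and
# large where it is dispersed.
print(mean_step_0 < mean_step_1)   # True
```

No tuning is needed: the step sizes adapt automatically because they are drawn from the population itself.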

Xc_ (Agent_m in the pseudo-code) is used to produce a trial vector via crossover with a random individual (Xi below, Agent_b in the pseudo-code):

Xt = crossover(Xi, Xc_)

The crossover operator produces an offspring via a series of binomial experiments that determine which parent contributes each trial-vector parameter.

The best between Xt and Xi (Agent_child and Agent_b) survives.
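Putting mutation and crossover together, the construction of a single trial vector can be sketched in Python. CR (the crossover probability) is an assumed parameter not given in the text above:

```python
import random

random.seed(1)

F  = 0.8   # differential weight
CR = 0.9   # crossover probability (assumed value, not specified above)

def mutate(xc, xa, xb):
    """Xc_ = Xc + F * (Xa - Xb)"""
    return [c + F * (a - b) for c, a, b in zip(xc, xa, xb)]

def crossover(xi, xm):
    """Binomial crossover: each parameter comes from the mutant with
    probability CR; one randomly chosen index is always taken from the
    mutant, so the trial vector differs from Xi in at least one place."""
    forced = random.randrange(len(xi))
    return [m if (random.random() < CR or d == forced) else i
            for d, (i, m) in enumerate(zip(xi, xm))]

xa, xb, xc = [1.0, 2.0], [0.5, 1.0], [0.0, 0.0]
xi = [3.0, 3.0]

xm = mutate(xc, xa, xb)    # [0.4, 0.8]
xt = crossover(xi, xm)     # each parameter comes either from xm or from xi
```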




This is a good candidate for Differential Evolution.

DE is a very simple (but powerful) population-based, stochastic function minimizer/maximizer.

A key point for integrating DE in your scheme is the fitness function:

double fitness(Agent_k)
   fit = 0

   repeat M times
      randomly extract an individual Agent_i (i <> k)

      switch (result of Agent_k vs Agent_i)
         case Agent_k wins:   fit = fit + 1
         case Agent_i wins:   fit = fit - 2
         case draw:           fit doesn't change
 
   return fit

I.e., an agent plays against M opponents randomly selected from the population (with replacement, but avoiding self-matches).

The parameter M is important: large values give a precise measurement, small values could prevent convergence. I would start with M=5 (a value used in some chess-related experiments).
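A runnable sketch of this tournament fitness, with a hypothetical play() stub standing in for an actual match between two engines (in the real setting play() would run a game and report the outcome):

```python
import random

M = 5  # matches per fitness evaluation

def play(agent_a, agent_b):
    """Hypothetical stub for a real match; returns 1 if agent_a wins,
    -1 if it loses, 0 on a draw. Here the larger weight sum wins."""
    sa, sb = sum(agent_a), sum(agent_b)
    return (sa > sb) - (sa < sb)

def fitness(k, population, m=M):
    """Agent k plays m opponents sampled with replacement from the rest
    of the population: +1 per win, -2 per loss, draws don't count."""
    fit = 0
    opponents = population[:k] + population[k + 1:]   # no self-matches
    for _ in range(m):
        result = play(population[k], random.choice(opponents))
        if result == 1:
            fit += 1
        elif result == -1:
            fit -= 2
    return fit

pop = [[9.0], [1.0], [2.0]]
print(fitness(0, pop))   # 5: the strongest agent wins all M matches
print(fitness(1, pop))   # -10: the weakest agent loses all M matches
```

Note the asymmetric scoring (+1 / -2): losses are punished harder than wins are rewarded.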

A sketch of the search is:

Create a population of N agents with random weights

While (Brendan hasn't stopped the program and
       population diversity is greater than a given threshold)

   randomly select Agent_b

   randomly select Agent_i (i <> b)
   randomly select Agent_j (j <> i and j <> b)
   randomly select Agent_k (k <> j and k <> i and k <> b)

   Agent_m = Agent_i + F * (Agent_j - Agent_k)

   Agent_child = crossover(Agent_b, Agent_m)

   if (fitness(Agent_child) > fitness(Agent_b))
      replace Agent_b with Agent_child

This is a steady-state population approach (the alternative is a generational approach; further information in Essentials of Metaheuristics by Sean Luke).
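The whole steady-state loop can be sketched as follows. To keep the example self-contained, the tournament fitness is replaced by a toy objective (maximal at all-zero weights), and the stopping condition is a fixed iteration budget instead of Brendan's patience / a diversity threshold:

```python
import random

random.seed(7)

N, D, F, CR = 20, 4, 0.8, 0.9   # population size, dimensions, weight, crossover rate

def fitness(agent):
    # Toy stand-in for the tournament fitness: higher is better,
    # maximal when all weights are zero.
    return -sum(w * w for w in agent)

def crossover(target, mutant):
    forced = random.randrange(D)   # one index always taken from the mutant
    return [m if (random.random() < CR or d == forced) else t
            for d, (t, m) in enumerate(zip(target, mutant))]

population = [[random.uniform(-5.0, 5.0) for _ in range(D)] for _ in range(N)]
initial_best = max(map(fitness, population))

for _ in range(5000):                        # stand-in for the while condition
    b = random.randrange(N)                  # Agent_b
    i, j, k = random.sample([n for n in range(N) if n != b], 3)
    mutant = [population[i][d] + F * (population[j][d] - population[k][d])
              for d in range(D)]             # Agent_m
    child = crossover(population[b], mutant)
    if fitness(child) > fitness(population[b]):
        population[b] = child                # steady-state replacement

best = max(map(fitness, population))
```

In the chess setting fitness() would be the tournament function sketched earlier, and the loop would also track population diversity for the stopping test. Greedy replacement guarantees the best fitness in the population never decreases.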

For more details about how DE works, take a look at:

DE will drive the search towards promising areas of the feasible space (without performing a systematic search as in your code).

Anyway, this kind of optimization may easily require several days.

The official page of DE has implementations in many languages (also Java).
