resource: Evolutionary computing, A.E.Eiben


Outline

  1. What is Evolution Strategies
  2. Introductory Example
  3. Representation
  4. Mutation

1. What is Evolution Strategies (ES)

Evolution strategies(ES) is another member of the evolutionary algorithm family.

ES technical summary tableau

2. Introductory Example

2.1 Task

  minimimise an n-dimensional function: Rn -> R

2.2 Original algorithm

 “two-membered ES” using

  • Vectors from Rn directly as chromosomes
  • Population size 1
  • Only mutation creating one child
  • Greedy selection

2.3 pseudocde

outline of simple two-membered evolution strategy

------------------------------------------------------------

Set t = 0

Create initial point xt = 〈 x1t ,…,xnt

REPEAT UNTIL (TERMIN.COND satisfied) DO

  Draw zi from a normal distr. for all i = 1,…,n

  yit = xit + zi

  IF f(xt) < f(yt) THEN xt+1 = xt

  ELSE xt+1 = yt

  Set t = t+1

OD

------------------------------------------------------------

2.4 Explanation

As is shown on the pseudocode above, given a current solution xt in the form of a vector of length n, a new candidate xt+1 is created by adding a random number zfor i ∈ {1,...,n} to each of the n components.

The random number Zi:

A Gaussian, or normal, distribution is used with zero mean and standard deviation σ for drawing the random numbers --> Zi

The distribution:

  • This distribution is symmetric about zero
  • has the feature that the probability of drawing a random number with any given magnitude is a rapidly decreasing function of the standard deviation σ. (more information about Gaussian distribution)

The standard deviation σ:

Thus the σ value is a parameter of the algorithm that determines the extent to which given values xi are perturbed by the mutation operator.

For this reason σ is often called the mutation step size. Theoretical studies motivated an on-line adjustment of step sizes by the famous 1/5 success rule.

1/5 success rule:

This rule states that the ratio of successful mutaions (those in which the child is fitter than the parent) to all mutations should be 1/5.

  • If the ratio is greater than 1/5, the step size should be increased to make a wider search of the space.
  • If the ratio is less than 1/5 then it should be decreased to concentrate the search more around the current solution.

The rule is executed at periodic intervals.

For instance, after k(50 or 100) iterations each σ is reset by

  • σ = σ / c    if ps > 1/5
  • σ = σ • c    if ps < 1/5
  • σ = σ         if ps = 1/5

Where ps is the relative frequency of successful mutations measured over a number of trials, and the parameter c is in the range [0.817,1]

As is apparent, using this mechanism the step sizes change based on feedback from the search process.

ps = (successful mutations)/k

2.5 Conclusion

This example illuminiates some essential characteristics of evolution strategies:

  1. Evolution strategies are typically used for continuous parameter optimisation.
  2. There is a strong emphasis on mutation for creating offspring.
  3. Mutation is implemented by adding some random noise drawn from a Gaussian distribution.
  4. Mutation parameters are changed during a run of the algorithm

3. Representation

Chromosomes consist of three parts:

  • Object variables: x1,…,xn
  • Strategy parameters: 
    • Mutation step sizes: σ1,…,σnσ
    • Rotation angles: α1,…, αnα

Full size: 〈 x1,…,xn, σ1,…,σn1,…, αk 〉,where k = n(n-1)/2 (no. of i,j pairs) ---This is the general form of individuals in ES

Strategy parameters can be divided into two sets:

  • σ valuess
  • α values

The σ values represent the mutation step sizes, and their number nσ is usually either 1 or n. For any easonable self-adaptation mechanism at least one σ must be present.

The α values, which represent interactions between the step sizes used for different variables, are not always used. In the most general case their number nα = ( n - nα/2 )( nα - 1 ).

Putting this all together, we obtain:

〈 x1,…,xn, σ1,…,σnσ ,α1,…, αnα 〉

4. Mutation

4.1 Main mechanism

Changing value by adding random noise drawn from normal distribution

The mutation operator in ES is based on a normal (Gaussian) distribution requiring two parameters: the mean ξ and the standard deviation σ.

Mutations then are realised by adding some Δxi to each xi, where the Δxi values are randomly drawn using the given Gaussian N(ξ,σ), with the corresponding probability density function.

xi' = xi + N(0,σ)

xi' can be seen as a new xi.

N(0,σ) here denotes a random number drawn from a Gaussian distribution with zero mean and standard deviation σ.

4.2 Key ideas

  • σ is part of the chromosome 〈 x1,…,xn, σ 〉
  • σ is also mutated into σ ’ (see later how)
  • Self-adaption

4.3 A simplest case

In the simplest case we would have one step size that applied to all the components xi and candidate solutions of the form <x1, ..., xn, σ>.

Mutations are then realised by replacing <x1, ..., xn, σ> by <x1', ..., xn', σ'>,

where σ' is the mutated value of σ and xi' = xi + N(0,σ)

4.4 Mutate the value of σ

The mutation step sizes(σ) are not set by the user; rather the σ is coevolving with the solutions.

In order to achieve this behaviour:

  1. modify the value of σ first
  2. mutate the xi values with the new σ value.

The rationale behind this is that a new individual <x', σ'> is effectively evaluated twice:

  1. Primarily, it is evaluated directly  for its viability during survivor selection based on f(x').
  2. Second, it is evaluated for its ability to create good offspring.

This happens indirectly: a given step size (σ) evaluates favourably if the offspring generated by using it prove viable (in the first sense).

To sum up, an individual <x', σ'> represents both a good x' that survived selection and a good σ' that proved successful in generating this good x' from x.

4.5 Uncorrelated Mutation with One Step Size(σ)

In the case of uncorrelated mutation with one step size, the same distribution is used to mutate each xi, therefore we only have one strategy parameter σ in each individual.

This σ is mutated each time step by multiplying it by a term eΓ, with Γ a random variable drawn each time from a normal distribution with mean 0 and standard deviation τ.

Since N(0,τ) = τ•N(0,1), the mutation mechanism is thus specified by the following formulas:

  • σ' = σ•eτ•N(0,1)
  • xi' = xi + σ'•Ni(0,1)

Furthermore, since standard deviations very close to zero are unwanted(they will have on average a negligible effect), the following boundary rule is used to force step sizes to be no smaller than a threshold:

  • σ ’ < ε0 ⇒ σ ’ = ε0

Tips:

  • N(0,1) denotes a draw from the standard normal distribution
  • Ni(0,1) denotes a separate draw from the standard normal distribution for each variable i.

The proportionality constant τ is an external parameter to be set by the user.

It is usually inversely proportional to the square root of the problem size:

  • τ ∝ 1/ n½

The parameter τ can be interpreted as a kind of learning rate, as in neural networks.

In the Fig below, the effects of mutation are shown in two dimensions. That is, we have an objective function IR2 -> IR, and individuals are of the form <x,y,σ>. Since there is only one σ, the mutation step size is the same in each direction (x and y), and the points in the search space where the offspring can be placed with a given probability form a circle around the individual to be mutated.

Mutation with n=2, nσ = 1, nα = 0. Part of a fitness landscape with a conical shape is shown. The black dot indicates an individual. Points where the offspring can be placed with a given probability form a circle. The probability of moving along the y-axis(little effect on fitness) is the same as that of moving along the x-axis(large effect on fitness)

4.6 Uncorrelated Mutation with n Step Sizes

4.7 Correlated Mutations

Evolutionary Computing: 5. Evolutionary Strategies(1)的更多相关文章

  1. Evolutionary Computing: 5. Evolutionary Strategies(2)

    Resource: Introduction to Evolutionary Computing, A.E.Eliben Outline recombination parent selection ...

  2. Evolutionary Computing: 4. Review

    Resource:<Introduction to Evolutionary Computing> 1. What is an evolutionary algorithm? There ...

  3. Evolutionary Computing: 1. Introduction

    Outline 什么是进化算法 能够解决什么样的问题 进化算法的重要组成部分 八皇后问题(实例) 1. 什么是进化算法 遗传算法(GA)是模拟生物进化过程的计算模型,是自然遗传学与计算机科学相互结合的 ...

  4. Evolutionary Computing: [reading notes]On the Life-Long Learning Capabilities of a NELLI*: A Hyper-Heuristic Optimisation System

    resource: On the Life-Long Learning Capabilities of a NELLI*: A Hyper-Heuristic Optimisation System ...

  5. Evolutionary Computing: Assignments

    Assignment 1: TSP Travel Salesman Problem Assignment 2: TTP Travel Thief Problem The goal is to find ...

  6. Evolutionary Computing: multi-objective optimisation

    1. What is multi-objective optimisation [wikipedia]: Multi-objective optimization (also known as mul ...

  7. Evolutionary Computing: 3. Genetic Algorithm(2)

    承接上一章,接着写Genetic Algorithm. 本章主要写排列表达(permutation representations) 开始先引一个具体的例子来进行表述 Outline 问题描述 排列表 ...

  8. Evolutionary Computing: 2. Genetic Algorithm(1)

    本篇博文讲述基因算法(Genetic Algorithm),基因算法是最著名的进化算法. 内容依然来自博主的听课记录和教授的PPT. Outline 简单基因算法 个体表达 变异 重组 选择重组还是变 ...

  9. 不就ideas嘛,谁没有!

    20160214 survey of current RDF triple storage systems survey of semantic web stack inference mechani ...

随机推荐

  1. 论文阅读(Xiang Bai——【CVPR2015】Symmetry-Based Text Line Detection in Natural Scenes)

    Xiang Bai--[CVPR2015]Symmetry-Based Text Line Detection in Natural Scenes 目录 作者和相关链接 方法概括 创新点和贡献 方法细 ...

  2. [原创]Visual Studio 使用 Just My Code引起无法断点

    今天遇到的问题,同样的代码,在一台机器上用Release配置可以命中断点,在另一台上用Release断点就都失效了.后来发现是因为断点失效的机器上设置了Just My Code.在Debug-Opti ...

  3. linux 使用/proc文件系统 实现用户空间与内核模块之间通信

    项目中可能会用到用户态和内核模块之间进行通信的功能.想到linux系统本身很多通信都是通过/proc文件系统来的,比如修改网络中连接跟踪表连接数限制/proc/sys/net/netfilter/nf ...

  4. 相机拍的图,电脑上画的图,word里的文字,电脑屏幕,手机屏幕,相机屏幕显示大小一切的一切都搞明白了!

    相机拍的图,电脑上画的图,word里的文字,电脑屏幕,手机屏幕,相机屏幕显示大小一切的一切都搞明白了! 先说图片X×dpi=点数dotX是图片实际尺寸,简单点,我们只算图片的高吧,比如说拍了张图片14 ...

  5. JSON对象长度和遍历方法

    摘自博客 (http://caibaojian.com/json-length.html) 原文链接:http://caibaojian.com/json-length.html JSON数组有长度j ...

  6. Linq解析带命名空间、前缀、Soap格式的XML

    关于XML,经常会用到,XML有一般的,同样也有二般的,更不要觉得会操作基础的XML就觉得自己已经精通XML操作,文中是对解析XML的方法进行介绍 1. 一般XML <?xml version= ...

  7. Java数据结构之对称矩阵的压缩算法---

    特殊矩阵 特殊矩阵是指这样一类矩阵,其中有许多值相同的元素或有许多零元素,且值相同的元素或零元素的分布有一定规律.一般采用二维数组来存储矩阵元素.但是,对于特殊矩阵,可以通过找出矩阵中所有值相同元素的 ...

  8. Linux中profile、bashrc、bash_profile之间的区别和联系

    /etc/profile:此文件为系统的每个用户设置环境信息,当用户第一次登录时,该文件被执行.并从/etc/profile.d目录的配置文件中搜集shell的设置. 英文描述为: # /etc/pr ...

  9. [Linux] vimdiff 快速比较和合并少量文件

    纯文本文件比较和合并工具一直是软件开发过程中比较重要的组成部分,vimdiff 能够在比较出来的多处差异之间快速定位,很容易的进行文件合并操作.在需要快速比较和合并少量文件的时候,vimdiff是很好 ...

  10. github在liunx上的搭建

    清屏:ctrl+l 1 在linux下安装git yum -y install git 查看版本 git --version 4 设置git的用户名和邮箱地址 git config --global ...