A key part of getting transitions to happen when we want them to is the design of reasonable cost functions. We want to penalize and reward the right things. I am going to work through an example of one way you might think about designing a cost function.

Let's consider how we would design a cost function for vehicle speed. On one hand, we want to get to our destination quickly, but on the other hand, we do not want to break the law. An essential quantity we have to control is the desired velocity of the car.

Some velocities are more beneficial than others, and some are even illegal. Let's fill in this graph and try to assign a cost to every velocity.

For the sake of simplicity, let's assume that all of the cost functions will have an output between zero and one.

We will adjust the importance of each cost function later by adjusting the weights.
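As a rough illustration of how weights could set the importance of each cost function, here is a minimal sketch. It assumes the individual costs are combined as a weighted sum, which the transcript does not state explicitly; the function name `total_cost` and its arguments are hypothetical.

```python
def total_cost(costs, weights):
    """Combine individual cost values into one number.

    Assumption: the combination is a plain weighted sum. Each entry of
    `costs` is expected to lie in [0, 1]; `weights` sets how much each
    cost function matters relative to the others.
    """
    if len(costs) != len(weights):
        raise ValueError("costs and weights must have the same length")
    return sum(w * c for w, c in zip(weights, costs))


# Hypothetical example: a speed cost of 0.5 with weight 2.0 and a
# second cost of 1.0 with weight 3.0.
combined = total_cost([0.5, 1.0], [2.0, 3.0])
```

Because every cost function stays between zero and one, the weights alone decide which penalty dominates, and tuning behavior means tuning the weights.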

Let's say the speed limit for the road we are on is here. Well, we know that if we are going well above the speed limit, that should be maximum cost.

And maybe we want to set an ideal zero-cost speed that is slightly below the speed limit so that we have some buffer.

And then we can think about how much we want to penalize not moving at all. Obviously, not moving is bad, but maybe not as bad as breaking the speed limit, so we would put it here.

To keep it simple, we could just say there is a linear cost between zero and the target speed.

And since breaking the law is a binary thing, let's just say any speed greater than or equal to the speed limit has maximal cost. And again, we can arbitrarily connect these points with a linear function and a flat maximum cost for anything above the speed limit.

Now, in practice, we might actually want to parametrize some of these quantities so that we could later adjust them until we got the right behavior.

So first, we might define a parameter called stop cost for the zero-velocity case and a parameter called buffer velocity, which would probably be a few miles per hour.

Then, our overall cost function has three domains. If we are going less than the target speed, the cost falls linearly from the stop cost at zero velocity to zero at the target speed.

If we are above the speed limit, the cost is just one. And if we are in between, the cost rises linearly from zero at the target speed to one at the speed limit.
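The three domains can be sketched in code as follows. This is one possible reading of the transcript, not the lecture's own implementation: it assumes the target speed is the speed limit minus the buffer velocity, and that both sloped segments are straight lines. The function name `speed_cost` is hypothetical; `stop_cost` and `buffer_velocity` correspond to the two parameters named above.

```python
def speed_cost(velocity, speed_limit, buffer_velocity, stop_cost):
    """Piecewise-linear speed cost with an output in [0, 1].

    Assumptions (not spelled out in the transcript):
      * target_speed = speed_limit - buffer_velocity
      * velocity is non-negative
      * 0 < buffer_velocity < speed_limit and 0 <= stop_cost <= 1
    """
    if not 0 < buffer_velocity < speed_limit:
        raise ValueError("need 0 < buffer_velocity < speed_limit")
    if velocity < 0:
        raise ValueError("velocity must be non-negative")

    target_speed = speed_limit - buffer_velocity

    if velocity < target_speed:
        # Below target: stop_cost at standstill, falling to 0 at target.
        return stop_cost * (target_speed - velocity) / target_speed
    if velocity < speed_limit:
        # Inside the buffer: rising from 0 at target to 1 at the limit.
        return (velocity - target_speed) / buffer_velocity
    # At or above the speed limit: maximum cost.
    return 1.0


# Hypothetical numbers: a limit of 30, a buffer of 5 and a stop cost of 0.8.
for v in (0.0, 12.5, 25.0, 27.5, 30.0, 40.0):
    print(v, speed_cost(v, speed_limit=30.0, buffer_velocity=5.0, stop_cost=0.8))
```

Keeping `stop_cost` and `buffer_velocity` as arguments rather than constants makes it easy to adjust them later until the vehicle behaves as intended.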
