Deep and Beautiful. The Reward Prediction Error Hypothesis of Dopamine

郑重声明：原文参见标题，如有侵权，请联系作者，将会撤销发布！

Contents:

Abstract

1. Introduction

2. Reward-Prediction Error Meets Dopamine

3. Reward-Prediction Error and Incentive Salience: What Do They Explain?

4. Explanatory Depth, Reward-Prediction Error and Incentive Salience

　　4.1. Depth as scope, reward-prediction error and incentive salience

　　4.2. Depth as invariance, reward-prediction error and incentive salience

5. Conclusion

Abstract

　　根据多巴胺的奖励预测误差假设（RPEH），中脑多巴胺能神经元的相位活动表示特定事件的预测奖励与当前经历的奖励之间存在差异。可以说这个假设是深刻，优雅和美丽的，代表了计算神经科学的最大成功之一。本文研究了这种说法，为现有文献做出了两点贡献。首先，它对公式化定义RPEH和随后获得成功的主要步骤进行了全面的历史描述。其次，根据这一历史记录，它解释了RPEH在哪种意义上具有解释性，在何种情况下可以合理地认为它比多巴胺的刺激显著性假设更深远，多巴胺可以说是目前RPEH最重要的替代方案。

Keywords: 多巴胺（Dopamine）；奖励预测误差（Reward-Prediction Error）；解释深度（Explanatory Depth）；刺激显著性（Incentive Salience）；强化学习（Reinforcement Learning）

1. Introduction

2. Reward-Prediction Error Meets Dopamine

3. Reward-Prediction Error and Incentive Salience: What Do They Explain?

4. Explanatory Depth, Reward-Prediction Error and Incentive Salience

4.1. Depth as scope, reward-prediction error and incentive salience

4.2. Depth as invariance, reward-prediction error and incentive salience

5. Conclusion

Deep and Beautiful. The Reward Prediction Error Hypothesis of Dopamine的更多相关文章

Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis
郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! Abstract 在中脑多巴胺能神经元的研究中取得了许多最新进展.要了解这些进步以及它们之间的相互关系,需要对作为解释框架并指导正在进行的 ...
【转载】准人工智能分享Deep Mind报告 ——AI“元强化学习”
原文地址: https://www.sohu.com/a/231895305_200424 ------------------------------------------------------ ...
Curiosity-Driven Learning through Next State Prediction
Curiosity-Driven Learning through Next State Prediction 2019-10-19 20:43:17 This paper is from: http ...
【深度学习Deep Learning】资料大全
最近在学深度学习相关的东西,在网上搜集到了一些不错的资料,现在汇总一下: Free Online Books by Yoshua Bengio, Ian Goodfellow and Aaron C ...
（转）The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)
Adit Deshpande CS Undergrad at UCLA ('19) Blog About The 9 Deep Learning Papers You Need To Know Abo ...
Applied Deep Learning Resources
Applied Deep Learning Resources A collection of research articles, blog posts, slides and code snipp ...
On Explainability of Deep Neural Networks
On Explainability of Deep Neural Networks « Learning F# Functional Data Structures and Algorithms is ...
机器学习(Machine Learning)&深度学习(Deep Learning)资料(Chapter 2)
##机器学习(Machine Learning)&深度学习(Deep Learning)资料(Chapter 2)---#####注:机器学习资料[篇目一](https://github.co ...
Deep learning_CNN_Review：A Survey of the Recent Architectures of Deep Convolutional Neural Networks——2019
CNN综述文章的翻译 [2019 CVPR] A Survey of the Recent Architectures of Deep Convolutional Neural Networks 翻 ...

随机推荐

LQB2013A05前缀判断
上一道题,,,把if条件写错了,,,,找了半天的bug我都快哭了, 好了好了看见这种填空题,先理解题意然后把代码copy下来,把空格注释掉,然后运行到编译没有错．再理一下它的思路 // // C ...
leetcode 翻转字符串
https://leetcode-cn.com/problems/reverse-words-in-a-string/ TLE代码: class Solution { public: string r ...
史蒂夫-乔布斯(Steve Jobs)斯坦福大学演讲稿(中英对照)
这是苹果公司和Pixar动画工作室的CEO Steve Jobs于2005年6月12号在斯坦福大学的毕业典礼上面的演讲稿. Thank you. I'm honored to be with you ...
Debug HashMap
目录 1,HashMap面试必问 2,Debug源码的心得体会 3,JDK 1.7 3.1 用debug分析一个元素是如何加入到HashMap中的[jdk1.7] 3.2 用debug分析HashMa ...
PHP array_fill() 函数
------------恢复内容开始------------ 实例用给定的键值填充数组: <?php$a1=array_fill(3,4,"blue");print_r($ ...
PHP date_get_last_errors() 函数
------------恢复内容开始------------ 实例返回解析日期字符串时的警告和错误: <?phpdate_create("gyuiyiuyui%&&/ ...
PHP fileatime() 函数
定义和用法 fileatime() 函数返回指定文件的上次访问时间. 如果成功,该函数将以 Unix 时间戳形式返回文件的上次访问时间.如果失败,则返回 FALSE. 语法 fileatime(fil ...
PHP xml_set_notation_decl_handler() 函数
定义和用法 xml_set_notation_decl_handler() 函数规定当解析器在 XML 文档中找到符号声明时被调用的函数. 如果成功,该函数则返回 TRUE.如果失败,则返回 FALS ...
CF R631 div2 1330 E Drazil Likes Heap
LINK:Drazil Likes Heap 那天打CF的时候开场A读不懂题 B码了30min才过(当时我怀疑B我写的过于繁琐了. C比B简单多了随便yy了一个构造发现是对的.D也超级简单 dp了 ...
5073 [Lydsy1710月赛]小A的咒语
LINK:[Lydsy1710月赛]小A的咒语每次给定两个串要求从a串中选出x段拼成B串能否做到.T组数据. \(n\leq 100000,m\leq 100000,T\leq 10,x\leq ...

Deep and Beautiful. The Reward Prediction Error Hypothesis of Dopamine

Deep and Beautiful. The Reward Prediction Error Hypothesis of Dopamine的更多相关文章

随机推荐

热门专题