Reinforcement learning in populations of spiking neurons

【Reinforcement learning in populations of spiking neurons】的更多相关文章

Reinforcement learning in populations of spiking neurons

郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! Abstract 尽管存在神经元变异性,但是群体编码被广泛认为是实现可靠行为响应的重要机制.但是,随着全局奖励信号与任何单独神经元的性能越来越不相关,标准的强化学习随着群体规模的增长而减慢.我们发现,如果除了全局奖励之外,有关群体响应的反馈能够调节突触可塑性,则学习会随着群体规模的增加而加快. 已经深入研究了神经元群体在编码感觉刺激中的作用1,2.但是,大多数带有脉冲神经元的强化学习模型都只关注单独神经元或小的神经元组件3-6.此外,…

Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game

郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! arXiv:1903.11012v3 [cs.LG] 19 Aug 2019 Neural Networks, 25 November 2019 Abstract 深度强化学习(RL)在可以通过训练过的策略解决的任务上表现了出色的性能.在使用多层神经网络(NN)的前沿机器学习方法中,它起着主导作用.同时,深度RL要求对噪声的高灵敏度,不完整和误导输入数据.遵循生物学直觉,我们将使用脉冲神经网络(SNN)来解决深度RL解决方案的一些不足…

Training spiking neural networks for reinforcement learning

郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! 原文链接:https://arxiv.org/pdf/2005.05941.pdf Contents: Abstract Introduction 1 Reinforcement learning with a network of spiking agents 2 Related Work 2.0.1 Hedonism 2.0.2 Learning by reinforcement in spiking neural network…

A review of learning in biologically plausible spiking neural networks

郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! Contents: ABSTRACT 1. Introduction 2. Biological background 2.1. Spiking neuron models 2.2. Synaptic plasticity 2.2.1. Unsupervised learning 2.2.2. Supervised learning 2.2.3. Reinforcement learning 2.2.4. Delay learning…

(转) Deep Reinforcement Learning: Pong from Pixels

Andrej Karpathy blog About Hacker's guide to Neural Networks Deep Reinforcement Learning: Pong from Pixels May 31, 2016 This is a long overdue blog post on Reinforcement Learning (RL). RL is hot! You may have noticed that computers can now automatica…

(zhuan) Paper Collection of Multi-Agent Reinforcement Learning (MARL)

this blog from: https://github.com/LantaoYu/MARL-Papers Paper Collection of Multi-Agent Reinforcement Learning (MARL) This is a collection of research and review papers of multi-agent reinforcement learning (MARL). The sharing principle of these refe…

Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis

郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! Abstract 在中脑多巴胺能神经元的研究中取得了许多最新进展.要了解这些进步以及它们之间的相互关系,需要对作为解释框架并指导正在进行的实验探究的计算模型有深刻的理解.现在,理论和实验的这种相互交织非常清楚地表明,中脑多巴胺神经元的阶段性活动为突触改变提供了一个整体机制.这些突触改变反过来又为特定类别的强化学习机制提供了机械基础,而强化学习机制现在似乎已成为人类和动物行为的基础.这篇综述既描述了该结论的关键经验性发现,也描述了得出此…

Machine Learning Algorithms Study Notes(5)—Reinforcement Learning

Reinforcement Learning 对于控制决策问题的解决思路:设计一个回报函数(reward function),如果learning agent(如上面的四足机器人.象棋AI程序)在决定一步后,获得了较好的结果,那么我们给agent一些回报(比如回报函数结果为正),得到较差的结果,那么回报函数为负.比如,四足机器人,如果他向前走了一步(接近目标),那么回报函数为正,后退为负.如果我们能够对每一步进行评价,得到相应的回报函数,那么就好办了,我们只需要找到一条回报值最大的路径(每步的回…

(转) Playing FPS games with deep reinforcement learning

Playing FPS games with deep reinforcement learning 博文转自:https://blog.acolyer.org/2016/11/23/playing-fps-games-with-deep-reinforcement-learning/ When I wrote up 'Asynchronous methods for deep learning' last month, I made a throwaway remark that after…

(zhuan) Deep Reinforcement Learning Papers

Deep Reinforcement Learning Papers A list of recent papers regarding deep reinforcement learning. The papers are organized based on manually-defined bookmarks. They are sorted by time to see the recent papers first. Any suggestions and pull requests…