[Implementing a Convolutional Neural Network in Python] Backpropagation in the Conv2D Layer

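Code source: https://github.com/eriklindernoren/ML-From-Scratch

Implementation of the Conv2D convolutional layer (with stride and padding): https://www.cnblogs.com/xiximayou/p/12706576.html

Implementation of the activation functions (sigmoid, softmax, tanh, relu, leakyrelu, elu, selu, softplus): https://www.cnblogs.com/xiximayou/p/12713081.html

Loss function definitions (mean squared error, cross-entropy loss): https://www.cnblogs.com/xiximayou/p/12713198.html

Implementation of the optimizers (SGD, Nesterov, Adagrad, Adadelta, RMSprop, Adam): https://www.cnblogs.com/xiximayou/p/12713594.html

This section continues working through the code and looks at how backpropagation is carried out in the convolutional layer.

Only the forward-pass and backward-pass code of Conv2D is shown here: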

    def forward_pass(self, X, training=True):
        batch_size, channels, height, width = X.shape
        self.layer_input = X
        # Turn image shape into column shape
        # (enables dot product between input and weights)
        self.X_col = image_to_column(X, self.filter_shape, stride=self.stride, output_shape=self.padding)
        # Turn weights into column shape
        self.W_col = self.W.reshape((self.n_filters, -1))
        # Calculate output
        output = self.W_col.dot(self.X_col) + self.w0
        # Reshape into (n_filters, out_height, out_width, batch_size)
        output = output.reshape(self.output_shape() + (batch_size, ))
        # Redistribute axes so that batch size comes first
        return output.transpose(3, 0, 1, 2)

    def backward_pass(self, accum_grad):
        # Reshape accumulated gradient into column shape
        accum_grad = accum_grad.transpose(1, 2, 3, 0).reshape(self.n_filters, -1)
        if self.trainable:
            # Take dot product between column shaped accum. gradient and column shape
            # layer input to determine the gradient at the layer with respect to layer weights
            grad_w = accum_grad.dot(self.X_col.T).reshape(self.W.shape)
            # The gradient with respect to bias terms is the sum, similarly to the Dense layer
            grad_w0 = np.sum(accum_grad, axis=1, keepdims=True)
            # Update the layer's weights
            self.W = self.W_opt.update(self.W, grad_w)
            self.w0 = self.w0_opt.update(self.w0, grad_w0)
        # Recalculate the gradient which will be propagated back to the previous layer
        accum_grad = self.W_col.T.dot(accum_grad)
        # Reshape from column shape to image shape
        accum_grad = column_to_image(accum_grad,
                                self.layer_input.shape,
                                self.filter_shape,
                                stride=self.stride,
                                output_shape=self.padding)
        return accum_grad
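The three matrix products in backward_pass() can be checked numerically. The following is a minimal sketch, not part of the original repository: the sizes, the random data, and the helper numerical_grad are assumptions made for illustration. It first confirms that transpose(1, 2, 3, 0).reshape(n_filters, -1) undoes the reshape and transpose at the end of forward_pass(), then compares the analytic gradients with central differences.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 3 filters, 8 values per patch (channels * kh * kw),
# a 2x2 output map and a batch of 3, i.e. 12 patch positions in total.
n_filters, patch_size = 3, 8
out_h, out_w, batch_size = 2, 2, 3
n_positions = out_h * out_w * batch_size

W_col = rng.standard_normal((n_filters, patch_size))
X_col = rng.standard_normal((patch_size, n_positions))
w0 = rng.standard_normal((n_filters, 1))

# Forward: column output -> (batch, n_filters, out_h, out_w), as in forward_pass
out_col = W_col.dot(X_col) + w0
output = out_col.reshape(n_filters, out_h, out_w, batch_size).transpose(3, 0, 1, 2)

# Backward reshape: the inverse permutation recovers the column layout
round_trip = output.transpose(1, 2, 3, 0).reshape(n_filters, -1)

# Upstream gradient dL/d(output), already in column shape
accum_grad = rng.standard_normal((n_filters, n_positions))

def loss():
    # Scalar loss whose gradient w.r.t. the column output is exactly accum_grad
    return np.sum((W_col.dot(X_col) + w0) * accum_grad)

# Analytic gradients, same formulas as in backward_pass
grad_w = accum_grad.dot(X_col.T)
grad_w0 = np.sum(accum_grad, axis=1, keepdims=True)
grad_x_col = W_col.T.dot(accum_grad)

def numerical_grad(f, arr, eps=1e-6):
    """Central-difference gradient of the scalar f() with respect to arr (modified in place)."""
    grad = np.zeros_like(arr)
    for idx in np.ndindex(arr.shape):
        old = arr[idx]
        arr[idx] = old + eps
        plus = f()
        arr[idx] = old - eps
        minus = f()
        arr[idx] = old  # restore the original value
        grad[idx] = (plus - minus) / (2 * eps)
    return grad

num_w = numerical_grad(loss, W_col)
num_w0 = numerical_grad(loss, w0)
num_x = numerical_grad(loss, X_col)

print(np.array_equal(round_trip, out_col))
print(np.allclose(grad_w, num_w, atol=1e-5))
print(np.allclose(grad_w0, num_w0, atol=1e-5))
print(np.allclose(grad_x_col, num_x, atol=1e-5))
```

Because the output is linear in each of W_col, w0 and X_col, the central differences agree with the analytic formulas up to floating-point error.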

The network itself is defined in neural_network.py, where a single training step looks like this:

    def train_on_batch(self, X, y):
        """ Single gradient update over one batch of samples """
        y_pred = self._forward_pass(X)
        loss = np.mean(self.loss_function.loss(y, y_pred))
        acc = self.loss_function.acc(y, y_pred)
        # Calculate the gradient of the loss function wrt y_pred
        loss_grad = self.loss_function.gradient(y, y_pred)
        # Backpropagate. Update weights
        self._backward_pass(loss_grad=loss_grad)
        return loss, acc

We also need to look at self._forward_pass and self._backward_pass:

    def _forward_pass(self, X, training=True):
        """ Calculate the output of the NN """
        layer_output = X
        for layer in self.layers:
            layer_output = layer.forward_pass(layer_output, training)
        return layer_output

    def _backward_pass(self, loss_grad):
        """ Propagate the gradient 'backwards' and update the weights in each layer """
        for layer in reversed(self.layers):
            loss_grad = layer.backward_pass(loss_grad)

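As we can see, the forward pass computes the output of every layer in self.layers in turn, including convolution, pooling, activation and normalization layers. The backward pass then walks through the layers from last to first, computing each layer's gradient and updating its parameters.

Take a network made of a convolutional layer, a fully connected layer and a loss function as an example. Once the forward pass has finished, the first gradient obtained is the gradient of the loss function. It is passed to the fully connected layer, which computes its own gradient and passes it on to the convolutional layer by calling that layer's backward_pass() method. Inside the convolutional layer's backward_pass(), if self.trainable is set, the gradients with respect to the weights W and the bias term w0 are computed and the parameters are updated with the optimizers W_opt and w0_opt. The gradient with respect to the previous layer is computed afterwards. The final step is a call to column_to_image().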
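The flow described above can be sketched with a toy layer. ScaleLayer below is hypothetical and not part of ML-From-Scratch; it only mimics the forward_pass()/backward_pass() interface, uses a plain SGD step in place of the optimizer objects, and stands in for real layers so that the numbers can be followed by hand.

```python
import numpy as np

class ScaleLayer:
    """Hypothetical toy layer: y = w * x with a single scalar weight w."""

    def __init__(self, w, learning_rate=0.1, trainable=True):
        self.w = w
        self.learning_rate = learning_rate
        self.trainable = trainable

    def forward_pass(self, X, training=True):
        self.layer_input = X  # cached for the backward pass
        return self.w * X

    def backward_pass(self, accum_grad):
        # The gradient for the previous layer must use the weight
        # that was used in the forward pass, so keep it before updating.
        w_forward = self.w
        if self.trainable:
            grad_w = np.sum(accum_grad * self.layer_input)
            self.w = self.w - self.learning_rate * grad_w  # plain SGD step
        return w_forward * accum_grad

layers = [ScaleLayer(2.0), ScaleLayer(3.0, trainable=False)]
X = np.array([1.0, 2.0])
y = np.array([0.0, 0.0])

# Forward: each layer consumes the previous layer's output
out = X
for layer in layers:
    out = layer.forward_pass(out)

# Gradient of the loss 0.5 * sum((out - y) ** 2) with respect to out
loss_grad = out - y

# Backward: last layer first, each layer returns the gradient for the one before it
for layer in reversed(layers):
    loss_grad = layer.backward_pass(loss_grad)

print(out)          # [ 6. 12.]
print(loss_grad)    # [36. 72.]
print(layers[0].w)  # -7.0
print(layers[1].w)  # 3.0 (not trainable, so unchanged)
```

In the Conv2D code, self.W_col appears to play the role of w_forward: it is created in forward_pass() and is still used for the input gradient after self.W has been reassigned by the optimizer.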

import numpy as np

def column_to_image(cols, images_shape, filter_shape, stride, output_shape='same'):
    batch_size, channels, height, width = images_shape
    pad_h, pad_w = determine_padding(filter_shape, output_shape)
    height_padded = height + np.sum(pad_h)
    width_padded = width + np.sum(pad_w)
    # Start from zeros: np.add.at below accumulates into this buffer,
    # so it must not contain uninitialized memory (as np.empty would give)
    images_padded = np.zeros((batch_size, channels, height_padded, width_padded))
    # Calculate the indices where the dot products are applied between weights
    # and the image
    k, i, j = get_im2col_indices(images_shape, filter_shape, (pad_h, pad_w), stride)
    cols = cols.reshape(channels * np.prod(filter_shape), -1, batch_size)
    cols = cols.transpose(2, 0, 1)
    # Add column content to the images at the indices
    np.add.at(images_padded, (slice(None), k, i, j), cols)
    # Return image without padding
    return images_padded[:, :, pad_h[0]:height+pad_h[0], pad_w[0]:width+pad_w[0]]

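This function reverses the reshaping done earlier by image_to_column(), which was applied to make the convolution easy to compute, and puts the column values back into the images_padded layout. Positions covered by several filter windows receive the sum of all their column entries through np.add.at, and the padding is cut off before the result is returned.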
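A one-dimensional sketch shows what the accumulation does. The index array idx below is a hand-built stand-in for what get_im2col_indices() produces in the real code; the sizes are assumptions chosen for illustration.

```python
import numpy as np

# A 1-D "image" of length 5, filter width 3, stride 1 -> 3 overlapping patches
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
filter_width, stride = 3, 1
n_patches = (len(x) - filter_width) // stride + 1

# idx[r, c] = position in x that row r of patch (column) c reads from
idx = np.arange(filter_width)[:, None] + stride * np.arange(n_patches)[None, :]

# "image_to_column": gather every patch into a column
cols = x[idx]

# "column_to_image": scatter the columns back, summing where patches overlap.
# The buffer starts at zero because np.add.at adds to whatever is already there.
restored = np.zeros_like(x)
np.add.at(restored, idx, cols)

# Number of patches that cover each position
counts = np.zeros_like(x)
np.add.at(counts, idx, 1.0)

print(cols)      # columns are the patches [1 2 3], [2 3 4], [3 4 5]
print(restored)  # [1. 4. 9. 8. 5.]
print(counts)    # [1. 2. 3. 2. 1.]
```

The result is x multiplied by the coverage count, not x itself. In the backward pass this summation is the desired behavior: an input pixel that contributed to several output positions receives the sum of the gradients from all of them.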

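Shape transformations like these are a real headache, and they involve all sorts of NumPy functions that have to be looked up along the way. Understanding the overall flow is enough, and it deepens one's grasp of the underlying concepts.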
