TensorFlow 训练MNIST数据集（2）—

　　在我的上一篇随笔中，采用了单层神经网络来对MNIST进行训练，在测试集中只有约90%的正确率。这次换一种神经网络（多层神经网络）来进行训练和测试。

1、获取MNIST数据

　　MNIST数据集只要一行代码就可以获取的到，非常方便。关于MNIST的基本信息可以参考我的上一篇随笔。

mnist = input_data.read_data_sets('./data/mnist', one_hot=True)

2、模型基本结构

　　本次采用的训练模型为三层神经网络结构，输入层节点数与MNIST一行数据的长度一致，为784；输出层节点数与数字的类别数一致，为10；隐藏层节点数为50个；每次训练的mini-batch数量为64,；最大训练周期为50000。

 inputSize  = 784

 outputSize = 10

 hiddenSize = 50

 batchSize  = 64

 trainCycle = 50000

3、输入层

　　输入层用于接收每次小批量样本的输入，先通过placeholder来进行占位，在训练时才传入具体的数据。值得注意的是，在生成输入层的tensor时，传入的shape中有一个‘None’，表示每次输入的样本的数量，该‘None’表示先不作具体的指定，在真正输入的时候再根据实际的数据来进行推断。这个很方便，但也是有条件的，也就是通过该方法返回的tensor不能使用简单的加（+）减（-）乘（*）除（/）符号来进行计算（否则将会报错），需要用TensorFlow中的相关函数来进行代替。

inputLayer = tf.placeholder(tf.float32, shape=[None, inputSize])

4、隐藏层

　　在神经网络中，隐藏层的作用主要是提取数据的特征（feature）。这里的权重参数采用了 tensorflow.truncated_normal() 函数来进行生成，与上次采用的 tensorflow.

random_normal() 不一样。这两者的作用都是生成指定形状、期望和标准差的符合正太分布随机变量。区别是 truncated_normal 函数对随机变量的范围有个限制（与期望的偏差在2个标准差之内，否则丢弃）。另外偏差项这里也使用了变量的形式，也可以采用常量来进行替代。

　　激活函数为sigmoid函数。

 hiddenWeight = tf.Variable(tf.truncated_normal([inputSize, hiddenSize], mean=0, stddev=0.1))

 hiddenBias   = tf.Variable(tf.truncated_normal([hiddenSize]))

 hiddenLayer  = tf.add(tf.matmul(inputLayer, hiddenWeight), hiddenBias)

 hiddenLayer  = tf.nn.sigmoid(hiddenLayer)

5、输出层

　　输出层与隐藏层类似，只是节点数不一样。

 outputWeight = tf.Variable(tf.truncated_normal([hiddenSize, outputSize], mean=0, stddev=0.1))

 outputBias   = tf.Variable(tf.truncated_normal([outputSize], mean=0, stddev=0.1))

 outputLayer  = tf.add(tf.matmul(hiddenLayer, outputWeight), outputBias)

 outputLayer  = tf.nn.sigmoid(outputLayer)

6、输出标签

　　跟输入层一样，也是先占位，在最后训练的时候再传入具体的数据。标签，也就是每一个样本的正确分类。

outputLabel = tf.placeholder(tf.float32, shape=[None, outputSize])

7、损失函数

　　这里采用的是交叉熵损失函数。注意用的是v2版本，第一个版本已被TensorFlow声明为deprecated，准备废弃了。

loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(labels=outputLabel, logits=outputLayer))

8、优化器与目标函数

　　优化器采用了Adam梯度下降法，我试过了普通的GradientDescentOptimizer，效果不如Adam；也用过Adadelta，结果几乎收敛不了。

　　目标函数就是最小化损失函数。

optimizer = tf.train.AdamOptimizer()

target    = optimizer.minimize(loss)

9、训练过程

　　先创建一个会话，然后初始化tensors，最后进行迭代训练。模型的收敛速度很快，在1000次的时候就达到了大概90%的正确率。

 with tf.Session() as sess:

     sess.run(tf.global_variables_initializer())

     for i in range(trainCycle):

         batch = mnist.train.next_batch(batchSize)

         sess.run(target, feed_dict={inputLayer: batch[0], outputLabel: batch[1]})

         if i % 1000 == 0:

             corrected = tf.equal(tf.argmax(outputLabel, 1), tf.argmax(outputLayer, 1))

             accuracy = tf.reduce_mean(tf.cast(corrected, tf.float32))

             accuracyValue = sess.run(accuracy, feed_dict={inputLayer: batch[0], outputLabel: batch[1]})

             print(i, 'train set accuracy:', accuracyValue)

模型训练输出：

10、测试训练结果

　　在测数据集上测试。准确率达到96%，比单层的神经网络好很多。

     corrected = tf.equal(tf.argmax(outputLabel, 1), tf.argmax(outputLayer, 1))

     accuracy  = tf.reduce_mean(tf.cast(corrected, tf.float32))

     accuracyValue = sess.run(accuracy, feed_dict={inputLayer: mnist.test.images, outputLabel: mnist.test.labels})

     print("accuracy on test set:", accuracyValue)

测试集上的输出：

附：

　　完整代码如下：

 import tensorflow as tf

 from tensorflow.examples.tutorials.mnist import input_data

 mnist = input_data.read_data_sets('./data/mnist', one_hot=True)

 inputSize  = 784

 outputSize = 10

 hiddenSize = 50

 batchSize  = 64

 trainCycle = 50000

 # 输入层

 inputLayer = tf.placeholder(tf.float32, shape=[None, inputSize])

 # 隐藏层

 hiddenWeight = tf.Variable(tf.truncated_normal([inputSize, hiddenSize], mean=0, stddev=0.1))

 hiddenBias   = tf.Variable(tf.truncated_normal([hiddenSize]))

 hiddenLayer  = tf.add(tf.matmul(inputLayer, hiddenWeight), hiddenBias)

 hiddenLayer  = tf.nn.sigmoid(hiddenLayer)

 # 输出层

 outputWeight = tf.Variable(tf.truncated_normal([hiddenSize, outputSize], mean=0, stddev=0.1))

 outputBias   = tf.Variable(tf.truncated_normal([outputSize], mean=0, stddev=0.1))

 outputLayer  = tf.add(tf.matmul(hiddenLayer, outputWeight), outputBias)

 outputLayer  = tf.nn.sigmoid(outputLayer)

 # 标签

 outputLabel = tf.placeholder(tf.float32, shape=[None, outputSize])

 # 损失函数

 loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(labels=outputLabel, logits=outputLayer))

 # 优化器

 optimizer = tf.train.AdamOptimizer()

 # 训练目标

 target = optimizer.minimize(loss)

 # 训练

 with tf.Session() as sess:

     sess.run(tf.global_variables_initializer())

     for i in range(trainCycle):

         batch = mnist.train.next_batch(batchSize)

         sess.run(target, feed_dict={inputLayer: batch[0], outputLabel: batch[1]})

         if i % 1000 == 0:

             corrected = tf.equal(tf.argmax(outputLabel, 1), tf.argmax(outputLayer, 1))

             accuracy = tf.reduce_mean(tf.cast(corrected, tf.float32))

             accuracyValue = sess.run(accuracy, feed_dict={inputLayer: batch[0], outputLabel: batch[1]})

             print(i, 'train set accuracy:', accuracyValue)

     # 测试

     corrected = tf.equal(tf.argmax(outputLabel, 1), tf.argmax(outputLayer, 1))

     accuracy  = tf.reduce_mean(tf.cast(corrected, tf.float32))

     accuracyValue = sess.run(accuracy, feed_dict={inputLayer: mnist.test.images, outputLabel: mnist.test.labels})

     print("accuracy on test set:", accuracyValue)

     sess.close()

本文地址：https://www.cnblogs.com/laishenghao/p/9736696.html

TensorFlow 训练MNIST数据集（2）—— 多层神经网络的更多相关文章

TensorFlow训练MNIST数据集（3） —— 卷积神经网络
前面两篇随笔实现的单层神经网络和多层神经网络, 在MNIST测试集上的正确率分别约为90%和96%.在换用多层神经网络后,正确率已有很大的提升.这次将采用卷积神经网络继续进行测试. 1.模型基本结构 ...
TensorFlow训练MNIST数据集（1） —— softmax 单层神经网络
1.MNIST数据集简介首先通过下面两行代码获取到TensorFlow内置的MNIST数据集: from tensorflow.examples.tutorials.mnist import inp ...
2、TensorFlow训练MNIST
装载自:http://www.tensorfly.cn/tfdoc/tutorials/mnist_beginners.html TensorFlow训练MNIST 这个教程的目标读者是对机器学习和T ...
使用caffe训练mnist数据集 - caffe教程实战（一）
个人认为学习一个陌生的框架,最好从例子开始,所以我们也从一个例子开始. 学习本教程之前,你需要首先对卷积神经网络算法原理有些了解,而且安装好了caffe 卷积神经网络原理参考:http://cs231 ...
实践详细篇-Windows下使用VS2015编译的Caffe训练mnist数据集
上一篇记录的是学习caffe前的环境准备以及如何创建好自己需要的caffe版本.这一篇记录的是如何使用编译好的caffe做训练mnist数据集,步骤编号延用上一篇 <实践详细篇-Windows下 ...
一个简单的TensorFlow可视化MNIST数据集识别程序
下面是TensorFlow可视化MNIST数据集识别程序,可视化内容是,TensorFlow计算图,表(loss, 直方图, 标准差(stddev)) # -*- coding: utf-8 -*- ...
MNIST数据集上卷积神经网络的简单实现（使用PyTorch)
设计的CNN模型包括一个输入层,输入的是MNIST数据集中28*28*1的灰度图两个卷积层, 第一层卷积层使用6个3*3的kernel进行filter,步长为1,填充1.这样得到的尺寸是(28+1* ...
TensorFlow训练MNIST报错ResourceExhaustedError
title: TensorFlow训练MNIST报错ResourceExhaustedError date: 2018-04-01 12:35:44 categories: deep learning ...
基于 tensorflow 的 mnist 数据集预测
1. tensorflow 基本使用方法 2. mnist 数据集简介与预处理 3. 聚类算法模型 4. 使用卷积神经网络进行特征生成 5. 训练网络模型生成结果 how to install ten ...

随机推荐

python基础知识回顾之元组
元组与列表的方法基本一样,只不过创建元组是用小括号()把元素括起来,两者的区别在于,元组的元素不可被修改. 元组被称为只读列表,即数据可以被查询,但不能被修改,列表的切片操作适用于元组. 元组写在小括 ...
python3.6和pip3安装
CenOS7 安装依赖环境 yum -y install openssl-devel ncurses-devel sqlite-devel readline-devel tk-devel gcc 编译 ...
SHGetFileInfo 报错异常问题
查看代码是否使用了 ::CoInitializeEx(NULL, COINIT_MULTITHREADED); 如果是,换成在每个线程调用 ::CoInitialize(NULL); 真够蛋疼的,查了 ...
LeetCode 休闲计划
老年退役选手的 LeetCode 休闲之旅前言不知不觉两年多的大学时光悄然流逝,浑浑噩噩的状态似乎从来没有离开过自己. 这两年刷题似乎一直是常态.在退役之后的现在,深感有些东西一段时间没有接触,很 ...
MySQL 在各种程序语音的连接字符串（转）
一.MySQL Connector/ODBC 2.50 (MyODBC 2.50)连接方式 1.本地数据库连接Driver={MySQL};Server=localhost;Option=16834; ...
为什么ConcurrentHashMap的读操作不需要加锁？
我们知道,ConcurrentHashmap(1.8)这个并发集合框架是线程安全的,当你看到源码的get操作时,会发现get操作全程是没有加任何锁的,这也是这篇博文讨论的问题--为什么它不需要加锁呢? ...
Win8下IIS的安装和站点的公布
版权声明:本文为博主原创文章,不经博主同意注明链接就可以转载. https://blog.csdn.net/Senior_lee/article/details/32939411 之前 ...
Oracle rdbms Brush password
Restore database user history account password 1. 用户状态 select * from user_astatus_map; select * from ...
Unicode,ISO-8859-1,GBK,UTF-8编码及相互转换（转载）
第二篇:JAVA字符编码系列二:Unicode,ISO-8859-1,GBK,UTF-8编码及相互转换 1.函数介绍在Java中,字符串用统一的Unicode编码,每个字符占用两个字节,与编码有关的两 ...
java字符串利用dom4j转 xml 且遍历
1.因为转换的格式不是标准格式,所以有时候获得xml根目录后rootElement.attributes() 取不到想要的属性所以需要通过迭代器来获取想要的值 public static void ...

TensorFlow 训练MNIST数据集（2）—— 多层神经网络

TensorFlow 训练MNIST数据集（2）—— 多层神经网络的更多相关文章

随机推荐

热门专题