基于tensorflow实现mnist手写识别（多层神经网络）

标题党其实也不多，一个输入层，三个隐藏层，一个输出层

老样子先上代码

导入mnist的路径很长，现在还记不住

import tensorflow as tf

import tensorflow.examples.tutorials.mnist.input_data as input_data

import numpy as np

import matplotlib.pyplot as plt

from time import time

mnist =  input_data.read_data_sets("data/",one_hot = True)

#导入Tensorflwo和mnist数据集等 常用库

设置输入层，X为样本数据，y是标签值

X 784是因为28*28，None是因为不知道需要用多少样本

Y 10是因为 0~9的预测输出,None理由同上

x = tf.placeholder(tf.float32,[None,784],name='X')

y = tf.placeholder(tf.float32,[None,10],name='Y')

3层这样写有点啰嗦下一版有个用函数实现的比这个好。

tf.truncated_normal([784,H1_NN],stddev = 0.1)以截断正态分布的随机初始化，数学原理不解释(budong),反正大小控制在stddev里面 方便后面训练

H1_NN = 256 #第一层神经元节点数

H2_NN = 64 #第二层神经元节点数

H3_NN = 32 #第三层神经元节点数

#第一层

W1 = tf.Variable(tf.truncated_normal([784,H1_NN],stddev = 0.1))

b1 = tf.Variable(tf.zeros(H1_NN))

#第二层

W2 = tf.Variable(tf.truncated_normal([H1_NN,H2_NN],stddev = 0.1))

b2 = tf.Variable(tf.zeros(H2_NN))

#第三层

W3 = tf.Variable(tf.truncated_normal([H2_NN,H3_NN],stddev = 0.1))

b3 = tf.Variable(tf.zeros(H3_NN))

#输出层

W4 = tf.Variable(tf.truncated_normal([H3_NN,10],stddev = 0.1))

b4 = tf.Variable(tf.zeros(10))

输出不多讲了前三层使用了Relu，最后输出因为是10分类所有使用了softmax

（今天写的时候记错了pred输出使用了loss函数的softmax计算导致程序报错，先记下来）

Y1 = tf.nn.relu(tf.matmul(x,W1)+b1) #使用Relu当作激活函数

Y2 = tf.nn.relu(tf.matmul(Y1,W2)+b2)#使用Relu当作激活函数

Y3 = tf.nn.relu(tf.matmul(Y2,W3)+b3)#使用Relu当作激活函数

forward = tf.matmul(Y3,W4)+b4

pred = tf.nn.softmax(forward)#输出层分类应用使用softmax当作激活函数

没错上面说的就是这个tf.nn.sofmax_cross_entropy_with_logits,不使用这个使用第一版的

#损失函数使用交叉熵

loss_function = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits = forward,labels = y))

会因为log为0导致梯度爆炸数学原理不太懂以后补一下,会了再来填充

loss_function = tf.reduce_mean(-tf.reduce_sum(y*tf.log(pred),

                                              reduction_indices=1))

超参数设置啥的没啥好说，值得一提total_batch 好像是类似一个洗牌的函数

#设置训练参数

train_epochs = 50

batch_size = 50

total_batch = int(mnist.train.num_examples/batch_size) #随机抽取样本

learning_rate = 0.01

display_step = 1

优化器，（反向传播？）不确定反正用来调整最优的w和b

#优化器

opimizer = tf.train.AdamOptimizer(learning_rate).minimize(loss_function)

利用argmax对比预测结果和标签值，方便后面统计准确率

#定义准确率

correct_prediction = tf.equal(tf.argmax(y,1),tf.argmax(pred,1))

accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32))

开始训练，后面会补一个保存和调用模型的代码不然以后模型大了不保存都要程序跑一次才能用太费时间，这里print我把用format的删了因为不太会用

#开始训练

sess = tf.Session()

init = tf.global_variables_initializer()

startTime = time()

sess.run(init)

for epochs in range(train_epochs):

    for batch in range(total_batch):

        xs,ys = mnist.train.next_batch(batch_size)#读取批次数据

        sess.run(opimizer,feed_dict={x:xs,y:ys})#执行批次数据训练

    #total_batch个批次训练完成后，使用验证数据计算误差与准确率

    loss,acc =  sess.run([loss_function,accuracy],

                        feed_dict={

                            x:mnist.validation.images,

                            y:mnist.validation.labels})

    #输出训练情况

    if(epochs+1) % display_step == 0:

        epochs += 1

        print("Train Epoch:",epochs,

               "Loss=",loss,"Accuracy=",acc)

duration = time()-startTime

print("Trian Finshed takes:","{:.2f}".format(duration))#显示预测耗时

最后50轮训练后准确率是0.97左右已经收敛了

使用测试集评估模型

#评估模型

accu_test =  sess.run(accuracy,feed_dict={x:mnist.test.images,y:mnist.test.labels})

print("model accuracy:",accu_test)

　　准确率0.9714，还行

到这里就结束了，最后把完整代码放上来方便以后看

import tensorflow as tf

import tensorflow.examples.tutorials.mnist.input_data as input_data

import numpy as np

import matplotlib.pyplot as plt

from time import time

mnist =  input_data.read_data_sets("data/",one_hot = True)

#导入Tensorflwo和mnist数据集等 常用库

x = tf.placeholder(tf.float32,[None,784],name='X')

y = tf.placeholder(tf.float32,[None,10],name='Y')

H1_NN = 256 #第一层神经元节点数

H2_NN = 64 #第二层神经元节点数

H3_NN = 32 #第三层神经元节点数

#第一层

W1 = tf.Variable(tf.truncated_normal([784,H1_NN],stddev = 0.1))

b1 = tf.Variable(tf.zeros(H1_NN))

#第二层

W2 = tf.Variable(tf.truncated_normal([H1_NN,H2_NN],stddev = 0.1))

b2 = tf.Variable(tf.zeros(H2_NN))

#第三层

W3 = tf.Variable(tf.truncated_normal([H2_NN,H3_NN],stddev = 0.1))

b3 = tf.Variable(tf.zeros(H3_NN))

#输出层

W4 = tf.Variable(tf.truncated_normal([H3_NN,10],stddev = 0.1))

b4 = tf.Variable(tf.zeros(10))

#计算结果

Y1 = tf.nn.relu(tf.matmul(x,W1)+b1) #使用Relu当作激活函数

Y2 = tf.nn.relu(tf.matmul(Y1,W2)+b2)#使用Relu当作激活函数

Y3 = tf.nn.relu(tf.matmul(Y2,W3)+b3)#使用Relu当作激活函数

forward = tf.matmul(Y3,W4)+b4

pred = tf.nn.softmax(forward)#输出层分类应用使用softmax当作激活函数

#损失函数使用交叉熵

loss_function = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits = forward,labels = y))

#设置训练参数

train_epochs = 50

batch_size = 50

total_batch = int(mnist.train.num_examples/batch_size) #随机抽取样本

learning_rate = 0.01

display_step = 1

#优化器

opimizer = tf.train.AdamOptimizer(learning_rate).minimize(loss_function)

#定义准确率

correct_prediction = tf.equal(tf.argmax(y,1),tf.argmax(pred,1))

accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32))

#开始训练

sess = tf.Session()

init = tf.global_variables_initializer()

startTime = time()

sess.run(init)

for epochs in range(train_epochs):

    for batch in range(total_batch):

        xs,ys = mnist.train.next_batch(batch_size)#读取批次数据

        sess.run(opimizer,feed_dict={x:xs,y:ys})#执行批次数据训练

    #total_batch个批次训练完成后，使用验证数据计算误差与准确率

    loss,acc =  sess.run([loss_function,accuracy],

                        feed_dict={

                            x:mnist.validation.images,

                            y:mnist.validation.labels})

    #输出训练情况

    if(epochs+1) % display_step == 0:

        epochs += 1

        print("Train Epoch:",epochs,

               "Loss=",loss,"Accuracy=",acc)

duration = time()-startTime

print("Trian Finshed takes:","{:.2f}".format(duration))#显示预测耗时

#评估模型

accu_test =  sess.run(accuracy,feed_dict={x:mnist.test.images,y:mnist.test.labels})

print("model accuracy:",accu_test)

全部代码

基于tensorflow实现mnist手写识别（多层神经网络）的更多相关文章

使用tensorflow实现mnist手写识别(单层神经网络实现)
import tensorflow as tf import tensorflow.examples.tutorials.mnist.input_data as input_data import n ...
基于tensorflow的MNIST手写识别
这个例子,是学习tensorflow的人员通常会用到的,也是基本的学习曲线中的一环.我也是! 这个例子很简单,这里,就是简单的说下,不同的tensorflow版本,相关的接口函数,可能会有不一样哟.在 ...
基于tensorflow的MNIST手写数字识别（二）--入门篇
http://www.jianshu.com/p/4195577585e6 基于tensorflow的MNIST手写字识别(一)--白话卷积神经网络模型基于tensorflow的MNIST手写数字识 ...
基于TensorFlow的MNIST手写数字识别-初级
一:MNIST数据集下载地址 MNIST是一个包含很多手写数字图片的数据集,一共4个二进制压缩文件分别是test set images,test set labels,training se ...
基于TensorFlow的MNIST手写数字识别-深入
构建多层卷积神经网络时需要多组W和偏移项b,我们封装2个方法来产生W和b 初级MNIST中用0初始化W和b,这里用噪声初始化进行对称打破,防止产生梯度0,同时用一个小的正值来初始化b避免dead ne ...
Tensorflow之基于MNIST手写识别的入门介绍
Tensorflow是当下AI热潮下,最为受欢迎的开源框架.无论是从Github上的fork数量还是star数量,还是从支持的语音,开发资料,社区活跃度等多方面,他当之为superstar. 在前面介 ...
Android+TensorFlow+CNN+MNIST 手写数字识别实现
Android+TensorFlow+CNN+MNIST 手写数字识别实现 SkySeraph 2018 Email:skyseraph00#163.com 更多精彩请直接访问SkySeraph个人站 ...
TensorFlow 入门之手写识别(MNIST) softmax算法
TensorFlow 入门之手写识别(MNIST) softmax算法 MNIST flyu6 softmax回归 softmax回归算法 TensorFlow实现softmax softmax回归算 ...
tensorflow笔记（四）之MNIST手写识别系列一
tensorflow笔记(四)之MNIST手写识别系列一版权声明:本文为博主原创文章,转载请指明转载地址 http://www.cnblogs.com/fydeblog/p/7436310.html ...

随机推荐

jenkins权限配置
1. 授权匿名账户权限 2 注册新用户,并且把匿名权限删除,添加用户权限 Overall(全局) Credentials(凭证) Slave(节点) Job(任务) View(视图) Administ ...
Rendering Engine 主流的浏览器内核（排版引擎、渲染引擎、解释引擎）有哪几种，分别的特点
一.A web browser engine A rendering engine is software that draws text and images on the screen. The ...
webpack初起步
webpack 用于编译 JavaScript 模块.一旦完成安装,就可以通过 webpack 的 CLI 或 API 与其配合交互. 首先创建一个目录,进入目录中,初始化npm 然后本地安装webp ...
2017-2018-1 20155310 《信息安全系统设计基础》实现mypwd
2017-2018-1 20155310 <信息安全系统设计基础> 实现mypwd 作业要求: 1.学习pwd命令 2.研究pwd实现需要的系统调用(man -k; grep),写出伪代码 ...
微信小程序的经纬度不想写死，需要转成number类型不能用浮点型
click: function (e) { var msg = this.data.placeData; var latitude = Number(msg.latitude) var longitu ...
PostgreSQL数据库表名的大小写实验
磨砺技术珠矶,践行数据之道,追求卓越价值回到上一级页面:PostgreSQL基础知识与基本操作索引页回到顶级页面:PostgreSQL索引页[作者高健@博客园 luckyjackgao@g ...
洛谷 2953 [USACO09OPEN]牛的数字游戏Cow Digit Game
洛谷 2953 [USACO09OPEN]牛的数字游戏Cow Digit Game 题目描述 Bessie is playing a number game against Farmer John, ...
3495: PA2010 Riddle
3495: PA2010 Riddle 链接分析: 每个点要么建首都,要么不建,并且一个点建了,会导致一些点不能建.所以可以考虑2-sat. 但是如果在每个郡里两两连边,边数是n^2的. 考虑用前缀 ...
setInterval只执行一次的原因
setInterval(arrow(),) 改为: setInterval(arrow,) 原因: arrow()这是一个函数调用,函数调用就会有返回值, 而arrow()没有返回值,所以这里的arr ...
2_C语言中的数据类型（八）运算符
1 运算符表达式和语句 1.1 基本运算符 1.1.1 = 数据对象:泛指数据在内存的存储区域左值:表示可以被更改的数据对象右值:能赋给左值的量 1 ...

基于tensorflow实现mnist手写识别 （多层神经网络）

基于tensorflow实现mnist手写识别 （多层神经网络）的更多相关文章

随机推荐

热门专题

基于tensorflow实现mnist手写识别（多层神经网络）

基于tensorflow实现mnist手写识别（多层神经网络）的更多相关文章