Differences between tf.contrib.rnn.static_rnn and tf.nn.dynamic_rnn

https://blog.csdn.net/u014365862/article/details/78238807

MachineLP's GitHub (follows welcome): https://github.com/MachineLP

My GitHub: https://github.com/MachineLP/train_cnn-rnn-attention is a framework I built myself. The models it includes are vgg (vgg16, vgg19), resnet (resnet_v2_50, resnet_v2_101, resnet_v2_152), inception_v4, inception_resnet_v2, and others.

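import tensorflow as tf  # TensorFlow 1.x; tf.contrib was removed in 2.0

chunk_size = 256  # input features per time step
chunk_n = 160     # number of time steps
rnn_size = 256    # hidden units per LSTM cell
num_layers = 2
# Output layer size. MAX_CAPTCHA and CHAR_SET_LEN are not defined in this post.
n_output_layer = MAX_CAPTCHA*CHAR_SET_LEN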

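Single-layer RNN:

tf.contrib.rnn.static_rnn:

Input: [n_steps, batch, input], passed as a Python list of n_steps tensors, each of shape [batch, input]

Output: [n_steps, batch, n_hidden], returned as a list of n_steps tensors, each of shape [batch, n_hidden], so outputs[-1] is the last step

Dropout can also be added to the RNN (see DropoutWrapper in the multi-layer example).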

def recurrent_neural_network(data):
    # [batch, chunk_n, chunk_size]
    data = tf.reshape(data, [-1, chunk_n, chunk_size])
    # [chunk_n, batch, chunk_size]
    data = tf.transpose(data, [1,0,2])
    # [chunk_n * batch, chunk_size]
    data = tf.reshape(data, [-1, chunk_size])
    # list of chunk_n tensors, each [batch, chunk_size]
    data = tf.split(data,chunk_n)
    # RNN only
    layer = {'w_':tf.Variable(tf.random_normal([rnn_size, n_output_layer])), 'b_':tf.Variable(tf.random_normal([n_output_layer]))}
    lstm_cell = tf.contrib.rnn.BasicLSTMCell(rnn_size)
    outputs, status = tf.contrib.rnn.static_rnn(lstm_cell, data, dtype=tf.float32)
    # outputs = tf.transpose(outputs, [1,0,2])
    # outputs = tf.reshape(outputs, [-1, chunk_n*rnn_size])
    # outputs is a list, so outputs[-1] is the last time step: [batch, rnn_size]
    output = tf.add(tf.matmul(outputs[-1], layer['w_']), layer['b_'])
    return output
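
The reshape, transpose, and split steps in the function above can be hard to follow from shapes alone. Below is a minimal NumPy sketch of the same sequence of operations, not the TensorFlow code itself. The small sizes are assumptions chosen for readability (the post uses chunk_n = 160 and chunk_size = 256), and `np.split` stands in for `tf.split`.

```python
import numpy as np

# Illustrative sizes only (assumptions, much smaller than in the post).
batch, chunk_n, chunk_size = 3, 4, 5

# Flat input, one row per sample.
data = np.arange(batch * chunk_n * chunk_size, dtype=np.float32).reshape(batch, -1)

x = data.reshape(-1, chunk_n, chunk_size)  # [batch, chunk_n, chunk_size]
x = x.transpose(1, 0, 2)                   # [chunk_n, batch, chunk_size]
x = x.reshape(-1, chunk_size)              # [chunk_n * batch, chunk_size]
steps = np.split(x, chunk_n)               # list of chunk_n arrays, each [batch, chunk_size]

# Element t of the list holds time step t of every sample in the batch.
batch_major = data.reshape(batch, chunk_n, chunk_size)
matches = all(np.array_equal(steps[t], batch_major[:, t, :]) for t in range(chunk_n))

print(len(steps), steps[0].shape, matches)
```

The transpose has to come before the flattening reshape: without it, each slice produced by the split would mix time steps from a single sample instead of collecting one time step across the batch.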

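Multi-layer RNN:

tf.nn.dynamic_rnn:

Input: [batch, n_steps, input]
Output: [batch, n_steps, n_hidden]
So tf.transpose(outputs, [1, 0, 2]) is needed to move the time axis to the front; after that, outputs[-1] is the output of the last step.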
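
To make the last-step selection concrete, here is a small NumPy sketch (an illustration with made-up sizes, not TensorFlow code). It shows that transposing a batch-major array to time-major and indexing with [-1] picks the same values as slicing the time axis directly.

```python
import numpy as np

# Illustrative sizes only (assumptions).
batch, n_steps, n_hidden = 2, 3, 4
rng = np.random.default_rng(0)

# Stand-in for dynamic_rnn outputs: [batch, n_steps, n_hidden]
outputs = rng.standard_normal((batch, n_steps, n_hidden))

# [batch, n_steps, n_hidden] -> [n_steps, batch, n_hidden]
time_major = outputs.transpose(1, 0, 2)
last_via_transpose = time_major[-1]   # [batch, n_hidden]

# Equivalent: slice the last position of the time axis without transposing.
last_via_slice = outputs[:, -1, :]    # [batch, n_hidden]

print(last_via_transpose.shape, np.array_equal(last_via_transpose, last_via_slice))
```

Both routes give a [batch, n_hidden] array, which is the shape the final matmul with the [rnn_size, n_output_layer] weight matrix expects.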

def recurrent_neural_network(data):
    # [batch, chunk_n, input]; dynamic_rnn takes this batch-major tensor directly
    data = tf.reshape(data, [-1, chunk_n, chunk_size])
    # The static_rnn preprocessing is not needed here:
    # data = tf.transpose(data, [1,0,2])
    # data = tf.reshape(data, [-1, chunk_size])
    # data = tf.split(data,chunk_n)
    # RNN only
    layer = {'w_':tf.Variable(tf.random_normal([rnn_size, n_output_layer])), 'b_':tf.Variable(tf.random_normal([n_output_layer]))}
    # Single-layer static_rnn alternative:
    # lstm_cell1 = tf.contrib.rnn.BasicLSTMCell(rnn_size)
    # outputs1, status1 = tf.contrib.rnn.static_rnn(lstm_cell1, data, dtype=tf.float32)
    def lstm_cell():
        return tf.contrib.rnn.LSTMCell(rnn_size)
    def attn_cell():
        # keep_prob is not defined in this post; it must exist before attn_cell() is called
        return tf.contrib.rnn.DropoutWrapper(lstm_cell(), output_keep_prob=keep_prob)
    # With dropout on every layer:
    # stack = tf.contrib.rnn.MultiRNNCell([attn_cell() for _ in range(0, num_layers)], state_is_tuple=True)
    stack = tf.contrib.rnn.MultiRNNCell([lstm_cell() for _ in range(0, num_layers)], state_is_tuple=True)
    # With per-example sequence lengths (seq_len is not defined in this post):
    # outputs, _ = tf.nn.dynamic_rnn(stack, data, seq_len, dtype=tf.float32)
    outputs, _ = tf.nn.dynamic_rnn(stack, data, dtype=tf.float32)
    # [batch,chunk_n,rnn_size] -> [chunk_n,batch,rnn_size]
    outputs = tf.transpose(outputs, (1, 0, 2))
    output = tf.add(tf.matmul(outputs[-1], layer['w_']), layer['b_'])
    return output
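
The function above wraps each cell in DropoutWrapper with output_keep_prob when attn_cell() is used. As a rough illustration of what dropping outputs with a keep probability means, here is a NumPy sketch of inverted dropout, the scaling scheme tf.nn.dropout uses. The helper name `dropout_outputs` is hypothetical, and this is not the wrapper's actual implementation.

```python
import numpy as np

def dropout_outputs(outputs, keep_prob, rng):
    """Hypothetical helper: inverted dropout on an array of RNN outputs.

    Each element is kept with probability keep_prob and scaled by
    1 / keep_prob, so the expected value of the result matches the input.
    """
    mask = rng.random(outputs.shape) < keep_prob
    return np.where(mask, outputs / keep_prob, 0.0)

rng = np.random.default_rng(0)
outputs = np.ones((4, 6, 8))  # stand-in for [batch, n_steps, n_hidden]

dropped = dropout_outputs(outputs, keep_prob=0.5, rng=rng)
unchanged = dropout_outputs(outputs, keep_prob=1.0, rng=rng)

# With an all-ones input and keep_prob = 0.5, every element is either 0.0 or 2.0.
print(dropped.shape, np.unique(dropped))
```

With keep_prob = 1.0 nothing is dropped, which is the value to feed at evaluation time if keep_prob is supplied as a placeholder.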


