RNN入门（4）利用LSTM实现整数加法运算

本文将介绍LSTM模型在实现整数加法方面的应用。

我们以0-255之间的整数加法为例，生成的结果在0到510之间。为了能利用深度学习模型模拟整数的加法运算，我们需要将输入的两个加数和输出的结果用二进制表示，这样就能得到向量，如加数在0-255内，可以用8位0-1向量来表示，前面的空位用0填充；结果在0-510内，可以用9位0-1向量来表示，前面的空位用0填充。因为两个加数均在0-255内变化，所以共有256*256=65536个输入向量以及65536个输出向量，输入向量为两个加数的二进制向量的拼接结果，因而是个16为的输入向量。用以下的Python代码可以模拟以上过程：

import numpy as np

# 最多8位二进制

BINARY_DIM = 8

# 将整数表示成为binary_dim位的二进制数，高位用0补齐

def int_2_binary(number, binary_dim):

    binary_list = list(map(lambda x: int(x), bin(number)[2:]))

    number_dim = len(binary_list)

    result_list = [0]*(binary_dim-number_dim)+binary_list

    return result_list

# 将一个二进制数组转为整数

def binary2int(binary_array):

    out = 0

    for index, x in enumerate(reversed(binary_array)):

        out += x * pow(2, index)

    return out

# 将[0,2**BINARY_DIM)所有数表示成二进制

binary = np.array([int_2_binary(x, BINARY_DIM) for x in range(2**BINARY_DIM)])

# print(binary)

# 样本的输入向量和输出向量

dataX = []

dataY = []

for i in range(binary.shape[0]):

    for j in range(binary.shape[0]):

        dataX.append(np.append(binary[i], binary[j]))

        dataY.append(int_2_binary(i+j, BINARY_DIM+1))

# print(dataX)

# print(dataY)

# 重新特征X和目标变量Y数组，适应LSTM模型的输入和输出

X = np.reshape(dataX, (len(dataX), 2*BINARY_DIM, 1))

# print(X.shape)

Y = np.array(dataY)

# print(dataY.shape)

在以上代码中，得到的dataX和dataY以满足要求，但为了能让LSTM模型处理，需要改变这两个数据集的形状。

我们采用LSTM模型来训练上述数据，LSTM模型的结构很简单，就是简单的一层LSTM层，然后加上Dropout层，最后是全连接层，激活函数采用sigmoid函数，采用的损失函数为平均平方误差。整个结构的示意图如下：

模型训练的代码如下：

from keras.models import Sequential

from keras.layers import Dense

from keras.layers import Dropout

from keras.layers import LSTM

from keras import losses

from keras.utils import plot_model

# 定义LSTM模型

model = Sequential()

model.add(LSTM(256, input_shape=(X.shape[1], X.shape[2])))

model.add(Dropout(0.2))

model.add(Dense(Y.shape[1], activation='sigmoid'))

model.compile(loss=losses.mean_squared_error, optimizer='adam')

# print(model.summary())

# plot model

plot_model(model, to_file=r'./model.png', show_shapes=True)

# train model

epochs = 100

model.fit(X, Y, epochs=epochs, batch_size=128)

# save model

mp = r'./LSTM_Operation.h5'

model.save(mp)

该LSTM模型每批训练128个样本，共训练100次，采用Adam优化器减少损失值。

对这个模型进行训练，训练100次，损失值为0.0045。接下来我们就要用这个训练好的模型来预测。我们预测的方法为，虽然挑两个在0-255内的加数，转化为二进制向量作为输入向量，然后由LSTM模型输出结果，将该结果取整作为输出向量中的元素，最后将这个输出向量转化为整数，就是预测的两个加数的和。模型预测的代码如下：

# use LSTM model to predict

for _ in range(100):

    start = np.random.randint(0, len(dataX)-1)

    # print(dataX[start])

    number1 = dataX[start][0:BINARY_DIM]

    number2 = dataX[start][BINARY_DIM:]

    print('='*30)

    print('%s: %s'%(number1, binary2int(number1)))

    print('%s: %s'%(number2, binary2int(number2)))

    sample = np.reshape(X[start], (1, 2*BINARY_DIM, 1))

    predict = np.round(model.predict(sample), 0).astype(np.int32)[0]

    print('%s: %s'%(predict, binary2int(predict)))

预测的100组样本的输出结果如下：

==============================

[1 0 0 1 1 1 0 1]: 157

[0 1 1 1 0 0 0 1]: 113

[1 0 0 0 0 1 1 1 0]: 270

==============================

[1 1 1 0 1 0 1 0]: 234

[0 1 0 0 1 1 0 0]: 76

[1 0 0 1 1 0 1 1 0]: 310

==============================

[1 1 0 0 0 1 0 0]: 196

[1 1 0 1 1 0 1 1]: 219

[1 1 0 0 1 1 1 1 1]: 415

==============================

[0 0 1 1 1 0 1 0]: 58

[0 0 1 0 0 0 1 1]: 35

[0 0 1 0 1 1 1 0 1]: 93

==============================

[1 0 0 0 0 0 0 0]: 128

[0 1 1 1 1 0 0 1]: 121

[0 1 1 1 1 1 0 0 1]: 249

==============================

[1 1 1 1 0 1 1 0]: 246

[1 1 0 1 0 1 0 1]: 213

[1 1 1 0 0 1 0 1 1]: 459

==============================

[1 1 1 0 0 1 1 0]: 230

[1 0 0 0 0 0 0 0]: 128

[1 0 1 1 0 0 1 1 0]: 358

==============================

[1 0 1 0 0 0 1 1]: 163

[0 1 1 0 0 1 0 1]: 101

[1 0 0 0 0 1 0 0 0]: 264

==============================

[1 0 1 0 0 1 1 0]: 166

[0 1 0 1 0 0 0 0]: 80

[0 1 1 1 1 0 1 1 0]: 246

==============================

[0 0 0 0 1 0 1 1]: 11

[0 1 0 0 0 1 0 1]: 69

[0 0 1 0 1 0 0 0 0]: 80

==============================

[1 1 1 1 0 1 1 1]: 247

[0 1 1 1 0 0 0 0]: 112

[1 0 1 1 0 0 1 1 1]: 359

==============================

[1 0 1 0 1 0 0 1]: 169

[1 1 0 0 0 0 0 0]: 192

[1 0 1 1 0 1 0 0 1]: 361

==============================

[1 0 1 1 0 0 0 1]: 177

[1 0 0 0 1 0 1 1]: 139

[1 0 0 1 1 1 1 0 0]: 316

==============================

[0 1 0 0 0 1 1 0]: 70

[0 0 1 0 1 1 1 0]: 46

[0 0 1 1 1 0 1 0 0]: 116

==============================

[1 0 0 1 1 0 1 1]: 155

[1 1 0 0 0 0 0 1]: 193

[1 0 1 0 1 1 1 0 0]: 348

==============================

[1 0 1 1 0 0 1 0]: 178

[1 0 0 0 1 1 1 1]: 143

[1 0 1 0 0 0 0 0 1]: 321

==============================

[0 1 0 1 1 1 1 1]: 95

[1 1 1 0 0 1 0 0]: 228

[1 0 1 0 0 0 0 1 1]: 323

==============================

[1 0 0 1 1 1 1 0]: 158

[0 0 0 1 1 0 0 1]: 25

[0 1 0 1 1 0 1 1 1]: 183

==============================

[1 1 1 0 1 0 1 1]: 235

[1 1 0 0 0 0 0 1]: 193

[1 1 0 1 0 1 1 0 0]: 428

==============================

[0 1 0 1 1 1 0 1]: 93

[0 1 1 1 0 1 1 0]: 118

[0 1 1 0 1 0 0 1 1]: 211

==============================

[1 1 1 1 1 1 1 1]: 255

[1 1 1 1 1 1 1 0]: 254

[1 1 1 1 1 1 1 0 1]: 509

==============================

[0 1 0 1 1 0 0 1]: 89

[0 1 0 1 1 1 1 0]: 94

[0 1 0 1 1 0 1 1 1]: 183

==============================

[0 1 1 1 0 0 0 0]: 112

[0 0 1 1 0 1 0 0]: 52

[0 1 0 1 0 0 1 0 0]: 164

==============================

[1 0 0 0 0 0 0 0]: 128

[1 1 0 1 1 0 1 0]: 218

[1 0 1 0 1 1 0 1 0]: 346

==============================

[0 0 1 1 0 1 0 1]: 53

[1 0 1 1 1 1 1 0]: 190

[0 1 1 1 1 0 0 1 1]: 243

==============================

[0 1 1 1 1 0 0 0]: 120

[1 1 0 1 0 1 0 1]: 213

[1 0 1 0 0 1 1 0 1]: 333

==============================

[0 1 1 1 1 0 1 1]: 123

[1 1 1 0 1 1 0 1]: 237

[1 0 1 1 0 1 0 0 0]: 360

==============================

[1 0 0 1 1 0 1 0]: 154

[0 1 1 0 1 0 0 1]: 105

[1 0 0 0 0 0 0 1 1]: 259

==============================

[0 0 0 1 1 0 0 1]: 25

[0 1 0 1 1 0 1 0]: 90

[0 0 1 1 1 0 0 1 1]: 115

==============================

[1 1 1 1 0 0 0 1]: 241

[0 0 0 1 1 1 1 1]: 31

[1 0 0 0 1 0 0 0 0]: 272

==============================

[0 1 0 0 0 1 1 0]: 70

[1 1 1 0 1 0 0 1]: 233

[1 0 0 1 0 1 1 1 1]: 303

==============================

[1 0 1 0 1 1 0 1]: 173

[0 1 1 1 0 1 0 0]: 116

[1 0 0 1 0 0 0 0 1]: 289

==============================

[0 1 0 0 1 0 0 0]: 72

[1 1 1 1 1 0 1 0]: 250

[1 0 1 0 0 0 0 1 0]: 322

==============================

[1 1 1 1 0 0 0 0]: 240

[0 1 0 0 0 0 1 0]: 66

[1 0 0 1 1 0 0 1 0]: 306

==============================

[0 1 0 0 0 1 1 1]: 71

[1 0 0 1 0 1 1 0]: 150

[0 1 1 0 1 1 1 0 1]: 221

==============================

[0 1 1 0 1 1 0 1]: 109

[0 0 1 0 0 1 0 1]: 37

[0 1 0 0 1 0 0 1 0]: 146

==============================

[1 1 0 0 0 0 0 0]: 192

[1 1 1 0 0 0 0 1]: 225

[1 1 0 1 0 0 0 0 1]: 417

==============================

[1 0 0 0 0 0 1 1]: 131

[1 1 0 1 1 1 1 0]: 222

[1 0 1 1 0 0 0 0 1]: 353

==============================

[0 0 0 0 0 1 0 0]: 4

[1 1 1 0 0 0 1 0]: 226

[0 1 1 1 0 0 1 1 0]: 230

==============================

[1 1 1 0 1 1 1 1]: 239

[1 1 0 1 1 0 1 1]: 219

[1 1 1 0 0 1 0 1 0]: 458

==============================

[0 0 1 1 0 1 0 1]: 53

[1 1 1 1 0 0 1 0]: 242

[1 0 0 1 0 0 1 1 1]: 295

==============================

[1 0 0 1 0 0 0 1]: 145

[0 1 0 0 0 1 0 0]: 68

[0 1 1 0 1 0 1 0 1]: 213

==============================

[0 0 1 1 0 0 0 0]: 48

[1 0 1 1 0 1 1 1]: 183

[0 1 1 1 0 0 1 1 1]: 231

==============================

[0 1 1 0 0 1 1 1]: 103

[0 0 0 1 1 1 1 0]: 30

[0 1 0 0 0 0 1 0 1]: 133

==============================

[0 1 0 1 1 1 0 1]: 93

[1 1 0 1 0 0 1 0]: 210

[1 0 0 1 0 1 1 1 1]: 303

==============================

[1 0 0 0 1 0 1 0]: 138

[0 1 1 1 1 0 0 1]: 121

[1 0 0 0 0 0 0 1 1]: 259

==============================

[0 0 0 0 0 0 1 1]: 3

[0 0 1 1 0 0 0 1]: 49

[0 0 0 1 1 0 1 0 0]: 52

==============================

[1 0 0 0 0 0 1 0]: 130

[0 0 0 1 0 0 0 0]: 16

[0 1 0 0 1 0 0 1 0]: 146

==============================

[0 0 0 1 0 0 0 0]: 16

[1 0 0 1 0 0 1 0]: 146

[0 1 0 1 0 0 0 1 0]: 162

==============================

[0 1 0 1 0 1 0 0]: 84

[0 0 0 0 1 1 0 0]: 12

[0 0 1 1 0 0 0 0 0]: 96

==============================

[1 0 1 0 1 0 1 1]: 171

[1 1 0 1 1 0 1 1]: 219

[1 1 0 0 0 0 1 1 0]: 390

==============================

[1 1 1 1 1 1 1 0]: 254

[0 1 1 0 1 0 1 0]: 106

[1 0 1 1 0 1 0 0 0]: 360

==============================

[1 0 0 0 0 0 1 0]: 130

[0 0 0 0 1 1 1 0]: 14

[0 1 0 0 1 0 0 0 0]: 144

==============================

[1 0 1 0 0 1 0 1]: 165

[0 0 1 1 1 0 1 1]: 59

[0 1 1 1 0 0 0 0 0]: 224

==============================

[0 0 1 1 1 0 1 0]: 58

[1 1 1 1 0 0 1 0]: 242

[1 0 0 1 0 1 1 0 0]: 300

==============================

[0 1 0 0 1 1 0 1]: 77

[0 0 0 1 1 1 1 1]: 31

[0 0 1 1 0 1 1 0 0]: 108

==============================

[1 0 0 1 1 0 1 0]: 154

[0 1 0 1 0 1 0 1]: 85

[0 1 1 1 0 1 1 1 1]: 239

==============================

[0 1 1 0 1 1 0 1]: 109

[0 1 1 0 1 0 0 1]: 105

[0 1 1 0 1 0 1 1 0]: 214

==============================

[0 1 1 1 1 1 1 1]: 127

[0 1 1 1 0 0 1 0]: 114

[0 1 1 1 1 0 0 0 1]: 241

==============================

[0 1 1 0 0 1 0 1]: 101

[0 1 0 1 0 0 0 0]: 80

[0 1 0 1 1 0 1 0 1]: 181

==============================

[0 1 1 0 1 1 1 0]: 110

[0 1 0 1 0 1 1 0]: 86

[0 1 1 0 0 0 1 0 0]: 196

==============================

[0 0 0 1 0 0 1 1]: 19

[1 0 0 1 0 0 0 0]: 144

[0 1 0 1 0 0 0 1 1]: 163

==============================

[1 1 1 1 0 1 0 0]: 244

[1 1 0 1 0 0 1 1]: 211

[1 1 1 0 0 0 1 1 1]: 455

==============================

[0 0 0 0 1 1 1 0]: 14

[1 0 1 1 0 0 1 0]: 178

[0 1 1 0 0 0 0 0 0]: 192

==============================

[0 1 1 0 0 0 0 0]: 96

[1 0 0 1 1 1 0 0]: 156

[0 1 1 1 1 1 1 0 0]: 252

==============================

[0 0 1 1 0 1 0 0]: 52

[0 1 1 1 1 1 0 1]: 125

[0 1 0 1 1 0 0 0 1]: 177

==============================

[0 0 0 0 1 1 0 0]: 12

[0 1 0 1 1 1 0 1]: 93

[0 0 1 1 0 1 0 0 1]: 105

==============================

[0 1 1 0 0 1 0 1]: 101

[1 1 0 1 0 1 0 0]: 212

[1 0 0 1 1 1 0 0 1]: 313

==============================

[1 1 0 0 0 0 0 1]: 193

[1 1 0 0 1 1 0 1]: 205

[1 1 0 0 0 1 1 1 0]: 398

==============================

[0 1 1 1 0 0 1 0]: 114

[0 0 0 0 0 0 0 0]: 0

[0 0 1 1 1 0 0 1 0]: 114

==============================

[1 0 0 0 1 1 1 0]: 142

[1 0 1 1 1 1 0 1]: 189

[1 0 1 0 0 1 0 1 1]: 331

==============================

[1 0 1 1 0 1 1 1]: 183

[0 1 0 1 0 1 1 0]: 86

[1 0 0 0 0 1 1 0 1]: 269

==============================

[1 0 1 0 0 0 1 1]: 163

[1 1 1 0 0 1 0 1]: 229

[1 1 0 0 0 1 0 0 0]: 392

==============================

[0 0 1 1 0 0 0 1]: 49

[1 1 1 0 0 1 1 1]: 231

[1 0 0 0 1 1 0 0 0]: 280

==============================

[1 0 0 0 1 1 1 1]: 143

[1 0 1 0 1 0 0 0]: 168

[1 0 0 1 1 0 1 1 1]: 311

==============================

[0 1 0 0 0 0 0 0]: 64

[0 0 0 0 0 1 0 1]: 5

[0 0 1 0 0 0 1 0 1]: 69

==============================

[1 1 1 1 1 0 1 1]: 251

[1 0 1 1 1 0 0 1]: 185

[1 1 0 1 1 0 1 0 0]: 436

==============================

[1 1 1 0 1 1 1 0]: 238

[1 1 0 0 0 0 1 0]: 194

[1 1 0 1 1 0 0 0 0]: 432

==============================

[0 0 1 1 1 1 0 0]: 60

[0 0 0 1 0 1 1 1]: 23

[0 0 1 0 1 0 0 1 1]: 83

==============================

[0 1 1 1 0 1 0 0]: 116

[1 1 1 1 1 1 0 0]: 252

[1 0 1 1 1 0 0 0 0]: 368

==============================

[1 1 0 1 0 1 1 0]: 214

[1 1 1 1 0 1 0 0]: 244

[1 1 1 0 0 1 0 1 0]: 458

==============================

[1 1 1 1 1 1 1 0]: 254

[1 1 0 1 0 0 0 1]: 209

[1 1 1 0 0 1 1 1 1]: 463

==============================

[0 0 0 0 0 0 1 0]: 2

[0 0 0 0 1 1 0 1]: 13

[0 0 0 0 0 1 1 1 1]: 15

==============================

[0 1 1 0 0 1 1 1]: 103

[1 0 1 1 1 1 1 0]: 190

[1 0 0 1 0 0 1 0 1]: 293

==============================

[1 1 1 1 0 1 1 0]: 246

[0 1 0 1 0 0 1 0]: 82

[1 0 1 0 0 1 0 0 0]: 328

==============================

[0 1 1 1 0 0 1 1]: 115

[0 0 1 1 1 0 1 1]: 59

[0 1 0 1 0 1 1 1 0]: 174

==============================

[0 1 0 1 1 0 0 1]: 89

[0 1 1 0 1 0 1 1]: 107

[0 1 1 0 0 0 1 0 0]: 196

==============================

[0 1 0 0 0 1 0 0]: 68

[0 0 1 1 1 0 0 0]: 56

[0 0 1 1 1 1 1 0 0]: 124

==============================

[1 1 0 0 1 0 0 0]: 200

[1 0 1 0 0 0 1 0]: 162

[1 0 1 1 0 1 0 1 0]: 362

==============================

[1 1 1 1 0 0 1 1]: 243

[0 1 1 0 0 0 1 1]: 99

[1 0 1 0 1 0 1 1 0]: 342

==============================

[0 0 1 0 1 0 0 1]: 41

[0 1 0 0 1 0 0 1]: 73

[0 0 1 1 1 0 0 1 0]: 114

==============================

[0 0 0 1 1 1 0 1]: 29

[1 0 1 0 1 1 1 0]: 174

[0 1 1 0 0 1 0 1 1]: 203

==============================

[0 0 0 0 1 1 1 1]: 15

[0 0 1 1 1 1 0 1]: 61

[0 0 1 0 0 1 1 0 0]: 76

==============================

[1 1 1 1 1 0 1 1]: 251

[1 1 0 1 0 0 0 0]: 208

[1 1 1 0 0 1 0 1 1]: 459

==============================

[1 1 1 0 1 0 0 0]: 232

[0 1 1 0 0 0 1 0]: 98

[1 0 1 0 0 1 0 1 0]: 330

==============================

[1 0 1 1 0 1 0 0]: 180

[0 1 0 1 0 1 1 1]: 87

[1 0 0 0 0 1 0 1 1]: 267

==============================

[1 0 0 0 0 1 1 0]: 134

[1 0 0 1 0 1 0 1]: 149

[1 0 0 0 1 1 0 1 1]: 283

==============================

[1 0 1 0 1 1 0 1]: 173

[0 1 1 1 1 1 0 0]: 124

[1 0 0 1 0 1 0 0 1]: 297

==============================

[0 1 0 0 1 0 0 0]: 72

[0 1 1 0 0 0 1 1]: 99

[0 1 0 1 0 1 0 1 1]: 171

==============================

[1 1 0 1 0 1 0 1]: 213

[0 0 0 1 1 1 1 0]: 30

[0 1 1 1 1 0 0 1 1]: 243

可以看到，这个简单的LSTM模型的预测的结果全部正确。因此，这就可以用来模拟0-255内的整数的加法运算，是不是很神奇呢？

如果需要想将加数的范围扩大，只需要改变代码中的BINARY_DIM变量即可。但是，加数的范围越大，样本就越大，如2**10=1024内的加法，就会有1024*1024=1048576个样本，这样大的样本量的无疑需要更多的训练时间。

本文到此结束，感谢阅读_{如果不当之处，请速联系笔者，欢迎大家交流}祝您好运~

注意：本人现已开通微信公众号： Python爬虫与算法（微信号为：easy_web_scrape），欢迎大家关注哦~~

完整的Python代码如下：

import numpy as np

from keras.models import Sequential

from keras.layers import Dense

from keras.layers import Dropout

from keras.layers import LSTM

from keras import losses

from keras.utils import plot_model

# 最多8位二进制

BINARY_DIM = 8

# 将整数表示成为binary_dim位的二进制数，高位用0补齐

def int_2_binary(number, binary_dim):

    binary_list = list(map(lambda x: int(x), bin(number)[2:]))

    number_dim = len(binary_list)

    result_list = [0]*(binary_dim-number_dim)+binary_list

    return result_list

# 将一个二进制数组转为整数

def binary2int(binary_array):

    out = 0

    for index, x in enumerate(reversed(binary_array)):

        out += x * pow(2, index)

    return out

# 将[0,2**BINARY_DIM)所有数表示成二进制

binary = np.array([int_2_binary(x, BINARY_DIM) for x in range(2**BINARY_DIM)])

# print(binary)

# 样本的输入向量和输出向量

dataX = []

dataY = []

for i in range(binary.shape[0]):

    for j in range(binary.shape[0]):

        dataX.append(np.append(binary[i], binary[j]))

        dataY.append(int_2_binary(i+j, BINARY_DIM+1))

# print(dataX)

# print(dataY)

# 重新特征X和目标变量Y数组，适应LSTM模型的输入和输出

X = np.reshape(dataX, (len(dataX), 2*BINARY_DIM, 1))

# print(X.shape)

Y = np.array(dataY)

# print(dataY.shape)

# 定义LSTM模型

model = Sequential()

model.add(LSTM(256, input_shape=(X.shape[1], X.shape[2])))

model.add(Dropout(0.2))

model.add(Dense(Y.shape[1], activation='sigmoid'))

model.compile(loss=losses.mean_squared_error, optimizer='adam')

# print(model.summary())

# plot model

plot_model(model, to_file=r'./model.png', show_shapes=True)

# train model

epochs = 100

model.fit(X, Y, epochs=epochs, batch_size=128)

# save model

mp = r'./LSTM_Operation.h5'

model.save(mp)

# use LSTM model to predict

for _ in range(100):

    start = np.random.randint(0, len(dataX)-1)

    # print(dataX[start])

    number1 = dataX[start][0:BINARY_DIM]

    number2 = dataX[start][BINARY_DIM:]

    print('='*30)

    print('%s: %s'%(number1, binary2int(number1)))

    print('%s: %s'%(number2, binary2int(number2)))

    sample = np.reshape(X[start], (1, 2*BINARY_DIM, 1))

    predict = np.round(model.predict(sample), 0).astype(np.int32)[0]

    print('%s: %s'%(predict, binary2int(predict)))

RNN入门（4）利用LSTM实现整数加法运算的更多相关文章

POJ1503: Integer Inquiry(连续多个大整数加法运算)
#include<iostream> #include<cstring> using namespace std; string sum; ; string tool(stri ...
剑指offer第12题打印从1到n位数以及大整数加法乘法
字符和数字加减就是字符的ASCII码和数字直接加减. 方法一: 1)在字符串操作中给一个整形数字加(字符0)就是把它转化为字符,当然给一个字符减去(字符0)就可以把它转化为数字了:如果确实是最后 ...
NLP教程(5) - 语言模型、RNN、GRU与LSTM
作者:韩信子@ShowMeAI 教程地址:http://www.showmeai.tech/tutorials/36 本文地址:http://www.showmeai.tech/article-det ...
RNN 入门教程 Part 3 – 介绍 BPTT 算法和梯度消失问题
转载 - Recurrent Neural Networks Tutorial, Part 3 – Backpropagation Through Time and Vanishing Gradien ...
深度学习之循环神经网络RNN概述，双向LSTM实现字符识别
深度学习之循环神经网络RNN概述,双向LSTM实现字符识别 2. RNN概述 Recurrent Neural Network - 循环神经网络,最早出现在20世纪80年代,主要是用于时序数据的预测和 ...
AC日记——大整数加法 openjudge 1.6 10
10:大整数加法总时间限制: 1000ms 内存限制: 65536kB 描述求两个不超过200位的非负整数的和. 输入有两行,每行是一个不超过200位的非负整数,可能有多余的前导0. 输出 ...
JavaScript超大整数加法
原文:JavaScript超大整数加法什么是「超大整数」? JavaScript 采用 IEEE754标准中的浮点数算法来表示数字 Number. 我也没花时间去详细了解 IEEE754标准 ,但 ...
HDU1002——大整数加法
题目: I have a very simple problem for you. Given two integers A and B, your job is to calculate the S ...
Problem B: 大整数的加法运算
Problem B: 大整数的加法运算 Time Limit: 1 Sec Memory Limit: 128 MBSubmit: 112 Solved: 57[Submit][Status][W ...

随机推荐

【CF486E】LIS of Sequence题解
[CF486E]LIS of Sequence题解题目链接题意: 给你一个长度为n的序列a1,a2,...,an,你需要把这n个元素分成三类:1,2,3: 1:所有的最长上升子序列都不包含这个元素 ...
多个子域名前端网站调用同一个webAPI时session混用问题
session机制: 当程序需要为某个客户端的请求创建一个session的时候,服务器首先检查这个客户端的请求里是否已包含了一个session标识 - 称为session id,如果已包含一个sess ...
kvm-qcow2派生镜像的远程备份的方法！
在虚拟化环境中,关于虚拟机的远程备份是一个比较重要的环节,这个是有关于整个机房挂掉之后,仍然可以恢复的最后一招. 在kvm中这种情况可以通过直接备份虚拟机的镜像文件(qcow2)到远端存储解决. 但有 ...
OC字典的使用
在OC中,字符串.数组.字典是最常见的对象类型,但是在这三个当中,字典的用法相对较少,因为字典的属性和方法比较少,但是一个字典的用法比较复杂,因为在一个字典当中,既可以包含字符串,也可以包含数组,数组 ...
jQuery获取父级、兄弟节点的方法
一.jQuery的父节点查找方法 $(selector).parent(selector):获取父节点 $(selector).parentNode:以node[]的形式存放父节点,如果没有父节点,则 ...
JavaScript标识符与关键字和保留字
区分大小写 JavaScript中的一切(变量.函数名.操作符)都区分大小写.例如,变量名itbsl和变量名ITbsl是两个不同的变量. 标识符所谓标识符,就是指变量.函数.属性的名字,或者函数的参 ...
netty入门（一）
1. netty入门(一) 1.1. 传统socket编程在任何时候都可能有大量的线程处于休眠状态,只是等待输入或者输出数据就绪,这可能算是一种资源浪费. 需要为每个线程的调用栈都分配内存,其默认值 ...
【vim】模式与模式切换
很多初学者启动vim后,不知道怎么输入字符:按了半天字母,结果屏幕还是空的. vim和记事本或WORD不一样,不是一打开后就可以输入文字,此时它处于正常模式. vim一共有4个模式: 正常模式 (No ...
str() vs repr() in Python
str() 和 repr() 都是用作一个对象的字符表示. 1 str()的举例: s = 'Hello, Geeks.' print str(s) print str(2.0/11.0) 输出结果: ...
ubuntu 16.04 安装cuda的方法
很多神经网络架构都需要安装CUDA,安装这个的确费了我不少时间,是要总结一下流程了. 安装这个,最好使用官网的安装步骤和流程,不然,会走很多弯路: https://developer.nvidia.c ...

RNN入门（4）利用LSTM实现整数加法运算

RNN入门（4）利用LSTM实现整数加法运算的更多相关文章

随机推荐

热门专题