NLP-训练个model出来写诗

2018年新年，腾讯整出来个ai春联很吸引眼球，刚好有个需求让我看下能不能训出来个model来写出诗经一样的文风，求助了下小伙伴，直接丢过来2个github，原话是：

查了一下诗经一共38000个字，应该是可以训练出一个语言模型的。只是怕机器写出来的诗一般都没灵魂。https://github.com/hjptriplebee/Chinese_poem_generator； https://github.com/xue2han/AncientChinesePoemRNN.

我测试了，第一个没跑通，没有时间去check，所以直接第二个效果赞赞的。只跑了4000个epoch就给出了我惊喜，结果如下：

冲着这个效果这么厉害，一定要趁热扒一下背后的NLP技术。

首先参考AI对联背后的技术。

智能春联的核心技术从大的范畴上属于NLP，自然语言处理技术。创作春联又可以归类为其中的语言生成方向的技术，国内的语言生成研究可以追溯到20世纪90年代，至今已经探索了各种方法，主要有基于模版、随机生成并测试、基于遗传算法、基于实例推理、基于统计机器翻译等各种类型的方法。

本文举两个典型的技术途径作为案例：

1.第一种是没文化生成：即不去了解任何信息的内容，程序根本不知道文字内容是啥，只是从信息熵的角度进行随机生成与测试，在计算机眼里这只是“熵为**的一个随机数据序列”。专业的说法叫做不加领域知识的LSTM生成。LSTM是一种RNN网络（循环神经网络），适用于时序性较强的语言类样本，这里主要使用信息熵作为收敛的损失函数。这种方法生成的语言往往经不住推敲，缺乏意境、主题性等，主要是因为损失函数的定义缺少“文化”，过于强调“信息熵”。

RNN示意图

2.第二种是有文化生成：即在算法中增加了格律诗的领域知识，例如格律押韵、主题意境等。专业的说法叫基于主题模型的统计机器翻译生成。统计机器翻译主要是一类映射源语言与目标语言的模型。主要使用生成对联与参考标准集之间的相似度作为收敛的损失函数。这种方法的缺陷在于春联的质量与参考标准集强相关，容易陷入单一风格化，难以创造真正“属于机器自己的风格”。

神经机器翻译架构

一、不加领域知识的LSTM生成

1）从网上搜集了各式春联共6900对

2）将汉字编码为数字，或者叫做Encoder，并将数据分割为训练集和测试集

3）定义LSTM模型

4）用加权交叉熵损失函数训练模型，LOSS控制在1.5左右，训练结束

5）自动生成新的春联，需要再将数字转为汉字

感受一下代码

将汉字编码为数字，或者叫做Encoder，并将数据分割为训练集和测试集

couplet_file ="couplet.txt"#对联couplets = []with open(couplet_file,'r') as f: for line in f: try: content = line.replace(' ','') if '_' in content or '(' in content or '（' in content or '《' in content or '[' in content: continue if len(content) < * or len(content) > *: continue content = '[' + content + ']' # print chardet.detect(content) content = content.decode('utf-8') couplets.append(content) except Exception as e: pass# 按字数排序couplets = sorted(couplets,key=lambda line: len(line))print('对联总数: %d'%(len(couplets)))# 统计每个字出现次数all_words = []for couplet in couplets: all_words += [word for word in couplet]counter = collections.Counter(all_words)count_pairs = sorted(counter.items(), key=lambda x: -x[])words, _ = zip(*count_pairs)words = words[:len(words)] + (' ',)# 每个字映射为一个数字IDword_num_map = dict(zip(words, range(len(words))))to_num = lambda word: word_num_map.get(word, len(words))couplets_vector = [ list(map(to_num, couplet)) for couplet in couplets]# 每次取64首对联进行训练, 此参数可以调整batch_size = 64n_chunk = len(couplets_vector) // batch_sizex_batches = []y_batches = []for i in range(n_chunk): start_index = i * batch_size#起始位置 end_index = start_index + batch_size#结束位置 batches = couplets_vector[start_index:end_index] length = max(map(len,batches))#每个batches中句子的最大长度 xdata = np.full((batch_size,length), word_num_map[' '], np.int32) for row in range(batch_size): xdata[row,:len(batches[row])] = batches[row] ydata = np.copy(xdata) ydata[:,:-1] = xdata[:,1:] x_batches.append(xdata) y_batches.append(ydata)

定义LSTM模型（定义cell为一个128维的ht的cell。并使用MultiRNNCell 定义为两层的LSTM）

def neural_network(rnn_size=, num_layers=): cell = tf.nn.rnn_cell.BasicLSTMCell(rnn_size, state_is_tuple=True) cell = tf.nn.rnn_cell.MultiRNNCell([cell] * num_layers, state_is_tuple=True) initial_state = cell.zero_state(batch_size, tf.float32) with tf.variable_scope('rnnlm'): softmax_w = tf.get_variable("softmax_w", [rnn_size, len(words)+]) softmax_b = tf.get_variable("softmax_b", [len(words)+]) with tf.device("/cpu:0"): embedding = tf.get_variable("embedding", [len(words)+, rnn_size]) inputs = tf.nn.embedding_lookup(embedding, input_data) outputs, last_state = tf.nn.dynamic_rnn(cell, inputs, initial_state=initial_state, scope='rnnlm') output = tf.reshape(outputs,[-, rnn_size]) logits = tf.matmul(output, softmax_w) + softmax_b probs = tf.nn.softmax(logits) return logits, last_state, probs, cell, initial_state

用加权交叉熵损失函数训练模型，LOSS控制在1.5左右，训练结束

def train_neural_network(): logits, last_state, _, _, _ = neural_network() targets = tf.reshape(output_targets, [-]) loss = tf.contrib.legacy_seq2seq.sequence_loss_by_example([logits], [targets], [tf.ones_like(targets, dtype=tf.float32)], len(words)) cost = tf.reduce_mean(loss) learning_rate = tf.Variable(0.0, trainable=False) tvars = tf.trainable_variables() grads, _ = tf.clip_by_global_norm(tf.gradients(cost, tvars), ) optimizer = tf.train.AdamOptimizer(learning_rate) train_op = optimizer.apply_gradients(zip(grads, tvars)) with tf.Session() as sess: sess.run(tf.initialize_all_variables()) saver = tf.train.Saver(tf.all_variables()) for epoch in range(): sess.run(tf.assign(learning_rate, 0.01 * (0.97 ** epoch))) n =  for batche in range(n_chunk): train_loss, _ , _ = sess.run([cost, last_state, train_op], feed_dict={input_data: x_batches[n], output_targets: y_batches[n]}) n +=  print(epoch, batche, train_loss) if epoch %  == : saver.save(sess, './couplet.module', global_step=epoch)

自动生成新的春联

saver.restore(sess, 'couplet.module-98')

二、基于主题模型的统计机器翻译生成

1）准备统计模型的训练数据

格律诗训练语料来自互联网，其中包括《唐诗》、《全唐诗》、《全台词》等文献，以及从各大诗词论坛（例如诗词在线、天涯论坛诗词比兴等）抓取并筛选后的格律诗，总计287000多首。

2）设定主题模型，这里使用概率潜在语义分析，PLSA

例如：给定主题词“春日”，根据它在潜在主题空间中的分布向量，可以找出 “玉魄”、“红泥”和 “燕”等空间距离比较近的语义相关词。

3）基于主题模型的词汇扩展

4）定义算法：依照主题词生成首句的算法

5）定义基于统计机器翻译的二、三、四句生成模型

我们采用基于短语的统计机器翻译技术 ，PBSMT是目前一种主流的机器翻译技术，它的优势在于短语翻译结果的选词准确．由于诗词的生成讲求对仗，不涉及远距离语序调整问题，因此，诗词的生成非常适合采用基于短语的机器翻译算法来解决。

6）基于BLEU的评测方法，结果收敛后保存模型

BLEU 的直观思想是翻译结果越接近参考答案则翻译质量越好．相应的，我们认为如果根据给定上句生成的下句能够更贴近已有的参考下句则系统的生成质量越好，但由于诗词在内容表现上丰富多样，所以需要搜集拥有多个参考下句的数据样本加入答案集。BLEU通过对生成候选句与源语句的参考句进行１元词到N元词的重合度统计，结合下式衡量生成结果的好坏。

7）给定主题词，生成新的格律诗

论文全文下载，请在公众号回复：20180216

参考资料

关于RNN和LSTM原理的说明： http://www.jianshu.com/p/9dc9f41f0b29

LSTM深度学习写春联：http://blog.csdn.net/leadai/article/details/79015862

基于主题模型和统计机器翻译方法的中文格律诗自动生成：蒋锐滢，崔磊，何晶，周明，潘志庚

接下来看代码

参照[char-rnn-tensorflow](https://github.com/sherjilozair/char-rnn-tensorflow)，使用RNN的字符模型，学习并生成古诗。
数据来自于http://www16.zzu.edu.cn/qts/ ,总共4万多首唐诗。

tensorflow 1.0
python2

先看训练数据，poems.txt.截取片段

煌煌道宫，肃肃太清。礼光尊祖，乐备充庭。罄竭诚至，希夷降灵。云凝翠盖，风焰红旌。众真以从，九奏初迎。永惟休v，是锡和平。

种瓜黄台下，瓜熟子离离。一摘使瓜好，再摘使瓜稀。三摘犹自可，摘绝抱蔓归。

看出来去掉了标题和作者的干扰。这点很重要，我诗经训练出来的结果很奇葩，估计就是我标题没有去。

核心训练代码。train.py

from __future__ import print_function

import numpy as np

import tensorflow as tf

import argparse

import time

import os,sys

from six.moves import cPickle

from utils import TextLoader

from model import Model

def main():

    parser = argparse.ArgumentParser()

    parser.add_argument('--save_dir', type=str, default='save',

                       help='directory to store checkpointed models')

    parser.add_argument('--rnn_size', type=int, default=,

                       help='size of RNN hidden state')

    parser.add_argument('--num_layers', type=int, default=,

                       help='number of layers in the RNN')

    parser.add_argument('--model', type=str, default='lstm',

                       help='rnn, gru, or lstm')

    parser.add_argument('--batch_size', type=int, default=,

                       help='minibatch size')

    parser.add_argument('--num_epochs', type=int, default=,

                       help='number of epochs')

    parser.add_argument('--save_every', type=int, default=,

                       help='save frequency')

    parser.add_argument('--grad_clip', type=float, default=.,

                       help='clip gradients at this value')

    parser.add_argument('--learning_rate', type=float, default=0.002,

                       help='learning rate')

    parser.add_argument('--decay_rate', type=float, default=0.97,

                       help='decay rate for rmsprop')

    parser.add_argument('--init_from', type=str, default=None,

                       help="""continue training from saved model at this path. Path must contain files saved by previous training process:

                            'config.pkl'        : configuration;

                            'chars_vocab.pkl'   : vocabulary definitions;

                            'iterations'        : number of trained iterations;

                            'losses-*'          : train loss;

                            'checkpoint'        : paths to model file(s) (created by tf).

                                                  Note: this file contains absolute paths, be careful when moving files around;

                            'model.ckpt-*'      : file(s) with model definition (created by tf)

                        """)

    args = parser.parse_args()

    train(args)

def train(args):

    data_loader = TextLoader(args.batch_size)

    args.vocab_size = data_loader.vocab_size

    # check compatibility if training is continued from previously saved model

    if args.init_from is not None:

        # check if all necessary files exist

        assert os.path.isdir(args.init_from)," %s must be a a path" % args.init_from

        assert os.path.isfile(os.path.join(args.init_from,"config.pkl")),"config.pkl file does not exist in path %s"%args.init_from

        assert os.path.isfile(os.path.join(args.init_from,"chars_vocab.pkl")),"chars_vocab.pkl.pkl file does not exist in path %s" % args.init_from

        ckpt = tf.train.get_checkpoint_state(args.init_from)

        assert ckpt,"No checkpoint found"

        assert ckpt.model_checkpoint_path,"No model path found in checkpoint"

        assert os.path.isfile(os.path.join(args.init_from,"iterations")),"iterations file does not exist in path %s " % args.init_from

        # open old config and check if models are compatible

        with open(os.path.join(args.init_from, 'config.pkl'),'rb') as f:

            saved_model_args = cPickle.load(f)

        need_be_same=["model","rnn_size","num_layers"]

        for checkme in need_be_same:

            assert vars(saved_model_args)[checkme]==vars(args)[checkme],"Command line argument and saved model disagree on '%s' "%checkme

        # open saved vocab/dict and check if vocabs/dicts are compatible

        with open(os.path.join(args.init_from, 'chars_vocab.pkl'),'rb') as f:

            saved_chars, saved_vocab = cPickle.load(f)

        assert saved_chars==data_loader.chars, "Data and loaded model disagree on character set!"

        assert saved_vocab==data_loader.vocab, "Data and loaded model disagree on dictionary mappings!"

    with open(os.path.join(args.save_dir, 'config.pkl'), 'wb') as f:

        cPickle.dump(args, f)

    with open(os.path.join(args.save_dir, 'chars_vocab.pkl'), 'wb') as f:

        cPickle.dump((data_loader.chars, data_loader.vocab), f)

    model = Model(args)

    with tf.Session() as sess:

        tf.global_variables_initializer().run()

        saver = tf.train.Saver(tf.global_variables())

        iterations =

        # restore model and number of iterations

        if args.init_from is not None:

            saver.restore(sess, ckpt.model_checkpoint_path)

            with open(os.path.join(args.save_dir, 'iterations'),'rb') as f:

                iterations = cPickle.load(f)

        losses = []

        for e in range(args.num_epochs):

            sess.run(tf.assign(model.lr, args.learning_rate * (args.decay_rate ** e)))

            data_loader.reset_batch_pointer()

            for b in range(data_loader.num_batches):

                iterations +=

                start = time.time()

                x, y = data_loader.next_batch()

                feed = {model.input_data: x, model.targets: y}

                train_loss, _ , _ = sess.run([model.cost, model.final_state, model.train_op], feed)

                end = time.time()

                sys.stdout.write('\r')

                info = "{}/{} (epoch {}), train_loss = {:.3f}, time/batch = {:.3f}" \

                    .format(e * data_loader.num_batches + b,

                            args.num_epochs * data_loader.num_batches,

                            e, train_loss, end - start)

                sys.stdout.write(info)

                sys.stdout.flush()

                losses.append(train_loss)

                if (e * data_loader.num_batches + b) % args.save_every == \

                    or (e==args.num_epochs- and b == data_loader.num_batches-): # save for the last result

                    checkpoint_path = os.path.join(args.save_dir, 'model.ckpt')

                    saver.save(sess, checkpoint_path, global_step = iterations)

                    with open(os.path.join(args.save_dir,"iterations"),'wb') as f:

                        cPickle.dump(iterations,f)

                    with open(os.path.join(args.save_dir,"losses-"+str(iterations)),'wb') as f:

                        cPickle.dump(losses,f)

                    losses = []

                    sys.stdout.write('\n')

                    print("model saved to {}".format(checkpoint_path))

            sys.stdout.write('\n')

if __name__ == '__main__':

    main()

再看下model.py

#-*- coding:utf- -*-

import tensorflow as tf

from tensorflow.contrib import rnn

from tensorflow.contrib import legacy_seq2seq

import numpy as np

class Model():

    def __init__(self, args,infer=False):

        self.args = args

        if infer:

            args.batch_size = 

        if args.model == 'rnn':

            cell_fn = rnn.BasicRNNCell

        elif args.model == 'gru':

            cell_fn = rnn.GRUCell

        elif args.model == 'lstm':

            cell_fn = rnn.BasicLSTMCell

        else:

            raise Exception("model type not supported: {}".format(args.model))

        cell = cell_fn(args.rnn_size,state_is_tuple=False)

        self.cell = cell = rnn.MultiRNNCell([cell] * args.num_layers,state_is_tuple=False)

        self.input_data = tf.placeholder(tf.int32, [args.batch_size, None])

        # the length of input sequence is variable.

        self.targets = tf.placeholder(tf.int32, [args.batch_size, None])

        self.initial_state = cell.zero_state(args.batch_size, tf.float32)

        with tf.variable_scope('rnnlm'):

            softmax_w = tf.get_variable("softmax_w", [args.rnn_size, args.vocab_size])

            softmax_b = tf.get_variable("softmax_b", [args.vocab_size])

            with tf.device("/cpu:0"):

                embedding = tf.get_variable("embedding", [args.vocab_size, args.rnn_size])

                inputs = tf.nn.embedding_lookup(embedding, self.input_data)

        outputs, last_state = tf.nn.dynamic_rnn(cell,inputs,initial_state=self.initial_state,scope='rnnlm')

        output = tf.reshape(outputs,[-, args.rnn_size])

        self.logits = tf.matmul(output, softmax_w) + softmax_b

        self.probs = tf.nn.softmax(self.logits)

        targets = tf.reshape(self.targets, [-])

        loss = legacy_seq2seq.sequence_loss_by_example([self.logits],

                [targets],

                [tf.ones_like(targets,dtype=tf.float32)],

                args.vocab_size)

        self.cost = tf.reduce_mean(loss)

        self.final_state = last_state

        self.lr = tf.Variable(0.0, trainable=False)

        tvars = tf.trainable_variables()

        grads, _ = tf.clip_by_global_norm(tf.gradients(self.cost, tvars),

                args.grad_clip)

        optimizer = tf.train.AdamOptimizer(self.lr)

        self.train_op = optimizer.apply_gradients(zip(grads, tvars))

    def sample(self, sess, chars, vocab, prime=u'', sampling_type=):

        def pick_char(weights):

            if sampling_type == :

                sample = np.argmax(weights)

            else:

                t = np.cumsum(weights)

                s = np.sum(weights)

                sample = int(np.searchsorted(t, np.random.rand()*s))

            return chars[sample]

        for char in prime:

            if char not in vocab:

                return u"{} is not in charset!".format(char)

        if not prime:

            state = self.cell.zero_state(, tf.float32).eval()

            prime = u'^'

            result = u''

            x = np.array([list(map(vocab.get,prime))])

            [probs,state] = sess.run([self.probs,self.final_state],{self.input_data: x,self.initial_state: state})

            char = pick_char(probs[-])

            while char != u'$':

                result += char

                x = np.zeros((,))

                x[,] = vocab[char]

                [probs,state] = sess.run([self.probs,self.final_state],{self.input_data: x,self.initial_state: state})

                char = pick_char(probs[-])

            return result

        else:

            result = u'^'

            for prime_char in prime:

                result += prime_char

                x = np.array([list(map(vocab.get,result))])

                state = self.cell.zero_state(, tf.float32).eval()

                [probs,state] = sess.run([self.probs,self.final_state],{self.input_data: x,self.initial_state: state})

                char = pick_char(probs[-])

                while char != u'，' and char != u'。':

                    result += char

                    x = np.zeros((,))

                    x[,] = vocab[char]

                    [probs,state] = sess.run([self.probs,self.final_state],{self.input_data: x,self.initial_state: state})

                    char = pick_char(probs[-])

                result += char

            return result[:]

数据预处理utils.py

#-*- coding:utf- -*-

import codecs

import os

import collections

from six.moves import cPickle,reduce,map

import numpy as np

BEGIN_CHAR = '^'

END_CHAR = '$'

UNKNOWN_CHAR = '*'

MAX_LENGTH = 

class TextLoader():

    def __init__(self, batch_size, max_vocabsize=, encoding='utf-8'):

        self.batch_size = batch_size

        self.max_vocabsize = max_vocabsize

        self.encoding = encoding

        data_dir = './data'

        input_file = os.path.join(data_dir, "shijing.txt")

        vocab_file = os.path.join(data_dir, "vocab.pkl")

        tensor_file = os.path.join(data_dir, "data.npy")

        if not (os.path.exists(vocab_file) and os.path.exists(tensor_file)):

            print("reading text file")

            self.preprocess(input_file, vocab_file, tensor_file)

        else:

            print("loading preprocessed files")

            self.load_preprocessed(vocab_file, tensor_file)

        self.create_batches()

        self.reset_batch_pointer()

    def preprocess(self, input_file, vocab_file, tensor_file):

        def handle_poem(line):

            line = line.replace(' ','')

            if len(line) >= MAX_LENGTH:

                index_end = line.rfind(u'。',,MAX_LENGTH)

                index_end = index_end if index_end >  else MAX_LENGTH

                line = line[:index_end+]

            return BEGIN_CHAR+line+END_CHAR

        with codecs.open(input_file, "r", encoding=self.encoding) as f:

            lines = list(map(handle_poem,f.read().strip().split('\n')))

        counter = collections.Counter(reduce(lambda data,line: line+data,lines,''))

        count_pairs = sorted(counter.items(), key=lambda x: -x[])

        chars, _ = zip(*count_pairs)

        self.vocab_size = min(len(chars),self.max_vocabsize - ) +

        self.chars = chars[:self.vocab_size-] + (UNKNOWN_CHAR,)

        self.vocab = dict(zip(self.chars, range(len(self.chars))))

        unknown_char_int = self.vocab.get(UNKNOWN_CHAR)

        with open(vocab_file, 'wb') as f:

            cPickle.dump(self.chars, f)

        get_int = lambda char: self.vocab.get(char,unknown_char_int)

        lines = sorted(lines,key=lambda line: len(line))

        self.tensor = [ list(map(get_int,line)) for line in lines ]

        with open(tensor_file,'wb') as f:

            cPickle.dump(self.tensor,f)

    def load_preprocessed(self, vocab_file, tensor_file):

        with open(vocab_file, 'rb') as f:

            self.chars = cPickle.load(f)

        with open(tensor_file,'rb') as f:

            self.tensor = cPickle.load(f)

        self.vocab_size = len(self.chars)

        self.vocab = dict(zip(self.chars, range(len(self.chars))))

    def create_batches(self):

        self.num_batches = int(len(self.tensor) / self.batch_size)

        self.tensor = self.tensor[:self.num_batches * self.batch_size]

        unknown_char_int = self.vocab.get(UNKNOWN_CHAR)

        self.x_batches = []

        self.y_batches = []

        for i in range(self.num_batches):

            from_index = i * self.batch_size

            to_index = from_index + self.batch_size

            batches = self.tensor[from_index:to_index]

            seq_length = max(map(len,batches))

            xdata = np.full((self.batch_size,seq_length),unknown_char_int,np.int32)

            for row in range(self.batch_size):

                xdata[row,:len(batches[row])] = batches[row]

            ydata = np.copy(xdata)

            ydata[:,:-] = xdata[:,:]

            self.x_batches.append(xdata)

            self.y_batches.append(ydata)

    def next_batch(self):

        x, y = self.x_batches[self.pointer], self.y_batches[self.pointer]

        self.pointer +=

        return x, y

    def reset_batch_pointer(self):

        self.pointer =

测试案例sample.py

#-*- coding:utf- -*-

from __future__ import print_function

import numpy as np

import tensorflow as tf

import argparse

import time

import os

from six.moves import cPickle

from utils import TextLoader

from model import Model

from six import text_type

def main():

    parser = argparse.ArgumentParser()

    parser.add_argument('--save_dir', type=str, default='save',

                       help='model directory to store checkpointed models')

    parser.add_argument('--prime', type=str, default='',

                       help=u'输入指定文字生成藏头诗')

    parser.add_argument('--sample', type=int, default=,

                       help='0 to use max at each timestep, 1 to sample at each timestep')

    args = parser.parse_args()

    sample(args)

def sample(args):

    with open(os.path.join(args.save_dir, 'config.pkl'), 'rb') as f:

        saved_args = cPickle.load(f)

    with open(os.path.join(args.save_dir, 'chars_vocab.pkl'), 'rb') as f:

        chars, vocab = cPickle.load(f)

    model = Model(saved_args, True)

    with tf.Session() as sess:

        tf.global_variables_initializer().run()

        saver = tf.train.Saver(tf.global_variables())

        ckpt = tf.train.get_checkpoint_state(args.save_dir)

        if ckpt and ckpt.model_checkpoint_path:

            saver.restore(sess, ckpt.model_checkpoint_path)

            print(model.sample(sess, chars, vocab, args.prime.decode('utf-8',errors='ignore'), args.sample))

if __name__ == '__main__':

    main()

python sample.py rnn神经网络会生成一首全新的古诗。例如： ”帝以诚求备，堪留百勇杯。教官日与失，共恨五毛宣。鸡唇春疏叶，空衣滴舞衣。丑夫归晚里，此地几何人。”
python sample.py --prime <这里输入指定汉字> rnn神经网络会利用输入的汉字生成一首藏头诗。例如： python sample.py --prime 如花似月 会得到 “如尔残回号，花枝误晚声。似君星度上，月满二秋寒。”

NLP-训练个model出来写诗的更多相关文章

简单明朗的 RNN 写诗教程
目录简单明朗的 RNN 写诗教程数据集介绍代码思路输入 and 输出训练集构建生成一首完整的诗代码实现读取文件统计字数构建word 与 id的映射转成one-hot代码随机打乱 ...
为你写诗：3 步搭建 Serverless AI 应用
作者 | 杜万(倚贤) 阿里巴巴技术专家本文整理自 1 月 2 日社群分享,每月 2 场高质量分享,点击加入社群. 关注"阿里巴巴云原生"公众号,回复关键词 0102 即可下载本 ...
深度学习（三）之LSTM写诗
目录数据预处理构建数据集模型结构生成诗根据上文生成诗生成藏头诗参考根据前文生成诗: 机器学习业,圣贤不可求.临戎辞蜀计,忠信尽封疆.天子咨两相,建章应四方.自疑非俗态,谁复念鹪鹩. 生 ...
AI：为你写诗，为你做不可能的事
最近,一档全程高能的神仙节目,高调地杀入了我们的视野: 没错,就是撒贝宁主持,董卿.康辉等央视名嘴作为评审嘉宾,同时集齐央视"三大名嘴"同台的央视<主持人大赛>,这够不 ...
Qt侠：像写诗一样写代码，玩游戏一样的开心心情，还能领工资！
[软]上海-Qt侠 2017/7/12 16:11:20我完全是兴趣主导,老板不给我钱,我也要写好代码!白天干,晚上干,周一周五干,周末继续干!编程已经深入我的基因,深入我的骨髓,深入我的灵魂!当我解 ...
神经网络写诗（charRNN）
https://github.com/chenyuntc/pytorch-book 基于pytorch ,许多有趣的小应用.感谢作者! 作者的代码写得非常清晰,配置方法也很明确,只需要按照提示,安装依 ...
tensorflow自动写诗
1.目录结构 2.入口类 # coding = utf-8 """ 注意:RNN使用的数据为序列化的数据 RNN网络:主要由多个LSTM计算单元组成,依靠BPTT算法进行 ...
急速搭建 Serverless AI 应用：为你写诗
前言首先介绍下在本文出现的几个比较重要的概念: 函数计算(Function Compute): 函数计算是一个事件驱动的服务,通过函数计算,用户无需管理服务器等运行情况,只需编写代码并上传.函数计算 ...
用JAVA日志来写诗
工欲善其事,必先利其器很多程序员可能都忘了记录应用程序的行为是一件多么重要的事,当遇到多线程环境下高压力导致的并发bug时,你就能体会到记录log的重要性. 有的人很高兴的就在代码里加上了这么句: ...

随机推荐

SharePoint 2013 How to Backup Site Collection Automatically With a PowerShell Script
In this post I will introduce a way how to run a script for backing up SharePoint data which could b ...
Android自带的TTS功能
在Android1.6之后添加了TextToSpeech,也叫TTS,把相应的文字转化成语音播报,增强了用户体验.可以根据语言播报界面上的控件如下: 可以选择的语言但有的语言不支持,比如中文就不支 ...
Spring-MVC配置Gson做为Message Converter解析Json
Spring-MVC配置Gson做为Message Converter解析Json 在学习Spring的时候看到可以使用@RequestBody 和@ResponseBody注解来是的Spring自动 ...
将多个 docx 文件使用 POI 进行合并，生成单个文档，包含图片
1 添加 maven 依赖,需要使用 poi 的依赖项 <!-- https://mvnrepository.com/artifact/org.apache.poi/poi-scratchpad ...
H.264 RTP PAYLOAD 格式
H.264 视频 RTP 负载格式 1. 网络抽象层单元类型 (NALU) NALU 头由一个字节组成, 它的语法如下: +---------------+ |0|1|2|3|4|5|6|7 ...
html与表格（table）相关的属性
<table> 标签定义 HTML 表格.简单的 HTML 表格由 table 元素以及一个或多个 tr.th 或 td 元素组成.tr 元素定义表格行,th 元素定义表头,td 元素定义 ...
dubbo调用服务出现如下异常
log4j:WARN No appenders could be found for logger (org.springframework.context.support.ClassPathXmlA ...
hbase ERROR: wrong number of arguments (3 for 4)
hbase(main):036:0> get 'ddl', 'example', 'info:age'COLUMN ...
Nginx(四)：压缩功能详解
gzip (GNU-ZIP) 是一种压缩技术.经过 gzip 压缩后页面大小可以变为原来的 30%甚至更小. 这样,用户浏览页面的时候速度会快得多. gzip 的压缩页面需要浏览器和服务器双方都支持 ...
c++中浮点数精度设置
1.包含头文件<iomanip>,附注manip是manipulator,操控的简写. 2.第一种写法: cout<<setiosflags(ios::); 第二种写法: co ...

NLP-训练个model出来写诗

NLP-训练个model出来写诗的更多相关文章

随机推荐

热门专题