Implement TensorFlow's next_batch for own data

The version of numpy data

import numpy as np

class Dataset:

    def __init__(self, data):

        self._index_in_epoch = 0

        self._epochs_completed = 0

        self._data = data

        self._num_examples = data.shape[0]

        pass

    @property

    def data(self):

        return self._data

    def next_batch(self, batch_size, shuffle=True):

        start = self._index_in_epoch

        if start == 0 and self._epochs_completed == 0:

            idx = np.arange(0, self._num_examples)

            np.random.shuffle(idx)  # shuffle indexe

            self._data = self.data[idx]  # get the shuffled data

        # go to the data of next batch

        if start + batch_size > self._num_examples:

            '''

            note: when start  == self._num_examples, data_rest_part = np.array([])

            '''

            self._epochs_completed += 1

            # print(self.data)

            rest_num_examples = self._num_examples - start

            data_rest_part = self.data[start:self._num_examples]

            idx_update = np.arange(0, self._num_examples)

            np.random.shuffle(idx_update)

            self._data = self.data[idx_update]  # get another shuffled data

            start = 0

            self._index_in_epoch = batch_size - rest_num_examples

            end = self._index_in_epoch

            data_new_part = self._data[start:end]

            return np.concatenate((data_rest_part, data_new_part), axis=0)

        else:

            self._index_in_epoch += batch_size

            end = self._index_in_epoch

            return self._data[start:end]

dataset = Dataset(np.arange(0, 10))

for i in range(10):

    print(dataset.next_batch(6))

print(dataset.data)

The version of pandas data

import numpy as np

import pandas as pd

class Dataset:

    def __init__(self, data):

        self._index_in_epoch = 0

        self._epochs_completed = 0

        self._data = data

        self._num_examples = data.shape[0]

        pass

    @property

    def data(self):

        return self._data

    def next_batch(self, batch_size, shuffle=True):

        start = self._index_in_epoch

        if start == 0 and self._epochs_completed == 0:

            idx = np.arange(0, self._num_examples)

            np.random.shuffle(idx)  # shuffle index

            self._data = self.data.iloc[idx,:]  # get the shuffled data

        # go to the data of next batch

        if start + batch_size > self._num_examples:

            '''

            note: when start  == self._num_examples, data_rest_part = np.array([])

            '''

            self._epochs_completed += 1

            # print(self.data) # this is for debug

            rest_num_examples = self._num_examples - start

            data_rest_part = self.data.iloc[start:self._num_examples,:]

            idx_update = np.arange(0, self._num_examples)

            np.random.shuffle(idx_update)

            self._data = self.data.iloc[idx_update,:]  # get another shuffled data

            start = 0

            self._index_in_epoch = batch_size - rest_num_examples

            end = self._index_in_epoch

            data_new_part = self._data.iloc[start:end,:]

            return pd.concat((data_rest_part, data_new_part), axis=0)

        else:

            self._index_in_epoch += batch_size

            end = self._index_in_epoch

            return self._data[start:end]

df = pd.DataFrame()

df['a']=np.arange(10)

df['b']=np.arange(10)*10

dataset = Dataset(df)

for i in range(10):

    print(dataset.next_batch(5))

print(dataset.data)

Implement TensorFlow's next_batch for own data的更多相关文章

Tensorflow - Implement for generating some 3-dimensional phony data and fitting them with a plane.
Coding according to TensorFlow 官方文档中文版 import tensorflow as tf import numpy as np ''' Intro. for thi ...
tensorflow.python.framework.errors_impl.PermissionDeniedError: /data; Permission denied
在linux系统中,tensorflow跑mnist数据集出现错误,本应该自动下载的数据集将mnist自动下载的路径,由/data/mnist之前的/删掉即可.改为data/mnist.
tensorflow的object detection的data augmention的使用
在protoc的目录下有data augmention的提示,而且注意是repeated,也就是你要这样写: 不能写在一个data_aumentation_options下面,至于有哪些选项可以用,可 ...
How to use Data Iterator in TensorFlow
How to use Data Iterator in TensorFlow one_shot_iterator initializable iterator reinitializable iter ...
[TensorFlow] Introduction to TensorFlow Datasets and Estimators
Datasets and Estimators are two key TensorFlow features you should use: Datasets: The best practice ...
逻辑回归，附tensorflow实现
本文旨在通过二元分类问题.多元分类问题介绍逻辑回归算法,并实现一个简单的数字分类程序在生活中,我们经常会碰到这样的问题: 根据苹果表皮颜色判断是青苹果还是红苹果根据体温判断是否发烧这种答案只有两 ...
学习笔记TF057:TensorFlow MNIST，卷积神经网络、循环神经网络、无监督学习
MNIST 卷积神经网络.https://github.com/nlintz/TensorFlow-Tutorials/blob/master/05_convolutional_net.py .Ten ...
逻辑斯特回归tensorflow实现
calss #!/usr/bin/python2.7 #coding:utf-8 from __future__ import print_function import tensorflow as ...
TensorFlow在win10上的安装与使用（三）
本篇博客介绍最经典的手写数字识别Mnist在tf上的应用. Mnist有两种模型,一种是将其数据集看作是没有关系的像素值点,用softmax回归来做.另一种就是利用卷积神经网络,考虑局部图片像素的相关 ...

随机推荐

什么是 Serverless 应用引擎？优势有哪些？
Serverless 应用引擎(Serverless App Engine,简称 SAE)是面向应用的 Serverless PaaS 平台,能够帮助 PaaS 层用户免运维 IaaS,按需使用,按量 ...
给Repater增加等号
//不改变数据结构的情况下,增加行号.对Application服务器压力增大,减少DB服务器压力. protected void repShow_ItemDataBound(object sen ...
应对Hadoop集群数据疯长，这里祭出了4个治理对策！
一.背景在目前规模比较大的互联网公司中,总数据量能达到10PB甚至几十PB数据量的公司,我认为中国已经有超过了20家了.而在这些公司中,也有很多家公司的日数据增长达到100TB+ 了. 所以我们每 ...
colspan和rowspan
colspan和rowspan这两个属性用于创建特殊的表格. colspan用来指定单元格横向跨越的列数:colspan就是合并列的,colspan=2的话就是合并两列. rowspan用来指定单元格 ...
SQL-Serverの自動採番(IDENTITY値)の取得・リセット
システムに必要なテーブルで.自動的に番号を振っていくものが必要なときがあります. たとえば.各種の伝票データの伝票番号の様なものです. プログラム処理上.データを登録した直後に.自動採番された値を取得 ...
docker使用国内镜像加速
在daemon.json文件里以下国内镜像 { "registry-mirrors": [ "https://registry.docker-cn.com", ...
linux 之实现定时任务
一.方式一 (1)命令行的方法: 一.方式一需求:每分钟执行一次/etc 目录的添加到/tmp/a.txt 中 (1) touch a.txt创建文件 (2) crotab -e 进行任务的定制 ...
SSD源码解读——网络搭建
之前,对SSD的论文进行了解读,可以回顾之前的博客:https://www.cnblogs.com/dengshunge/p/11665929.html. 为了加深对SSD的理解,因此对SSD的源码进 ...
五，pod控制器应用进阶
目录 Pod 资源标签给资源打标签标签选择器 Pod 生命周期 pod状态探测 livenessProbe 状态探测 livenessProbe exec 测试 livenessProbe ht ...
Java入门指南-04 顺序、分支、循环
顺序结构从上至下,依次执行 if 语句在 Java 里,用 if 语句来实现“当满足 XXX 条件时,执行 YYY”这样的逻辑判断.例如,在使用共享单车时需要检查人的年纪.如果在 12 岁以下,则禁 ...

Implement TensorFlow's next_batch for own data

The version of numpy data

The version of pandas data

Implement TensorFlow's next_batch for own data的更多相关文章

随机推荐

热门专题