python MLP 神经网络使用 MinMaxScaler 没有 StandardScaler效果好

MLP 64,2 preprocessing.MinMaxScaler().fit(X)
                               test confusion_matrix:
[[129293   2734]
[   958 23375]]
             precision    recall f1-score   support

0 0.99 0.98 0.99 132027
1 0.90 0.96 0.93 24333

avg / total 0.98 0.98 0.98 156360

all confusion_matrix:
[[646945 13384]
[ 4455 117015]]
precision recall f1-score support

0 0.99 0.98 0.99 660329
1 0.90 0.96 0.93 121470

avg / total 0.98 0.98 0.98 781799

black verify confusion_matrix:
[0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 0 0
0 0 0 0 0]
/root/anaconda2/lib/python2.7/site-packages/sklearn/metrics/classification.py:1137: UndefinedMetricWarning: Recall and F-score are ill-defined and being set to 0.0 in labels with no true samples.
'recall', 'true', average, warn_for)
precision recall f1-score support

0 0.00 0.00 0.00 0
1 1.00 0.07 0.13 42

avg / total 1.00 0.07 0.13 42

white verify confusion_matrix:
[1 1 1 1 1 1 0]
precision recall f1-score support

0 1.00 0.14 0.25 7
1 0.00 0.00 0.00 0

avg / total 1.00 0.14 0.25 7

unknown_verify:
[1 0 0 1 1 0 0 0 1 1 0 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 0 1 0 0 1 1 1 1
0 1 1 1 1 0 1 0 0 1 0 1 0 1 0 0 1 0 0 1 1 0 0 1 0 0 0 1 0 1 1 0 0 1 0 0 0]

MLP 64，2 使用preprocessing.StandardScaler().fit(X)
[[131850    180]
[   230 24100]]
             precision    recall f1-score   support

0 1.00 1.00 1.00 132030
1 0.99 0.99 0.99 24330

avg / total 1.00 1.00 1.00 156360

all confusion_matrix:
[[659500 829]
[ 1195 120275]]
precision recall f1-score support

0 1.00 1.00 1.00 660329
1 0.99 0.99 0.99 121470

avg / total 1.00 1.00 1.00 781799

black verify confusion_matrix:
[0 1 1 0 0 0 0 1 1 1 0 1 1 1 1 1 1 0 1 1 1 0 0 0 1 1 1 0 0 0 1 1 1 1 1 1 1
0 0 0 1 1]
/root/anaconda2/lib/python2.7/site-packages/sklearn/metrics/classification.py:1137: UndefinedMetricWarning: Recall and F-score are ill-defined and being set to 0.0 in labels with no true samples.
'recall', 'true', average, warn_for)
precision recall f1-score support

0 0.00 0.00 0.00 0
1 1.00 0.62 0.76 42

avg / total 1.00 0.62 0.76 42

white verify confusion_matrix:
[0 0 1 0 1 1 0]
precision recall f1-score support

0 1.00 0.57 0.73 7
1 0.00 0.00 0.00 0

avg / total 1.00 0.57 0.73 7

unknown_verify:
[1 0 0 0 1 0 1 1 0 0 1 0 1 1 0 1 0 1 0 0 0 0 1 0 1 0 0 0 0 0 0 1 0 0 1 0 0
0 1 1 1 0 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0]

代码：

    from sklearn import preprocessing

    scaler = preprocessing.StandardScaler().fit(X)

    #scaler = preprocessing.MinMaxScaler().fit(X)

    X = scaler.transform(X)

    print("standard X sample:", X[:3])

    black_verify = scaler.transform(black_verify)

    print(black_verify)

    white_verify = scaler.transform(white_verify)

    print(white_verify)

    unknown_verify = scaler.transform(unknown_verify)

    print(unknown_verify)

    # ValueError: operands could not be broadcast together with shapes (756140,75) (42,75) (756140,75)

    for i in range(20):

        X = np.concatenate((X, black_verify))

        y += black_verify_labels

    labels = ['white', 'CC']

    if True:

        # pdb.set_trace()

        ratio_of_train = 0.8

        X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=(1 - ratio_of_train))

        # X_train=preprocessing.normalize(X_train)

        # X_test=preprocessing.normalize(X_test)

        clf = MLPClassifier(solver='sgd', batch_size=128, learning_rate='adaptive', max_iter=256,

                            hidden_layer_sizes=(64, 2), random_state=1)

        """

        clf = sklearn.ensemble.RandomForestClassifier(n_estimators=n_estimators, verbose=verbose, n_jobs=n_jobs,

                                                      random_state=random_state, oob_score=True)

        """

        clf.fit(X_train, y_train)

        print "test confusion_matrix:"

        # print clf.feature_importances_

        y_pred = clf.predict(X_test)

        print(sklearn.metrics.confusion_matrix(y_test, y_pred))

        print(classification_report(y_test, y_pred))

    else:

        #clf = pickle.loads(open("mpl-acc97-recall98.pkl", 'rb').read())

        clf = pickle.loads(open("mlp-add-topx10.model", 'rb').read())

        y_pred = clf.predict(X)

        print(sklearn.metrics.confusion_matrix(y, y_pred))

        print(classification_report(y, y_pred))

        import sys

        #sys.exit(0)

    print "all confusion_matrix:"

    y_pred = clf.predict(X)

    print(sklearn.metrics.confusion_matrix(y, y_pred))

    print(classification_report(y, y_pred))

python MLP 神经网络使用 MinMaxScaler 没有 StandardScaler效果好的更多相关文章

"多层感知器"--MLP神经网络算法
提到人工智能(Artificial Intelligence,AI),大家都不会陌生,在现今行业领起风潮,各行各业无不趋之若鹜,作为技术使用者,到底什么是AI,我们要有自己的理解. 目前,在人工智能中 ...
MLP神经网络实例--手写识别
1.导入MNIST数据集直接使用fetch_mldata会报错,错误信息是python3.7把fetch_mldata方法移除了,所以需要单独下载数据集从这个网站上下载数据集: https://gi ...
c++实现mlp神经网络
之前一直用theano训练样本,最近需要转成c或c++实现.在网上参考了一下其它代码,还是喜欢c++.但是看了几份cpp代码之后,发现都多少有些bug,很不爽.由于本人编码能力较弱,还花了不少时间改正 ...
深度学习与计算机视觉：基于Python的神经网络的实现
在前面两篇文章介绍了深度学习的一些基本概念,本文则使用Python实现一个简单的深度神经网络,并使用MNIST数据库进行测试. 神经网络的实现,包括以下内容: 神经网络权值的初始化正向传播误差评估 ...
python视频神经网络 Tensorflow
python视频神经网络 Tensorflow 模块视频教程 (带源码) 所属网站分类: 资源下载 > python视频教程作者:smile 链接:http://www.pythonhei ...
python 实现神经网络算法
注: Scratch是一款由麻省理工学院(MIT) 设计开发的一款面向少年的简易编程工具.这里写链接内容本文翻译自“IMPLEMENTING A NEURAL NETWORK FRO ...
使用python制作神经网络——搭建框架
一.神经网络的大体结构可分为三个函数,分别如下: 1.初始化函数设定输入层节点,隐藏层节点和输出层节点的数量. 2.训练学习给定训练集样本后,优化权重. 3.查询给定输入,从输出节点给出答案所 ...
python keras 神经网络框架的使用以及实例
先吐槽一下这个基于theano的keras有多难装,反正我是在windows下折腾到不行(需要64bit,vs c++2015),所以自己装了一个双系统.这才感到linux系统的强大之初,难怪大公司都 ...
Python实现神经网络算法识别手写数字集
最近忙里偷闲学习了一点机器学习的知识,看到神经网络算法时我和阿Kun便想到要将它用Python代码实现.我们用了两种不同的方法来编写它.这里只放出我的代码. MNIST数据集基于美国国家标准与技术研究 ...

随机推荐

linux无线网络配置_转
转自:http://www.cnblogs.com/dartagnan/archive/2010/12/05/2003521.html 一位资生linux 原文:http://www.hpl.hp ...
JVM调优- jmap（转）
http://blog.csdn.net/fenglibing/article/details/6411953 1.介绍打印出某个java进程(使用pid)内存内的,所有‘对象’的情况(如:产生那些 ...
C语言基础知识【数据类型】
C 数据类型1.在 C 语言中,数据类型指的是用于声明不同类型的变量或函数的一个广泛的系统.变量的类型决定了变量存储占用的空间,以及如何解释存储的位模式.2.C 中的类型可分为以下几种:序号类 ...
Java Enum 比较用 == 还是 eques
我是把枚举当作常量来使用的,枚举中还有两个自己的属性,关注到这个地方的朋友对枚举已经有了认识,这里就不再编写枚举的demo了,这里我直接说结果吧,在枚举中使用==和equals比较效果是一样的,查看源 ...
python解释器安装教程
1. 首先,打开python的官网:python.org 2. 首页downloads下打开, 3. 最上边是两个最新的版本,长期计划,推荐使用python3,如果长期打算用p3,默认使用最新版本.如 ...
iOS应用的执行原理
本文转自:http://www.cnblogs.com/oc-bowen/p/6061261.html http://www.cnblogs.com/oc-bowen/p/6061178.html 一 ...
Python菜鸟之路：Python基础
一.Python版本升级至3.0的必然性 In November 2014, it was announced that Python 2.7 would be supported until 202 ...
NOI-linux下VIM的个人常用配置
路径:/etc/vim/vimrc 打开终端:Ctrl+Alt+T 输入:sudo vim或gedit /etc/vim/vimrc (推荐用gedit,更好操作) 以下是我的配置: "我的 ...
教你管理SQL备份与恢复系列（1-20）
原链接:https://bbs.51cto.com/thread-1147908-1.html 教你备份与恢复数据库,直接下面下文档吧. 教你备份与恢复数据库(1)事务 http://bbs.51ct ...
java面向对象入门之创建类
/* Name:如何创建类的实例 Power by Stuart Date:2015-4-23*/ //一个bike测试类 public class bikeTest{ //bike 一个变量 Str ...

python MLP 神经网络使用 MinMaxScaler 没有 StandardScaler效果好

python MLP 神经网络使用 MinMaxScaler 没有 StandardScaler效果好的更多相关文章

随机推荐

热门专题