《机器学习实战》学习笔记第五章 —

一.有关笔记：

2.吴恩达机器学习笔记（十一） —— Large Scale Machine Learning

二.Python源码（不带正则项）：

 # coding:utf-8

 '''

 Created on Oct 27, 2010

 Logistic Regression Working Module

 @author: Peter

 '''

 from numpy import *

 def sigmoid(inX):

     return 1.0 / (1 + exp(-inX))

 def gradAscent(dataMatIn, classLabels):

     dataMatrix = mat(dataMatIn)  # convert to NumPy matrix

     labelMat = mat(classLabels).transpose()  # convert to NumPy matrix

     m, n = shape(dataMatrix)

     alpha = 0.001

     maxCycles = 500

     weights = ones((n, 1))

     for k in range(maxCycles):  # heavy on matrix operations

         h = sigmoid(dataMatrix * weights)  # matrix mult

         error = (labelMat - h)  # vector subtraction

         weights = weights + alpha * dataMatrix.transpose() * error  # matrix mult

     return weights

 def stocGradAscent0(dataMatrix, classLabels,numIter=150):

     m, n = shape(dataMatrix)

     alpha = 0.01

     weights = ones(n)  # initialize to all ones

     for j in range(numIter):

         for i in range(m):

             h = sigmoid(sum(dataMatrix[i] * weights))

             error = classLabels[i] - h

             weights = weights + alpha * error * dataMatrix[i]

     return weights

 def stocGradAscent1(dataMatrix, classLabels, numIter=150):

     m, n = shape(dataMatrix)

     weights = ones(n)  # initialize to all ones

     for j in range(numIter):

         dataIndex = range(m)

         for i in range(m):

             alpha = 4 / (1.0 + j + i) + 0.0001  # apha decreases with iteration, does not

             randIndex = int(random.uniform(0, len(dataIndex)))  # go to 0 because of the constant

             h = sigmoid(sum(dataMatrix[randIndex] * weights))

             error = classLabels[randIndex] - h

             weights = weights + alpha * error * dataMatrix[randIndex]

             del (dataIndex[randIndex])

     return weights

 def classifyVector(inX, weights):

     prob = sigmoid(sum(inX * weights))

     if prob > 0.5:

         return 1.0

     else:

         return 0.0

 def colicTest():

     frTrain = open('horseColicTraining.txt')

     frTest = open('horseColicTest.txt')

     trainingSet = []

     trainingLabels = []

     for line in frTrain.readlines():

         currLine = line.strip().split('\t')

         lineArr = []

         for i in range(21):

             lineArr.append(float(currLine[i]))

         trainingSet.append(lineArr)

         trainingLabels.append(float(currLine[21]))

     trainWeights = stocGradAscent1(array(trainingSet), trainingLabels,500)

     errorCount = 0; numTestVec = 0.0

     for line in frTest.readlines():

         numTestVec += 1.0

         currLine = line.strip().split('\t')

         lineArr = []

         for i in range(21):

             lineArr.append(float(currLine[i]))

         if int(classifyVector(array(lineArr), trainWeights)) != int(currLine[21]):

             errorCount += 1

     errorRate = (float(errorCount) / numTestVec)

     print "the error rate of this test is: %f" % errorRate

     return errorRate

 def multiTest():

     numTests = 10; errorSum = 0.0

     for k in range(numTests):

         errorSum += colicTest()

     print "after %d iterations the average error rate is: %f" % (numTests, errorSum / float(numTests))

 if __name__=="__main__":

     multiTest()

三.Batch gradient descent、Stochastic gradient descent、Mini-batch gradient descent 的性能比较

1.Batch gradient descent

 def gradAscent(dataMatIn, classLabels):

     dataMatrix = mat(dataMatIn)  # convert to NumPy matrix

     labelMat = mat(classLabels).transpose()  # convert to NumPy matrix

     m, n = shape(dataMatrix)

     alpha = 0.001

     maxCycles = 500

     weights = ones((n, 1))

     for k in range(maxCycles):  # heavy on matrix operations

         h = sigmoid(dataMatrix * weights)  # matrix mult

         error = (labelMat - h)  # vector subtraction

         weights = weights + alpha * dataMatrix.transpose() * error  # matrix mult

     return weights

其运行结果：

错误率为：28.4%

2.Stochastic gradient descent

 def stocGradAscent0(dataMatrix, classLabels,numIter=150):

     m, n = shape(dataMatrix)

     alpha = 0.01

     weights = ones(n)  # initialize to all ones

     for j in range(numIter):

         for i in range(m):

             h = sigmoid(sum(dataMatrix[i] * weights))

             error = classLabels[i] - h

             weights = weights + alpha * error * dataMatrix[i]

     return weights

迭代次数为150时，错误率为：46.3%

迭代次数为500时，错误率为：32.8%

迭代次数为800时，错误率为：38.8%

3.Mini-batch gradient descent

 def stocGradAscent1(dataMatrix, classLabels, numIter=150):

     m, n = shape(dataMatrix)

     weights = ones(n)  # initialize to all ones

     for j in range(numIter):

         dataIndex = range(m)

         for i in range(m):

             alpha = 4 / (1.0 + j + i) + 0.0001  # apha decreases with iteration, does not

             randIndex = int(random.uniform(0, len(dataIndex)))  # go to 0 because of the constant

             h = sigmoid(sum(dataMatrix[randIndex] * weights))

             error = classLabels[randIndex] - h

             weights = weights + alpha * error * dataMatrix[randIndex]

             del (dataIndex[randIndex])

     return weights

迭代次数为150时，错误率为：37.8%

迭代次数为500时，错误率为：35.2%

迭代次数为800时，错误率为：37.3%

4.综上：

1.在训练数据集较小且特征较少的时候，使用Batch gradient descent的效果是最好的。但如果不能满足这个条件，则可使用Mini-batch gradient descent，并设置合适的迭代次数。

2.对于Stochastic gradient descent 和 Mini-batch gradient descent 而言，并非迭代次数越多效果越好。不知为何？

《机器学习实战》学习笔记第五章 —— Logistic回归的更多相关文章

Programming Entity Framework-dbContext 学习笔记第五章
### Programming Entity Framework-dbContext 学习笔记第五章将图表添加到Context中的方式及容易出现的错误方法结果警告 Add Root 图标中的 ...
[HeadFrist-HTMLCSS学习笔记]第五章认识媒体：给网页添加图像
[HeadFrist-HTMLCSS学习笔记]第五章认识媒体:给网页添加图像干货 JPEG.PNG.GIF有何不同 JPEG适合连续色调图像,如照片:不支持透明度:不支持动画:有损格式 PNG适合单 ...
第五章 Logistic回归
第五章 Logistic回归假设现在有一些数据点,我们利用一条直线对这些点进行拟合(该线称为最佳拟合直线),这个拟合过程就称作回归. 为了实现Logistic回归分类器,我们可以在每个特征上都乘以一 ...
《Spring实战》学习笔记-第五章：构建Spring web应用
之前一直在看<Spring实战>第三版,看到第五章时发现很多东西已经过时被废弃了,于是现在开始读<Spring实战>第四版了,章节安排与之前不同了,里面应用的应该是最新的技术. ...
【马克-to-win】学习笔记—— 第五章异常Exception
第五章异常Exception [学习笔记] [参考:JDK中文(类 Exception)] java.lang.Object java.lang.Throwable java.lang.Except ...
【机器学习实战学习笔记(2-2)】决策树python3.6实现及简单应用
文章目录 1.ID3及C4.5算法基础 1.1 计算香农熵 1.2 按照给定特征划分数据集 1.3 选择最优特征 1.4 多数表决实现 2.基于ID3.C4.5生成算法创建决策树 3.使用决策树进行分 ...
【机器学习实战学习笔记(1-1)】k-近邻算法原理及python实现
笔者本人是个初入机器学习的小白,主要是想把学习过程中的大概知识和自己的一些经验写下来跟大家分享,也可以加强自己的记忆,有不足的地方还望小伙伴们批评指正,点赞评论走起来~ 文章目录 1.k-近邻算法概述 ...
opencv图像处理基础 (《OpenCV编程入门--毛星云》学习笔记一---五章)
#include <QCoreApplication> #include <opencv2/core/core.hpp> #include <opencv2/highgu ...
学习笔记第五章使用CSS美化网页文本
第五章使用CSS美化网页文本学习重点定义字体类型.大小.颜色等字体样式: 设计文本样式,如对齐.行高.间距等: 能够灵活设计美观.实用的网页正文版式. 5.1 字体样式 5.1.1 定义字体 ...

随机推荐

cdn日志统一下载程序
最近实现了网宿cdn,阿里云cdn,腾讯cdn的日志统一格式下载程序,使用简单方便,有需要详见代码: https://github.com/mikeluwen/CdnLogDownload
debian SSD ext4 4K 对齐
新入手了一台thinkpad, 原来的机械硬盘是500G的, 于是购入一块镁光的MX200 250G的SSD来新装debian stable(jessie) 1, 安装系统的之前按住F1进入bios后 ...
Sqlserver建立Oracle的鏈接服務器
--建立数据库链接服务器 EXEC sp_addlinkedserver @server =N'TestOracle', --要创建的链接服务器别名 @srvproduct=N'Oracle', -- ...
mysql中百万级别分页查询性能优化
前提条件: 1.表的唯一索引 2.百万级数据 SQL语句: select c.* FROM ( SELECT a.logid FROM tableA a where 1 = 1 <#if pho ...
iOS iPhoneX/iPhoneXS/iPhoneXR/iPhoneXS Max系列适配
以前异性屏只有一款iPhoneX,所以在适配的时候直接判断高度是否等于812即可判断是否是iPhoneX #define IS_IPHONE_X (IS_IPHONE && SCREE ...
JavaScript_DOM编程艺术第二版[阅]
前两年迫于项目的需要,只是拿来JQuery用到项目中,并没有实质上理解javascript(貌似其他人也是这么干的)~ 随着最近几年,得益于Nodejs, React, Vue等,javascript ...
JavaScript跨浏览器事件处理
var EventUtil = { getEvent: function(event){ return event ? event : window.event; }, getTarget: func ...
Dockerfile安装KOD可道云
[root@docker01 base2]# cat Dockerfile FROM centos:6.8 RUN yum install openssh-server -y RUN /etc/ini ...
ActiveMQ与xml rpc
最近项目在做平台间的消息传递,也让我对平台间消息的传递进行了深一步的探讨.先叙述一下概况公司上一个版本用的是winform做的监控软件,主要做设备的通信和控制,基本的连接如下
2016 acm香港网络赛 A题. A+B Problem (FFT)
原题地址:https://open.kattis.com/problems/aplusb FFT代码参考kuangbin的博客:http://www.cnblogs.com/kuangbin/arch ...

《机器学习实战》学习笔记第五章 —— Logistic回归

《机器学习实战》学习笔记第五章 —— Logistic回归的更多相关文章

随机推荐

热门专题