机器学习算法整理（七）支持向量机以及SMO算法实现

以下均为自己看视频做的笔记，自用，侵删！

还参考了：http://www.ai-start.com/ml2014/

在监督学习中，许多学习算法的性能都非常类似，因此，重要的不是你该选择使用学习算法A还是学习算法B，而更重要的是，应用这些算法时，所创建的大量数据在应用这些算法时，表现情况通常依赖于你的水平。比如：你为学习算法所设计的特征量的选择，以及如何选择正则化参数，诸如此类的事。还有一个更加强大的算法广泛的应用于工业界和学术界，它被称为支持向量机(Support Vector Machine)。与逻辑回归和神经网络相比，支持向量机，或者简称SVM，在学习复杂的非线性方程时提供了一种更为清晰，更加强大的方式。

容错能力越强越好

b为平面的偏正向，w为平面的法向量，x到平面的映射：

先求的是，距分界线距离最小的点；然后再求的是什么样的w和b，使得这样的点，距离分界线的值最大。

放缩之后：; 又要取其为min，即取 yi*(w^T*Q(xi) + b) = 1 =>

补充：

下面的 x_i · x_j 是算的内积：如，(3_i, 3_i) · (3_j, 3_j) ==> 3_i * 3_j + 3_i * 3_j = 18;

如上面实例，x2就是没有发挥作用的数据点，α为0；x1, x2就是支持向量，α不为0的点；

松弛因子：

核变换：低微不可分==> 映射到高维

举例：

SMO算法实现：

除了α1，α2当成变量，其他的α都当成常数项。

（L是取值的下界，H是取值的上界）

代码实现：

import numpy as np

def loadDataSet(fileName):

    dataMat = []; labelMat = []

    fr = open(fileName)

    for line in fr.readlines():

        lineArr = line.strip().split('\t')

        dataMat.append([float(lineArr[0]), float(lineArr[1])])

        labelMat.append(float(lineArr[2]))

    return dataMat,labelMat

def selectJrand(i,m):

    j=i #we want to select any J not equal to i

    while (j==i):

        j = int(np.random.uniform(0,m))

    return j

# 控制aj的上下界

def clipAlpha(aj,H,L):

    if aj > H: 

        aj = H

    if L > aj:

        aj = L

    return aj

# dataMat: 数据; classLabels: Y值; C:V值; toler:容忍程度; maxIter: 最大迭代次数

def smoSimple(dataMatIn, classLabels, C, toler, maxIter):

    #初始化操作

    dataMatrix = np.mat(dataMatIn); labelMat = np.mat(classLabels).transpose()

    b = 0; m,n = np.shape(dataMatrix)

    # 进行初始化

    alphas = np.mat(np.zeros((m,1)))

    iter = 0

    while (iter < maxIter):

        alphaPairsChanged = 0

        # m : 数据样本数

        for i in range(m):

            # 先计算FXi，这里只用了线性的 kernel，相当于不变。

            fXi = float(np.multiply(alphas,labelMat).T*(dataMatrix*dataMatrix[i,:].T)) + b

            Ei = fXi - float(labelMat[i])#if checks if an example violates KKT conditions

            # 设置限制条件

            if ((labelMat[i]*Ei < -toler) and (alphas[i] < C)) or ((labelMat[i]*Ei > toler) and (alphas[i] > 0)):

                #随机再选一个不等于i的数

                j = selectJrand(i,m)

                fXj = float(np.multiply(alphas,labelMat).T*(dataMatrix*dataMatrix[j,:].T)) + b

                Ej = fXj - float(labelMat[j])

                alphaIold = alphas[i].copy(); alphaJold = alphas[j].copy();

                # 控制边界条件

                # 定义了上(H)下(L)界取值范围

                if (labelMat[i] != labelMat[j]):

                    L = max(0, alphas[j] - alphas[i])

                    H = min(C, C + alphas[j] - alphas[i])

                else:

                    L = max(0, alphas[j] + alphas[i] - C)

                    H = min(C, alphas[j] + alphas[i])

                if L==H: print "L==H"; continue

                #算出 eta = K11 + K22 - K12

                #这里需要添加一个负号

                eta = 2.0 * dataMatrix[i,:]*dataMatrix[j,:].T - dataMatrix[i,:]*dataMatrix[i,:].T - dataMatrix[j,:]*dataMatrix[j,:].T

                if eta >= 0: print "eta>=0"; continue

                # 即这里就用一个减号

                alphas[j] -= labelMat[j]*(Ei - Ej)/eta

                # 控制上下界

                alphas[j] = clipAlpha(alphas[j],H,L)

                if (abs(alphas[j] - alphaJold) < 0.00001): print "j not moving enough"; continue

                # 算出αi

                alphas[i] += labelMat[j]*labelMat[i]*(alphaJold - alphas[j])#update i by the same amount as j

                                                                        #the update is in the oppostie direction

                b1 = b - Ei- labelMat[i]*(alphas[i]-alphaIold)*dataMatrix[i,:]*dataMatrix[i,:].T - labelMat[j]*(alphas[j]-alphaJold)*dataMatrix[i,:]*dataMatrix[j,:].T

                b2 = b - Ej- labelMat[i]*(alphas[i]-alphaIold)*dataMatrix[i,:]*dataMatrix[j,:].T - labelMat[j]*(alphas[j]-alphaJold)*dataMatrix[j,:]*dataMatrix[j,:].T

                if (0 < alphas[i]) and (C > alphas[i]): b = b1

                elif (0 < alphas[j]) and (C > alphas[j]): b = b2

                else: b = (b1 + b2)/2.0

                alphaPairsChanged += 1

                print "iter: %d i:%d, pairs changed %d" % (iter,i,alphaPairsChanged)

        if (alphaPairsChanged == 0): iter += 1

        else: iter = 0

        print "iteration number: %d" % iter

    return b,alphas

if __name__ == '__main__':

    dataMat,labelMat = loadDataSet('testSet.txt')

    b,alphas = smoSimple(dataMat, labelMat, 0.06, 0.01, 100)

    print 'b:',b

    print 'alphas',alphas[alphas>0]

SMO实例：

import matplotlib.pyplot as plt

import numpy as np

%matplotlib inline

from matplotlib.colors import ListedColormap

def plot_decision_regions(X, y, classifier, test_idx=None, resolution=0.02):

    # setup marker generator and color map

    markers = ('s', 'x', 'o', '^', 'v')

    colors = ('red', 'blue', 'lightgreen', 'gray', 'cyan')

    cmap = ListedColormap(colors[:len(np.unique(y))])

    # plot the decision surface

    x1_min, x1_max = X[:, 0].min() - 1, X[:, 0].max() + 1

    x2_min, x2_max = X[:, 1].min() - 1, X[:, 1].max() + 1

    xx1, xx2 = np.meshgrid(np.arange(x1_min, x1_max, resolution), np.arange(x2_min, x2_max, resolution))

    Z = classifier.predict(np.array([xx1.ravel(), xx2.ravel()]).T)

    Z = Z.reshape(xx1.shape)

    plt.contourf(xx1, xx2, Z, alpha=0.4, cmap=cmap)

    plt.xlim(xx1.min(), xx1.max())

    plt.ylim(xx2.min(), xx2.max())

    # plot class samples

    for idx, cl in enumerate(np.unique(y)):

        plt.scatter(x=X[y == cl, 0], y=X[y == cl, 1],alpha=0.8, c=cmap(idx),marker=markers[idx], label=cl)

    # highlight test samples

    if test_idx:

        X_test, y_test = X[test_idx, :], y[test_idx]

        plt.scatter(X_test[:, 0], X_test[:, 1], c='', alpha=1.0, linewidth=1, marker='o', s=55, label='test set')

from sklearn import datasets

import numpy as np

from sklearn.cross_validation import train_test_split

iris = datasets.load_iris() # 由于Iris是很有名的数据集，scikit-learn已经原生自带了。

X = iris.data[:, [1, 2]]

y = iris.target # 标签已经转换成0，1，2了

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0) # 为了看模型在没有见过数据集上的表现，随机拿出数据集中30%的部分做测试

# 为了追求机器学习和最优化算法的最佳性能，我们将特征缩放

from sklearn.preprocessing import StandardScaler

sc = StandardScaler()

sc.fit(X_train) # 估算每个特征的平均值和标准差

sc.mean_ # 查看特征的平均值，由于Iris我们只用了两个特征，所以结果是array([ 3.82857143,  1.22666667])

sc.scale_ # 查看特征的标准差，这个结果是array([ 1.79595918,  0.77769705])

X_train_std = sc.transform(X_train)

# 注意：这里我们要用同样的参数来标准化测试集，使得测试集和训练集之间有可比性

X_test_std = sc.transform(X_test)

X_combined_std = np.vstack((X_train_std, X_test_std))

y_combined = np.hstack((y_train, y_test))

# 导入SVC

from sklearn.svm import SVC

svm1 = SVC(kernel='linear', C=0.1, random_state=0) # 用线性核

svm1.fit(X_train_std, y_train)

svm2 = SVC(kernel='linear', C=10, random_state=0) # 用线性核

svm2.fit(X_train_std, y_train)

fig = plt.figure(figsize=(10,6))

ax1 = fig.add_subplot(1,2,1)

#ax2 = fig.add_subplot(1,2,2)

plot_decision_regions(X_combined_std, y_combined, classifier=svm1)

plt.xlabel('petal length [standardized]')

plt.ylabel('petal width [standardized]')

plt.title('C = 0.1')

ax2 = fig.add_subplot(1,2,2)

plot_decision_regions(X_combined_std, y_combined, classifier=svm2)

plt.xlabel('petal length [standardized]')

plt.ylabel('petal width [standardized]')

plt.title('C = 10')

plt.show()

svm1 = SVC(kernel='rbf', random_state=0, gamma=0.1, C=1.0) # 令gamma参数中的x分别等于0.1和10

svm1.fit(X_train_std, y_train) 

svm2 = SVC(kernel='rbf', random_state=0, gamma=10, C=1.0)

svm2.fit(X_train_std, y_train) 

fig = plt.figure(figsize=(10,6))

ax1 = fig.add_subplot(1,2,1)

plot_decision_regions(X_combined_std, y_combined, classifier=svm1)

plt.xlabel('petal length [standardized]')

plt.ylabel('petal width [standardized]')

plt.title('gamma = 0.1')

ax2 = fig.add_subplot(1,2,2)

plot_decision_regions(X_combined_std, y_combined, classifier=svm2)

plt.xlabel('petal length [standardized]')

plt.ylabel('petal width [standardized]')

plt.title('gamma = 10')

plt.show()

机器学习算法整理（七）支持向量机以及SMO算法实现的更多相关文章

SVM-非线性支持向量机及SMO算法
SVM-非线性支持向量机及SMO算法如果您想体验更好的阅读:请戳这里littlefish.top 线性不可分情况线性可分问题的支持向量机学习方法,对线性不可分训练数据是不适用的,为了满足函数间隔大 ...
[置顶] 【机器学习PAI实践七】文本分析算法实现新闻自动分类
一.背景新闻分类是文本挖掘领域较为常见的场景.目前很多媒体或是内容生产商对于新闻这种文本的分类常常采用人肉打标的方式,消耗了大量的人力资源.本文尝试通过智能的文本挖掘算法对于新闻文本进行分类.无需任 ...
《机器学习_07_01_svm_硬间隔支持向量机与SMO》
一.简介支持向量机(svm)的想法与前面介绍的感知机模型类似,找一个超平面将正负样本分开,但svm的想法要更深入了一步,它要求正负样本中离超平面最近的点的距离要尽可能的大,所以svm模型建模可以分为 ...
支持向量机的smo算法（MATLAB code）
建立smo.m % function [alpha,bias] = smo(X, y, C, tol) function model = smo(X, y, C, tol) % SMO: SMO al ...
受限玻尔兹曼机（RBM）学习笔记（七）RBM 训练算法
去年 6 月份写的博文<Yusuke Sugomori 的 C 语言 Deep Learning 程序解读>是囫囵吞枣地读完一个关于 DBN 算法的开源代码后的笔记,当时对其中涉及的算 ...
改进的SMO算法
S. S. Keerthi等人在Improvements to Platt's SMO Algorithm for SVM Classifier Design一文中提出了对SMO算法的改进,纵观SMO ...
SMO算法--SVM(3)
SMO算法--SVM(3) 利用SMO算法解决这个问题: SMO算法的基本思路: SMO算法是一种启发式的算法(别管启发式这个术语, 感兴趣可了解), 如果所有变量的解都满足最优化的KKT条件, 那么 ...
Leetcode——二叉树常考算法整理
二叉树常考算法整理希望通过写下来自己学习历程的方式帮助自己加深对知识的理解,也帮助其他人更好地学习,少走弯路.也欢迎大家来给我的Github的Leetcode算法项目点star呀~~ 二叉树常考算法 ...
机器学习之支持向量机（二）：SMO算法
注:关于支持向量机系列文章是借鉴大神的神作,加以自己的理解写成的:若对原作者有损请告知,我会及时处理.转载请标明来源. 序: 我在支持向量机系列中主要讲支持向量机的公式推导,第一部分讲到推出拉格朗日对 ...

随机推荐

k8s之使用secret获取私有仓库镜像
一.前言其实这次实践算不上特别复杂,只是在实践过程中遇到了一些坑,以及填坑的方法是非常值得在以后的学习过程中参考借鉴的二.知识准备 1.harbor是一个企业级的镜像仓库,它比起docker re ...
在windows10上搭建caffe
caffe环境的搭建一直是让我最头疼的,最近在Windows10上成功搭建了caffe,在此对搭建过程进行记录. 安装主要是按照caffe github上的安装说明进行的,caffe的github主页 ...
raft--分布式一致性协议
0. 写在前面的话一直从事分布式对象存储工作,在分布式对象存储的运营,开发等工作中,数据一致性是至关重要的.因此想写一篇关于分布式一致性的文章.一来,可以和大家分享.二来,可以提高自己的文字提炼能力 ...
第十五次ScrumMeeting博客
第十五次ScrumMeeting博客本次会议于12月4日(一)22时整在3公寓725房间召开,持续30分钟. 与会人员:刘畅.辛德泰.张安澜.赵奕.方科栋. 1. 每个人的工作(有Issue的内容和 ...
【机器学习】Apriori算法——原理及代码实现（Python版）
Apriopri算法 Apriori算法在数据挖掘中应用较为广泛,常用来挖掘属性与结果之间的相关程度.对于这种寻找数据内部关联关系的做法,我们称之为:关联分析或者关联规则学习.而Apriori算法就是 ...
【Beta阶段】第一次Scrum Meeting！
本次会议为第一次Scrum Meeting会议~ 会议时长:20分会议地点:依旧是7公寓1楼会客室昨日任务一览明日任务一览刘乾预定任务:(未完成)#128 学习如何在Github上自动构 ...
nginx安装（转发）
Nginx("engine x")是一款轻量级的HTTP和反向代理服务器.相比于Apache.lighttpd等,它具有占有内存少.并发能力强.稳定性高等优势.它最常见的用途就是提 ...
json-server（copy）
https://blog.csdn.net/wangle_style/article/details/79455508(原文章地址) 新版vue-cli如何使用json-server来mork 原创 ...
C++ STL 整理
一.一般介绍 STL(Standard Template Library),即标准模板库,是一个具有工业强度的,高效的C++程序库.它被容纳于C++标准程序库(C++ Standard Library ...
idea 导入项目后不能执行main方法
点击右键,出来不能run/debug 项目分为多个mouel模块,很多模块进来后在idea中丢失了(暂时不知道原因) 我们需要做的就是把丢失的模块加进来 ctrl+alt+shift+s 快捷键或 ...

机器学习算法整理（七）支持向量机以及SMO算法实现

机器学习算法整理（七）支持向量机以及SMO算法实现的更多相关文章

随机推荐

热门专题