Logistic Regression and Newton's Method

Data

For this exercise, suppose that a high school has a dataset representing 40 students who were admitted to college and 40 students who were not admitted. Each training example contains a student's score on two standardized exams and a label of whether the student was admitted.

Your task is to build a binary classification model that estimates college admission chances based on a student's scores on two exams. In your training data,

a. The first column of your x array represents all Test 1 scores, and the second column represents all Test 2 scores.

b. The y vector uses '1' to label a student who was admitted and '0' to label a student who was not admitted.

Plot the data

Load the data for the training examples into your program and add the intercept term into your x matrix.

Before beginning Newton's Method, we will first plot the data using different symbols to represent the two classes. In Matlab/Octave, you can separate the positive class and the negative class using the find command:
% find returns the indices of the

% rows meeting the specified condition

pos = find(y == 1); neg = find(y == 0);

% Assume the features are in the 2nd and 3rd

% columns of x

plot(x(pos, 2), x(pos,3), '+'); hold on

plot(x(neg, 2), x(neg, 3), 'o')
Your plot should look like the following:

Newton's Method

在logistic regression问题中，logistic函数表达式如下：

这样做的好处是可以把输出结果压缩到0~1之间。而在logistic回归问题中的损失函数与线性回归中的损失函数不同，这里定义的为：

如果采用牛顿法来求解回归方程中的参数，则参数的迭代公式为：

其中一阶导函数和hessian矩阵表达式如下：

code

% Exercise  -- Logistic Regression

clear all; close all; clc

x = load('ex4x.dat');

y = load('ex4y.dat');

[m, n] = size(x);

% Add intercept term to x

x = [ones(m, ), x]; 

% Plot the training data

% Use different markers for positives and negatives

figure

pos = find(y); neg = find(y == );%find是找到的一个向量，其结果是find函数括号值为真时的值的编号

plot(x(pos, ), x(pos,), '+')

hold on

plot(x(neg, ), x(neg, ), 'o')

hold on

xlabel('Exam 1 score')

ylabel('Exam 2 score')

% Initialize fitting parameters

theta = zeros(n+, );

% Define the sigmoid function 匿名函数

g = inline('1.0 ./ (1.0 + exp(-z))'); 

% Newton's method

MAX_ITR = ;

J = zeros(MAX_ITR, );

for i = :MAX_ITR

    % Calculate the hypothesis function

    z = x * theta;

    h = g(z);%转换成logistic函数

    % Calculate gradient and hessian.

    % The formulas below are equivalent to the summation formulas

    % given in the lecture videos.

    grad = (/m).*x' * (h-y);%梯度的矢量表示法

    H = (/m).*x' * diag(h) * diag(1-h) * x;%hessian矩阵的矢量表示法

    % Calculate J (for testing convergence)

    J(i) =(/m)*sum(-y.*log(h) - (-y).*log(-h));%损失函数的矢量表示法

    theta = theta - H\grad;%是这样子的吗？

end

% Display theta

theta

% Calculate the probability that a student with

% Score  on exam  and score  on exam

% will not be admitted

prob =  - g([, , ]*theta)

%画出分界面

% Plot Newton's method result

% Only need  points to define a line, so choose two endpoints

plot_x = [min(x(:,))-,  max(x(:,))+];

% Calculate the decision boundary line，plot_y的计算公式见博客下面的评论。

plot_y = (-./theta()).*(theta().*plot_x +theta());

plot(plot_x, plot_y)

legend('Admitted', 'Not admitted', 'Decision Boundary')

hold off

% Plot J

figure

plot(:MAX_ITR-, J, 'o--', 'MarkerFaceColor', 'r', 'MarkerSize', )

xlabel('Iteration'); ylabel('J')

% Display J

J

matlab

diag函数功能：矩阵对角元素的提取和创建对角阵

设以下X为方阵，v为向量

1、X = diag(v,k)当v是一个含有n个元素的向量时，返回一个n+abs(k)阶方阵X，向量v在矩阵X中的第k个对角线上，k=0表示主对角线，k>0表示在主对角线上方，k<0表示在主对角线下方。例1：

v=[1 2 3];
diag(v, 3)

ans =

0 0 0 1 0 0

0 0 0 0 2 0

0 0 0 0 0 3

0 0 0 0 0 0

0 0 0 0 0 0

0 0 0 0 0 0

注：从主对角矩阵上方的第三个位置开始按对角线方向产生数据的

例2：

v=[1 2 3];

diag(v, -1)

ans =

0 0 0 0

1 0 0 0

0 2 0 0

0 0 3 0

注：从主对角矩阵下方的第一个位置开始按对角线方向产生数据的

2、X = diag(v)

向量v在方阵X的主对角线上，类似于diag(v,k),k=0的情况。

例3：

v=[1 2 3];

diag(v)

ans =

1 0 0

0 2 0

0 0 3

注：写成了对角矩阵的形式

3、v = diag(X,k)

返回列向量v，v由矩阵X的第k个对角线上的元素形成

例4：

v=[1 0 3;2 3 1;4 5 3];

diag(v,1)

ans =

0

1

注：把主对角线上方的第一个数据作为起始数据，按对角线顺序取出写成列向量形式

4、v = diag(X)返回矩阵X的主对角线上的元素，类似于diag(X,k)，k=0的情况例5：

v=[1 0 0;0 3 0;0 0 3];

diag(v)

ans =

1

3

3

或改为：

v=[1 0 3;2 3 1;4 5 3];

diag(v)

ans =

1

3

3

注：把主对角线的数据取出写成列向量形式

5、diag(diag(X))

取出X矩阵的对角元，然后构建一个以X对角元为对角的对角矩阵。

例6：

X=[1 2;3 4]
diag(diag(X))

X =

1 2

3 4

ans =

1 0

0 4

Logistic Regression and Newton's Method的更多相关文章

转载 Deep learning：四(logistic regression练习)
前言: 本节来练习下logistic regression相关内容,参考的资料为网页:http://openclassroom.stanford.edu/MainFolder/DocumentPage ...
斯坦福CS229机器学习课程笔记 part2：分类和逻辑回归 Classificatiion and logistic regression
Logistic Regression 逻辑回归 1.模型逻辑回归解决的是分类问题,并且是二元分类问题(binary classification),y只有0,1两个取值.对于分类问题使用线性回归不 ...
Regularized logistic regression
要解决的问题是,给出了具有2个特征的一堆训练数据集,从该数据的分布可以看出它们并不是非常线性可分的,因此很有必要用更高阶的特征来模拟.例如本程序中个就用到了特征值的6次方来求解. Data To be ...
逻辑回归模型（Logistic Regression）及Python实现
逻辑回归模型(Logistic Regression)及Python实现 http://www.cnblogs.com/sumai 1.模型在分类问题中,比如判断邮件是否为垃圾邮件,判断肿瘤是否为阳 ...
Logistic Regression Vs Decision Trees Vs SVM: Part I
Classification is one of the major problems that we solve while working on standard business problem ...
Stanford机器学习---第三讲. 逻辑回归和过拟合问题的解决 logistic Regression & Regularization
原文:http://blog.csdn.net/abcjennifer/article/details/7716281 本栏目(Machine learning)包括单参数的线性回归.多参数的线性回归 ...
logistic regression的一些问题，不平衡数据，时间序列，求解惑
Logistic Regression 1.在有时间序列的特征数据中,怎么运用LR? 不光是LR,其他的模型也是. 有很多基本的模型变形之后,变成带时序的模型.但,个人觉得,这类模型大多不靠谱. 我觉 ...
Machine Learning - 第3周（Logistic Regression、Regularization）
Logistic regression is a method for classifying data into discrete outcomes. For example, we might u ...
Python实践之（七）逻辑回归（Logistic Regression）
机器学习算法与Python实践之(七)逻辑回归(Logistic Regression) zouxy09@qq.com http://blog.csdn.net/zouxy09 机器学习算法与Pyth ...

随机推荐

tf.nn.top_k(input, k, name=None)和tf.nn.in_top_k(predictions, targets, k, name=None)
tf.nn.top_k(input, k, name=None) 这个函数的作用是返回 input 中每行最大的 k 个数,并且返回它们所在位置的索引. input: 一个张量,数据类型必须是以下之一 ...
事件代理（event的target属性）
event的target属性一个题:请通过事件代理实现当点击每一个li标签,弹出相应li标签内的内容 Event对象提供了一个属性叫target,可以返回事件的目标节点,我们称为事件源,也就是说,t ...
Mysql 主从主主复制总结(详细)
环境:Centos 6.9,Mysql 8.0 配置准备:1.为Centos配置好网络,使用远程工具连接. 2.安装mysql,途径不限.mysql8.0和5.7区别不大,8.0在配置主从的时候默认开 ...
学习Go语言之使用channel避免竞态问题
// 使用channel避免竞态问题 package main import ( "fmt" "sync" ) var ( i int wg sync.Wait ...
[NOIP2009] 靶形数独（搜索）
P1074 靶形数独题目描述小城和小华都是热爱数学的好学生,最近,他们不约而同地迷上了数独游戏,好胜的他们想用数独来一比高低.但普通的数独对他们来说都过于简单了,于是他们向 Z 博士请教,Z 博士 ...
第6章8节《MonkeyRunner源代码剖析》Monkey原理分析-事件源-事件源概览-小结
本章我们重点环绕处理网络过来的命令的MonkeySourceNetwork这个事件源来阐述学习Monkey是怎样处理MonkeyRunner过来的命令的.以下总结下MonkeyRunner从启动Mon ...
关于Android手机MTP模式连接的一些设置（win7和ubuntu下，以红米1s为例）
有些手机的MTP模式在电脑上识别不了,须要一些设置才干够,以下就网上收集来的一些设置方法集中贴过来: 一. win7下參考:http://blog.ammrli.com/?p=1117 1.在设备管 ...
XMPP使用简单介绍--登录
在现阶段的通信服务中,各种标准都有.因此会出现无法实现相互连通,而XMPP(Extensible Message and presence Protocol)协议的出现,实现了整个及时通信服务协议的互 ...
tp5项目搭建思路
按照需求,创建主体的目录结构,一般包括管理后台admin,前台展示index,app接口api. admin中又包含controller,model,view,其他等等. 一些js,css,image ...
syslog介绍-CS架构来采集系统日志
syslog架构 Unix/Linux系统中的大部分日志都是通过一种叫做syslog的机制产生和维护的.syslog是一种标准的协议,分为客户端和服务器端,客户端是产生日志消息的一方,而服务器端负责接 ...

Logistic Regression and Newton's Method

Logistic Regression and Newton's Method的更多相关文章

随机推荐

热门专题