Linear regression with one variable

摘要: 本文是吴恩达 (Andrew Ng)老师《机器学习》课程，第二章《单变量线性回归》中第6课时《模型概述》的视频原文字幕。为本人在视频学习过程中逐字逐句记录下来以便日后查阅使用。现分享给大家。如有错误，欢迎大家批评指正，在此表示诚挚地感谢！同时希望对大家的学习能有所帮助。

Our first learning algorithm will be linear regression. In this video (article), you'll see what the model looks like. And more importantly, you'll also see what the overall process of supervised learning looks like.

Let's use some motivating example of predicting housing prices. We're going to use a data set of housing prices from the city of Portland, Oregon. And here I'm gonna plot my data set of a number of houses that were different sizes that were sold for a range of different prices. Let's say that given this data set, you have a friend that's trying to sell a house and let's see if your friend's house is size of 1,250 square feet, and you want to tell them how much they might be able to sell the house for. Well one thing you could do is fit a model. Maybe fit a straight line to this data. Looks something like that, and based on that, maybe you could tell your friend that, let's say maybe, he can sell the house for around $220,000. So, this is an example of a supervised learning algorithm. And it's supervised learning because we're given the, quotes, "right answer" for each of following examples. Namely we're told what was the actual house, what was the house price of each of the houses in our data set were sold for. And moreover, this is an example of a regression problem, where the term regression refers to the fact that we're predicting a real-valued output, namely the price. And just to remind you the other most common type of supervised learning is called the classification problem, where we predict discrete-valued output. Such as if we are looking at cancer tumors and trying to decide if a tumor is malignant or benign. So that's a zero-one valued discrete output.

More formally, in supervised learning, we have a data set and this data set is called a training set. So, for housing price example, we have a training set of different housing prices, and our job is to learn from this data how to predict prices of houses. Let's define some notation that we're using throughout this course. We're going to define quite a lot of symbols. It's okay if you don't remember all the symbols right now, but as the course progresses, it will be useful to have a convenient notation. So, I'm gonna use lower case m throughout this course to denote the number of training examples. So, in this data set, if I have, you know, let's say 47 rows in this table, then I have 47 training examples, and m=47. Let me use lowercase x to denote the input variables, often also called the features. So that would be the x's here, it would be our input features. And I'm gonna use y to denote my output variables, or the target variable which I'm going to predict. And so that's the second column here. Looking on notation, I'm going to use (x,y) to denote a single training example. So, a single row in this table corresponds to a single training example. And to refer to a specific training example, I'm going to use this notation $(x^{(i)}, y^{(i)})$ . And, we're going to use this to refer to the $i^{th}$ training example. So, this superscript i over here, this is not exponentiation right? This $(x^{(i)}, y^{(i)})$ , the superscript i in parentheses that's just an index into my training set, and refers to the $i^{th}$ row in this table, okay? So, this is not x to the power of i, y to power of i. Instead $(x^{(i)}, y^{(i)})$ just refers to the $i^{th}$ of this table. So, for example, $x^{(1)}$ refers to the input value for the first training example, so that's 2104. That's the x in the first row. $x^{(2)}$ will be equal to 1416 right? That's the second x, and $y^{(1)}$ will be equal to 460. That's the y value for my first training example. That's what that (1) refers to.

So as mentioned, occasionally I'll ask you a question to let you check your understanding, and a few seconds in this video a multiple-choice question will pop up in the video. When it does, please use your mouse to select what you think is the right answer. What defined by the training set is? So here's how this supervised learning algorithm works. We saw that with the training set like our training set of housing prices and we feed that to our learning algorithm. Is the job of a learning algorithm to then output a function, which by convention is usually denoted lowercase h, and h stands for hypothesis. And what the job of the hypothesis is, it's a function that takes as input the size of a house, like maybe the size of the new house your friend trying to sell, so it takes in the value of x, and it tries to output the estimated value of y for the corresponding house. So h is a function that maps from x's to y's. People often ask me, you know, why is the function called hypothesis. Some of you may know the meaning of the term hypothesis, from the dictionary or from the science or whatever. It turns out that in machine learning, this is a name that was used in early days of machine learning and it kinda stuck. Because maybe not a great name for this sort of function, for mapping from sizes of houses to the predictions, I think the term hypothesis, isn't the best possible name for this, but this is the standard terminology that people use in machine learning. So don't worry too much about why people call it that. When designing a learning algorithm, the next thing we need to decide is how do we represent this hypothesis h. For this and next few videos, I'm going to choose our initial choice, for representing the hypothesis, will be the following. We're going to represent h as follows. And we will write this as:

$h_{\theta }(x)=\theta _{0}+\theta _{1}x$ .

And as a shorthand, sometimes instead of writing $h_{\theta }(x)$ , I'll just write this as $h(x)$ . But more often I'll write it as a subscript $\theta$ over there. And plotting this in the pictures, all this means is that, we are going to predict that y is a linear function of x. Right, so that's the data set. And what this function is doing, is predicting that y is some straight-line function of x. That's $h_{\theta }(x)=\theta _{0}+\theta _{1}x$ , okay? And why a linear function? Well, sometimes we'll want to fit more complicated perhaps non-linear functions as well. But since this linear case is the simple building block, we'll start with this example first of fitting linear functions, and we'll build on this to eventually have more complex models, and more complex learning algorithms. Let me also give this particular model a name. This model is called linear regression or this, for example, is actually linear regression with one variable, with the variable being x. That's the predicting all the prices as functions of one variable x. And another name for this model is univariate linear regression. And univariate is just a fancy way of saying one variable. So, that's linear regression. In the next video (article) we'll start to talk about just how we go about implementing this model.

<end>

Linear regression with one variable - Model representation的更多相关文章

机器学习笔记1——Linear Regression with One Variable
Linear Regression with One Variable Model Representation Recall that in *regression problems*, we ar ...
MachineLearning ---- lesson 2 Linear Regression with One Variable
Linear Regression with One Variable model Representation 以上篇博文中的房价预测为例,从图中依次来看,m表示训练集的大小,此处即房价样本数量:x ...
Lecture0 -- Introduction&&Linear Regression with One Variable
Introduction What is machine learning? Tom Mitchell provides a more modern definition: "A compu ...
Machine Learning 学习笔记2 - linear regression with one variable(单变量线性回归)
一.Model representation(模型表示) 1.1 训练集由训练样例(training example)组成的集合就是训练集(training set), 如下图所示, 其中(x,y) ...
机器学习 (一) 单变量线性回归 Linear Regression with One Variable
文章内容均来自斯坦福大学的Andrew Ng教授讲解的Machine Learning课程,本文是针对该课程的个人学习笔记,如有疏漏,请以原课程所讲述内容为准.感谢博主Rachel Zhang的个人笔 ...
Stanford机器学习---第二讲. 多变量线性回归 Linear Regression with multiple variable
原文:http://blog.csdn.net/abcjennifer/article/details/7700772 本栏目(Machine learning)包括单参数的线性回归.多参数的线性回归 ...
Stanford机器学习---第一讲. Linear Regression with one variable
原文:http://blog.csdn.net/abcjennifer/article/details/7691571 本栏目(Machine learning)包括单参数的线性回归.多参数的线性回归 ...
Ng第二课：单变量线性回归(Linear Regression with One Variable)
二.单变量线性回归(Linear Regression with One Variable) 2.1 模型表示 2.2 代价函数 2.3 代价函数的直观理解 2.4 梯度下降 2.5 梯度下 ...
【cs229-Lecture2】Linear Regression with One Variable (Week 1)(含测试数据和源码)
从Ⅱ到Ⅳ都在讲的是线性回归,其中第Ⅱ章讲得是简单线性回归(simple linear regression, SLR)(单变量),第Ⅲ章讲的是线代基础,第Ⅳ章讲的是多元回归(大于一个自变量). 本文的 ...

随机推荐

Property or method "openPageOffice" is not defined on the instance but referenced during render. Make sure that this property is reactive, either in the data option, or for class-based components, by
Property or method "openPageOffice" is not defined on the instance but referenced during r ...
如何查看JVM的内存
学过java的人都知道,jvm是解释运行java的,java能够作为跨平台语言,也是因为jvm的存在,合理的使用jvm内存可以帮助程序很好的运行.那么,怎么查看jvm的内存使用情况呢,下面本文介绍一下 ...
使用 EasyExcel 写Excel数据（表头动态）
引入 jar 包 <dependency> <groupId>com.alibaba</groupId> <artifactId>easyexcel&l ...
基于ARM的SoC设计入门[转]
原文:基于ARM的SoC设计入门我们跳过所有对ARM介绍性的描述,直接进入工程师们最关心的问题.要设计一个基于ARM的SoC,我们首先要了解一个基于ARM的SoC的结构.图1是一个典型的SoC的结构 ...
CSPS模拟75&76
感觉自己还是太菜了... 最近考试一直想不出来正解.难受(然而蒟蒻的博客没人看也要不来小猪peiqi的图) 模拟75:血炸... 考场上推了快两个小时的T1式子,然后心态炸裂,然后我也不知道自己干了什 ...
在Android中使用OpenGL ES开发第（五）节：GLSL基础语法
一.前期基础储备笔者之前的四篇文综述了Android中使用OpenGL ES绘制基本图形和实现了简单的相机预览,初次接触OpenGL ES开发的读者可能对其中新的概念比较迷惑,尤其是其中的顶点着色器( ...
Java三大特征--多态
1.定义允许不同类的对象对同一消息做出响应,即同一消息可以根据发送对象的不同而采用多种不同的行为方式. 2.存在条件 2.1存在继承关系 2.2子类重写了父类方法 2.3父类类型的变量指向子类对象的 ...
B. Uniqueness（尺取）
B. Uniqueness time limit per test 2 seconds memory limit per test 256 megabytes input standard input ...
微信小程序之简单记账本开发记录（一）
下载并安装微信开发者工具在选择开发记账本程序的时候犹豫着选择android studio还是微信小程序最后选择了微信小程序,因其便利和快捷. 话不多说,第一步,下载并安装微信开发者工具.下面是教程 ...
JIRA恢复备份后无法上传附件
1.在恢复JIRA 备份数据和附件后,上传附件失败,这一般是恢复附件时没有修改附件的拥有者和组创建JIRA平台,会自动创建一个服务器的账户,如果是服务器第一次部署JIRA那么账户肯定是jira,如果 ...

Linear regression with one variable - Model representation

Linear regression with one variable - Model representation的更多相关文章

随机推荐

热门专题