1、What is Maximum Likelihood?

极大似然是一种找到最可能解释一组观测数据的函数的方法。

Maximum Likelihood is a way to find the most likely function to explain a set of observed data. 

在基本统计学中,通常给你一个模型来计算概率。例如,你可能被要求找出X大于2的概率,给定如下泊松分布:X ~ Poisson (2.4)。在这个例子中,已经给定了你泊松分布的参数 λ(2.4),在现实生活中,您没有这么奢侈,因为您没有确定参数的模型:您必须将数据与模型相匹配。这就是最大可能性(MLE)的作用。在统计学中,最大似然估计(maximum likelihood estimation, MLE)是在给定观测值的情况下估计统计模型参数的一种方法。MLE试图在给定观测值的情况下找到使似然函数最大化的参数值。得到的估计称为最大似然估计,也缩写为MLE。

In elementary statistics, you are usually given a model to find probabilities. For example, you might be asked to find the probability that X is greater than 2, given the following Poisson distribution:
X ~ Poisson (2.4)
In this example, you are given the parameter, λ, of 2.4 for the Possion distribution. In real life, you don’t have the luxury of having a model given to you: you’ll have to fit your data to a model. That’s where Maximum Likelihood (MLE) comes in.
In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of a statistical model, given observations. MLE attempts to find the parameter values that maximize the likelihood function, given the observations. The resulting estimate is called a maximum likelihood estimate, which is also abbreviated as MLE.

MLE采用已知的概率分布模型(如正态分布),并将数据集与这些分布进行比较,以便找到数据的合适匹配。一个分布模型对应的参数可以有无穷个。例如正态分布的均值可以是0,也可以是100亿以上。最大似然估计是找到最可能生成待测样本的总体参数的一种方法。数据与模型的匹配程度称为“拟合优度”。

MLE takes known probability distributions (like the normal distribution) and compares data sets to those distributions in order to find a suitable match for the data. A Family of distributions can have an infinite amount of possible parameters. For example, the mean of the normal distribution could be equal to zero, or it could be equal to ten billion and beyond. Maximum Likelihood Estimation is one way to find the parameters of the population that is most likely to have generated the sample being tested. How well the data matches the model is known as “Goodness of Fit.” 

例如,研究人员可能有兴趣找出吃特定食物的老鼠的平均体重增加。研究人员无法测量每只老鼠的体重,所以只能取样。大鼠体重增加呈正态分布;最大似然估计可用于求基于该样本的总体增重的均值和方差

For example, a researcher might be interested in finding out the mean weight gain of rats eating a particular diet. The researcher is unable to weigh every rat in the population so instead takes a sample. Weight gains of rats tend to follow a normal distribution; Maximum Likelihood Estimation can be used to find the mean and variance of the weight gain in the general population based on this sample

MLE根据似然函数的最大值来选择模型参数。

MLE chooses the model parameters based on the values that maximize the Likelihood Function.

2、The Likelihood Function(似然函数,是一种表示概率的方法;似然表示得到样本的概率;最大似然表示的是得到样本最大概率的参数)

给定一个特定的概率分布模型,样本的似然是得到样本的概率。似然函数是一种表示概率的方法:最大概率得到样本的参数是最大似然估计。

一句话:似然表示概率;似然函数表示得到概率的方法;最大似然表示的得到最大概率的参数

The likelihood of a sample is the probability of getting that sample, given a specified probability distribution model. The likelihood function is a way to express that probability: the parameters that maximize the probability of getting that sample are the Maximum Likelihood Estimators. 

假设你有一组从一个未知分布参数Θ的总体得到的随机变量X1, X2…Xn。该分布的概率密度函数(PDF) f(Xi,Θ)模型,Xi是随机变量的集合,Θ是未知参数。最大似然函数你想知道Θ最可能的值是什么,得到随机变量Xi。本例的联合概率密度函数为:

Let’s suppose you had a set of random variables X1, X2…Xn taken from an unknown population distribution with parameter Θ. This distribution has a probability density function (PDF) of f(Xi,Θ) where f is the model, Xi is the set of random variables and Θ is the unknown parameter. For the maximum likelihood function you want to know what the most likely value for Θ is, given the set of random variables Xi. The joint probability density function for this example is:

3、The Basic Idea

It seems reasonable that a good estimate of the unknown parameter θ would be the value of θ that maximizes the probability, errrr... that is, the likelihood... of getting the data we observed. (So, do you see from where the name "maximum likelihood" comes?) So, that is, in a nutshell, the idea behind the method of maximum likelihood estimation. But how would we implement the method in practice? Well, suppose we have a random sample X1X2,..., Xn for which the probability density (or mass) function of each Xi is f(xiθ). Then, the joint probability mass (or density) function of X1X2,..., Xn, which we'll (not so arbitrarily) call L(θ) is:

The first equality is of course just the definition of the joint probability mass function. The second equality comes from that fact that we have a random sample, which implies by definition that the Xare independent. And, the last equality just uses the shorthand mathematical notation of a product of indexed terms. Now, in light of the basic idea of maximum likelihood estimation, one reasonable way to proceed is to treat the "likelihood functionL(θ) as a function of θ, and find the value of θ that maximizes it.

4、example1

假设权重随机选择的美国女大学生与未知的正态分布均值μ和标准差σ。随机抽取的10名美国女大学生的体重(以磅为单位)如下:

115   122   130   127   149   160   152   138  149   180 

根据上面给出的定义,识别似然函数和μ的极大似然估计量,所有的美国女大学生的平均重量。使用给定的样本,找到一个最大似然估计的μ。

Based on the definitions given above, identify the likelihood function and the maximum likelihood estimator of μ, the mean weight of all American female college students. Using the given sample, find a maximum likelihood estimate of μ as well.

5、example2

Suppose we have a random sample X1X2,..., Xn where:

  • Xi = 0 if a randomly selected student does not own a sports car, and
  • Xi = 1 if a randomly selected student does own a sports car.

Assuming that the Xi are independent Bernoulli random variables with unknown parameter p, find the maximum likelihood estimator of p, the proportion of students who own a sports car.

6、文献

https://newonlinecourses.science.psu.edu/stat414/node/191/(写的很好,里面有很多的例子)

https://en.wikipedia.org/wiki/Maximum_likelihood_estimation

https://www.statisticshowto.datasciencecentral.com/maximum-likelihood-estimation/

Maximum Likelihood及Maximum Likelihood Estimation的更多相关文章

  1. MLE vs MAP: the connection between Maximum Likelihood and Maximum A Posteriori Estimation

    Reference:MLE vs MAP. Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP), are both a ...

  2. LeetCode: Maximum Product Subarray && Maximum Subarray &子序列相关

    Maximum Product Subarray Title: Find the contiguous subarray within an array (containing at least on ...

  3. likelihood(似然) and likelihood function(似然函数)

    知乎上关于似然的一个问题:https://www.zhihu.com/question/54082000 概率(密度)表达给定下样本随机向量的可能性,而似然表达了给定样本下参数(相对于另外的参数)为真 ...

  4. [Bayes] Understanding Bayes: A Look at the Likelihood

    From: https://alexanderetz.com/2015/04/15/understanding-bayes-a-look-at-the-likelihood/ Reading note ...

  5. [LeetCode] Maximum Depth of Binary Tree 二叉树的最大深度

    Given a binary tree, find its maximum depth. The maximum depth is the number of nodes along the long ...

  6. LeetCode 104. Maximum Depth of Binary Tree

    Problem: Given a binary tree, find its maximum depth. The maximum depth is the number of nodes along ...

  7. [LintCode] Maximum Depth of Binary Tree 二叉树的最大深度

    Given a binary tree, find its maximum depth. The maximum depth is the number of nodes along the long ...

  8. [Leetcode][JAVA] Minimum Depth of Binary Tree && Balanced Binary Tree && Maximum Depth of Binary Tree

    Minimum Depth of Binary Tree Given a binary tree, find its minimum depth. The minimum depth is the n ...

  9. LeetCode:Maximum Depth of Binary Tree_104

    LeetCode:Maximum Depth of Binary Tree [问题再现] Given a binary tree, find its maximum depth. The maximu ...

随机推荐

  1. python-requests数据驱动延伸

    在 python-requests模块的讲解和应用 基础上进行数据驱动的延伸 task_01_requests.py #-*- coding:utf-8 -*- #task_01_requests.p ...

  2. ES6,变量,函数-参数,结构赋值

    变量 var 1.可以重复声明. 无法限制修改-, 没有块级作用域 let不能重复声明,变量-可以修改,块级作const不能重复声明,常量-不能修改,块级作 函数——箭头函数function 名字() ...

  3. layer.js关闭子窗口及刷新父窗口

    在需要layer.js弹窗口时,当编辑完窗口内容,需要关闭及刷新父窗口时: $("#senddata").click(function(){var id = $('input[na ...

  4. 使用tpcc-mysql进行性能测试

    tpcc-mysql的github地址:https://github.com/Percona-Lab/tpcc-mysql tpcc-mysql源代码进行编译,服务器上需要安装有mysql git c ...

  5. 如何将相册中的动态GIF图转化成NSData类型

    http://www.cocoachina.com/bbs/read.php?tid=151776

  6. Loadrunner:win10下Vuser 运行脚本通过,Controller执行用户并发报错

    现象:win7安装LR可以正常使用,将win7升级到win10之后,运行场景之后报错:Error (-81024): LR_VUG: The 'QTWeb' type is not supported ...

  7. 零配置使用springboot

    1.pom.xml <?xml version="1.0" encoding="UTF-8"?> <project xmlns="h ...

  8. jqGrid pivot获取所有行包括小计数据及原码分析

    1.结论:按正常jqGid获取,在中间加入以下代码,即将小计行当成改变为普能行,以便能让'getRowData'方法获取到,第三点会进行原码分析 //get all page grid data,in ...

  9. ubuntu16.04搭建geodjango+postgresql+postgis的WebGIS框架(三)加载空间数据

    之前两篇基本上搭好了geodjango开发的环境了,当然你的电脑上肯定要有python和django的环境(这个我就不介绍了,网上一搜一大堆),我自己用的python3.5和django2.0(毕竟2 ...

  10. beego生成 api 项目 && api 文档

    目标 生成 api 项目,并且自动生成db的mapper.module等:同时生成api文档 操作步骤 1.生成 api 项目,并且自动生成db全表的映射 bee api [projectName] ...