Variance
http://mathworld.wolfram.com/Variance.html
Variance

For a single variate
having a distribution
with known population mean
, the population variance
, commonly also written
, is defined as
![]() |
(1)
|
where
is the population mean and
denotes the expectation value of
. For a discrete distribution with
possible values of
, the population variance is therefore
![]() |
(2)
|
whereas for a continuous distribution, it is given by
![]() |
(3)
|
The variance is therefore equal to the second central moment
.
Note that some care is needed in interpreting
as a variance, since the symbol
is also commonly used as a parameter related to but not equivalent to the square root of the variance, for example in the log normal distribution, Maxwell distribution, and Rayleigh distribution.
If the underlying distribution is not known, then the sample variance may be computed as
![]() |
(4)
|
where
is the sample mean.
Note that the sample variance
defined above is not an unbiased estimator for the population variance
. In order to obtain an unbiased estimator for
, it is necessary to instead define a "bias-corrected sample variance"
![]() |
(5)
|
The distinction between
and
is a common source of confusion, and extreme care should be exercised when consulting the literature to determine which convention is in use, especially since the uninformative notation
is commonly used for both. The bias-corrected sample variance
for a list of data is implemented as Variance[list].
The square root of the variance is known as the standard deviation.
The reason that
gives a biased estimator of the population variance is that two free parameters
and
are actually being estimated from the data itself. In such cases, it is appropriate to use a Student's t-distribution instead of a normal distribution as a model since, very loosely speaking, Student's t-distribution is the "best" that can be done without knowing
.
Formally, in order to estimate the population variance
from a sample of
elements with a priori unknown mean (i.e., the mean is estimated from the sample itself), we need an unbiased estimator for
. This is given by the k-statistic
, where
![]() |
(6)
|
and
is the sample variance uncorrected for bias.
It turns out that the quantity
has a chi-squared distribution.
For set of data
, the variance of the data obtained by a linear transformation is given by
![]() |
![]() |
![]() |
(7)
|
![]() |
![]() |
![]() |
(8)
|
![]() |
![]() |
![]() |
(9)
|
![]() |
![]() |
![]() |
(10)
|
![]() |
![]() |
![]() |
(11)
|
![]() |
![]() |
![]() |
(12)
|
For multiple variables, the variance is given using the definition of covariance,
![]() |
![]() |
![]() |
(13)
|
![]() |
![]() |
![]() |
(14)
|
![]() |
![]() |
![]() |
(15)
|
![]() |
![]() |
![]() |
(16)
|
![]() |
![]() |
![]() |
(17)
|
A linear sum has a similar form:
![]() |
![]() |
![]() |
(18)
|
![]() |
![]() |
![]() |
(19)
|
![]() |
![]() |
![]() |
(20)
|
These equations can be expressed using the covariance matrix.
SEE ALSO: Central Moment, Charlier's Check, Covariance, Covariance Matrix, Error Propagation, k-Statistic, Mean, Moment, Raw Moment, Sample Variance, Sample Variance Computation, Sample Variance Distribution, Sigma, Standard Error, Statistical Correlation
REFERENCES:
Kenney, J. F. and Keeping, E. S. Mathematics of Statistics, Pt. 2, 2nd ed. Princeton, NJ: Van Nostrand, 1951.
Papoulis, A. Probability, Random Variables, and Stochastic Processes, 2nd ed. New York: McGraw-Hill, pp. 144-145, 1984.
Press, W. H.; Flannery, B. P.; Teukolsky, S. A.; and Vetterling, W. T. "Moments of a Distribution: Mean, Variance, Skewness, and So Forth." §14.1 in Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd ed. Cambridge, England: Cambridge University Press, pp. 604-609, 1992.
Roberts, M. J. and Riccardo, R. A Student's Guide to Analysis of Variance. London: Routledge, 1999.

Variance的更多相关文章
- 什么是遗传方差(Genetic variance)、加性遗传方差(Additive genetic variance)、显性遗传方差(Dominance genetic variance)、上位遗传方差(Epistatic genetic variance)
遗传方差:遗传方差又称表型方差(phenotypic variance),通常结合基因型方差(genotype variance)和环境方差(environmental variance).遗传方差主 ...
- Error=Bias+Variance
首先 Error = Bias + Variance Error反映的是整个模型的准确度,Bias反映的是模型在样本上的输出与真实值之间的误差,即模型本身的精准度,Variance反映的是模型每一次输 ...
- 机器学习中的Bias(偏差),Error(误差),和Variance(方差)有什么区别和联系?
前几天搜狗的一道笔试题,大意是在随机森林上增加一棵树,variance和bias如何变化呢? 参考知乎上的讨论:https://www.zhihu.com/question/27068705 另外可参 ...
- controlling the variance of request response times and not just worrying about maximizing queries per second
http://highscalability.com/blog/2010/11/4/facebook-at-13-million-queries-per-second-recommends-minim ...
- Bias and Variance
以下内容参考 cousera 吴恩达 机器学习课程 1. Bias 和 Variance 的定义 Bias and Variance 对于改进算法具有很大的帮助作用,在bias和Variance的指引 ...
- 第50讲:Scala中Variance变化点
王家林亲授<DT大数据梦工厂>大数据实战视频 Scala 深入浅出实战经典(1-64讲)完整视频.PPT.代码下载:百度云盘:http://pan.baidu.com/s/1c0noOt6 ...
- Scala 深入浅出实战经典 第49课 Scala中Variance代码实战(协变)
王家林亲授<DT大数据梦工厂>大数据实战视频 Scala 深入浅出实战经典(1-64讲)完整视频.PPT.代码下载:百度云盘:http://pan.baidu.com/s/1c0noOt6 ...
- 理解 Bias 与 Variance 之间的权衡
有监督学习中,预测误差的来源主要有两部分,分别为 bias 与 variance,模型的性能取决于 bias 与 variance 的 tradeoff ,理解 bias 与 variance 有助 ...
- 为什么样本方差(sample variance)的分母是 n-1?
为什么样本方差(sample variance)的分母是 n-1? (補充一句哦,題主問的方差 estimator 通常用 moments 方法估計.如果用的是 ML 方法,請不要多想不是你們想的那樣 ...
随机推荐
- list操作 foreach和for的区别
foreach只是简单的遍历读取,不能在循环中进行remove等操作. for可以
- DOM基础2
插入元素 <!DOCTYPE html> <html> <head lang="en"> <meta charset="UTF- ...
- linux ubuntu的root密码
安装完Ubuntu后忽然意识到没有设置root密码,不知道密码自然就无法进入根用户下.到网上搜了一下,原来是这麽回事.Ubuntu的默认root密码是随机的,即每次开机都有一个新的root密码.我们可 ...
- BZOJ4117 : [Wf2015]Weather Report
一种天气情况的概率只与4种天气的出现次数有关,故将相同概率的情况计数后放入堆中模拟哈夫曼树即可. 每次取出概率最小的,将它个数除以2,对于零头需要特判. #include<cstdio> ...
- 【BZOJ】2823: [AHOI2012]信号塔
题意 给\(n\)个点,求一个能覆盖所有点的面积最小的圆.(\(n \le 50000\)) 分析 随机增量法 题解 理论上\(O(n^3)\)暴力,实际上加上随机化后期望是\(O(n)\)的. 算法 ...
- HDU 5876 关于补图的bfs
1.HDU 5876 Sparse Graph 2.总结:好题,把STL都过了一遍 题意:n个点组成的完全图,删去m条边,求点s到其余n-1个点的最短距离. 思路:把点分为两个集合,A为所有没有到达 ...
- JS中注意事项
(一)判断中注意事项 一.所有的相对路径都别拿来做判断 1.img src='...' 2.href='1.css', href='html/index.html' 3.img src='http:/ ...
- 李洪强iOS经典面试题123
1.static 关键字的作用: (1)函数体内 static 变量的作用范围为该函数体,不同于 auto 变量,该变量的内存只被分配一次, 因此其值在下次调用时仍维持上次的值; (2)在模块内的 s ...
- [LintCode] Maximal Rectangle 最大矩形
Given a 2D boolean matrix filled with False and True, find the largest rectangle containing all True ...
- 32位的Win7系统下安装64位的Sql Sever?
来自:http://zhidao.baidu.com/link?url=nQBoaLgoOyYCUdI7V4WZCMlTW3tKscdkOnLTIvlYtPpwoVhQkSahq44HeofBfzFT ...















































