Applied Nonparametric Statistics-lec4】的更多相关文章

Ref:https://onlinecourses.science.psu.edu/stat464/print/book/export/html/14 估计CDF The Empirical CDF 绘制empirical cdf的图像: x = c(4, 0, 3, 2, 2) plot.ecdf(x) Kolmogorov-Smirnov test testing the "sameness" of two independent samples from a continuous…
Ref:https://onlinecourses.science.psu.edu/stat464/print/book/export/html/12 前面我们考虑的情况是:response是连续的,variable是离散的.举例:如果打算检查GPA的中位数是否与学生坐在教室的位置有关, 那么GPA的中位数是连续的,是响应变量:学生坐的位置(前中后)是离散的,是解释变量. 现在考虑解释变量也是连续的情况,即检查两个连续变量之间的因果关系.其中,我们最关心的是关系的强弱和方向. 首先,我们考虑线性…
Ref:https://onlinecourses.science.psu.edu/stat464/print/book/export/html/11 additive model value = typical value + row effect + column effect + residual predicate value = typical value + row effect + column effect 其中value是我们关注的值,typical value是overall…
Ref: https://onlinecourses.science.psu.edu/stat464/print/book/export/html/9 经过前面的步骤,我们已经可以判断几个样本之间是否有差异,差异有多大,现在,我们的备选假设 变成有规律的了,如: 在前面的方法中,我们没有限定这种有顺序的小于等于关系. contrasts: R中可以使用ANGEL包中的函数.染鹅我装不了这个包:)仅供参考. permcontrast(data, R=1000, contrast, graph=T,…
Ref: https://onlinecourses.science.psu.edu/stat464/print/book/export/html/8 前面都是对一两个样本的检查,现在考虑k个样本的情况,我们的假设是: Analysis of Variance (ANOVA) assumptions are: Groups are independent Distributions are Normally distributed Groups have equal variances 那么我们…
今天继续two-sample test Ref: https://onlinecourses.science.psu.edu/stat464/print/book/export/html/6 Mann-Whitney Test 前面说这个和Wilcoxon是identical的,只是统计量不同.现在我们来看一下它的统计量U.注意,现在检查的仍然是两个独立样本. Treatment 1:  x1, x2, ... , xmTreatment 2:  y1, y2, ... , yn U = # o…
Ref: https://onlinecourses.science.psu.edu/stat464/print/book/export/html/5 Two sample test 直接使用R的t-test t.test(n, t, alternative="two.sided", var.equal=T) permutation test 当我们判断两个样本的均值或者中值是否相等时,如果样本数量足够大,可以使用t-test. 但是,当两个样本的数量都很小时,它们的分布可能是有偏的,…
Ref: https://onlinecourses.science.psu.edu/stat464/print/book/export/html/4 使用非参数方法的优势: 1. 对总体分布做的假设少,所以总体分布未知也可以: 2. 容易做: 3. 一般对离群值更具鲁棒性robust: 4. 适用于数据中包含ranks, ordinal or categorical的. In a skewed distribution, the population median, η, is a bette…
Ref: https://onlinecourses.science.psu.edu/stat464/print/book/export/html/3 The Binomial Distribution in R: # return PMF. prob is the probability of success . x can be a list dbinom(x, size, prob) # CDF pbinom(x, size, prob) # returns a value for a p…
参考网址: https://onlinecourses.science.psu.edu/stat464/node/2 Binomial Distribution Normal Distribution 将正态分布标准化.这也就是Z-score Confidence Interval 在上面的前提下,假设σ^2已知,现在构造μ的置信区间: 利用上面Z-score的公式,且 套入公式,解出μ.注意此处的标准差用的是σ/根号n.最终解出: 当σ^2=Var(X)不知道时,我们可以用样本的标准差,计算Z…
https://onlinecourses.science.psu.edu/statprogram/programs Graduate Online Course Overviews Printer-friendly versionPrinter-friendly version Picture of Thomas Building where the Eberly College of Science and the Department of Statistics resides.The D…
Machine and Deep Learning with Python Education Tutorials and courses Supervised learning superstitions cheat sheet Introduction to Deep Learning with Python How to implement a neural network How to build and run your first deep learning network Neur…
ISSN Abbreviated Journal Title Full Title Category Subcategory Country total Cites IF        2013-2014 IF 2012-2013 IF 2011-2012 IF 2010-2011 IF 2009-2010 IF 2008-2009 IF 2007-2008 5-Year Impact Factor Immediacy Index Articles Cited Half-Life Eigenfa…
参考文献: 1.python 皮尔森相关系数 https://www.cnblogs.com/lxnz/p/7098954.html 2.统计学之三大相关性系数(pearson.spearman.kendall) http://blog.sina.com.cn/s/blog_69e75efd0102wmd2.html 皮尔森系数 重点关注第一个等号后面的公式,最后面的是推导计算,暂时不用管它们.看到没有,两个变量(X, Y)的皮尔森相关性系数(ρX,Y)等于它们之间的协方差cov(X,Y)除以它…
https://www.quora.com/How-do-I-learn-mathematics-for-machine-learning   How do I learn mathematics for machine learning? Promoted by Time Doctor Software for productivity tracking. Time tracking and productivity improvement software with screenshots…
python机器学习-乳腺癌细胞挖掘(博主亲自录制视频) https://study.163.com/course/introduction.htm?courseId=1005269003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share 机器学习,统计项目联系QQ:231469242 两个配对样本,均匀分布,非正太分布 Wilcoxon signed-rank test 曼-惠特尼U检验M…
一.MCMC 简介 1. Monte Carlo 蒙特卡洛 蒙特卡洛方法(Monte Carlo)是一种通过特定分布下的随机数(或伪随机数)进行模拟的方法.典型的例子有蒲丰投针.定积分计算等等,其基础是大数定律. 蒙特卡洛方法有哪些优缺点如下: 优点:计算准确性由采样的均匀程度决定:大大简化问题复杂性 缺点: 由于要进行大量的抽样计算,对计算机速度依赖性强 目前绝大多数随机数发生器均为伪随机数,一定程度上有偏 定积分求解问题中,对于\(\color{blue}{复杂或者高维的分布}\),利用蒙特…
http://stackoverflow.com/jobs/124781/principal-data-scientist-concur-technologies-inc?med=clc&ref=small-sidebar-tag-themed-python Job Description Be a core part of the Data Platform team and help deliver the promise of a better and more interesting t…
https://github.com/josephmisiti/awesome-machine-learning/blob/master/books.md Machine-Learning / Data Mining An Introduction To Statistical Learning - Book + R Code Elements of Statistical Learning - Book Probabilistic Programming & Bayesian Methods…
SAS是著名的统计分析软件,全称为Statistics Analysis System,最早由北卡罗来纳大学的两位生物统计学研究生编制,并于1976年成立了SAS软件研究所,正式推出了SAS软件. 转载自:http://www.hejizhan.com/html/xueke/110/x110_46.html 这里有几十个SAS学习教程,大家可以按需下载学习,当然了,可以的话,还是多支持正版为好!  现代统计学与SAS应用 胡良平主编.pdf 10.39 MB  SAS数据挖掘实战精简版.pdf…
The following is a list of free, open source books on machine learning, statistics, data-mining, etc. Machine-Learning / Data Mining An Introduction To Statistical Learning - Book + R Code Elements of Statistical Learning - Book Probabilistic Program…
LDSO:具有回环检测的直接稀疏里程计:LDSO:Direct Sparse Odometry with Loop Closure Abstract—In this paper we present an extension of Direct Sparse Odometry (DSO) [1] to a monocular visual SLAM system with loop closure detection and pose-graph optimization (LDSO). As…
David M.BLEI nCR文献学习笔记(基本完成了)  http://yhbys.blog.sohu.com/238343705.html 题目:The Nested Chinese Restaurant Process and Bayesian Nonparametric Inference of Topic Hierarchies David M.BLEI 这个LDA领域的大牛,对LDA有诸多变形,这一片是将随机过程(stochastic process)用于无参贝叶斯推断上,构造主题…
1.Oracle 11g R2安装手册(图文教程)For Windows 1.下载Oracle 11g R2 for Windows版本,下载地址如下官方网站:http://download.oracle.com/otn/nt/oracle11g/112010/win32_11gR2_database_1of2.ziphttp://download.oracle.com/otn/nt/oracle11g/112010/win32_11gR2_database_2of2.zip 2.解压两个压缩包…
一. t-tests 这一部分我们使用分布在MASS包中的UScrime数据集.它是关于美国47个州在1960年时,关于惩罚制度对犯罪率的影响. Prob:监禁(坐牢)的概率: U1:14到24岁的城市那你的失业率: U2:35到39岁的城市男子的失业率: So:an indicator variable for Southern states 1. 独立的t-test(independent t-test) t.test(y~x,data) t.tset(y1,y2) 例01: > libra…
8.4 Confidence Intervals for One Population Mean When σ Is Unknown 原先是 standardized version of x bar: 当没有提供population 的标准差时,采用S(样本标准差作为population 标准差),即studentized version of x bar t-Distributions and t-Curves t-curves have more spread than the stand…
Previously in this series: The beta distribution Empirical Bayes estimation Credible intervals The Bayesian approach to false discovery rates Bayesian A/B testing Beta-binomial regression Understanding empirical Bayesian hierarchical modeling Mixture…
http://blog.csdn.net/pipisorry/article/details/52227580 Statsmodels Statsmodels is a Python package that provides a complement to scipy for statistical computations including descriptive statistics and estimation of statistical models. statsmodels原名叫…
Statistics in Python Materials for the “Statistics in Python” euroscipy 2015 tutorial. Requirements Standard scientific Python environment (numpy, scipy, matplotlib) Pandas Statsmodels Seaborn To install Python and these dependencies, we recommend th…
1.What are “Parametric Statistics”? 统计中的参数指的是总体的一个方面,而不是统计中的一个方面,后者指的是样本的一个方面.例如,总体均值是一个参数,而样本均值是一个统计量.参数统计检验对总体参数和数据的分布进行假设.这些类型的测试包括学生的T测试和方差分析测试,假设数据来自正态分布. A parameter in statistics refers to an aspect of a population, as opposed to a statistic,…