MAST 397B: Introduction to Statistical Computing
MAST 397B: Introduction to Statistical Computing
ABSTRACT
Notes: (i) This project can be done in groups. If it is done
in a group, you have to submit the copy for the group
(not individuals). In this case the cover page must have all
the group members with their ID numbers along with a
statement of contributions of each member of the group.
(ii) You should present references to all materials (online
or otherwise) in your report. (ii) All the codes should be
put in an appendix. (iii) Answers should be clearly stated;
a not-well written report will get only partial credit.
Instructor: Yogen Chaubey
MAST 397B
FINAL PROJECT
Due Date: December 2, 2019
MAST 397B: Introduction to Statistical Computing
Final Project
Due Date: December 2, 2019 [Hard Copies only]
Problem 1. [20 Points]
Fitting distributions to a given dataset is an important problem in statistical analysis. R
contains a package called fitdistrplus that facilitates fitting various known continuous
distributions. In general fitting a distribution requires the knowledge of the form of the
distribution such as the Gaussian distribution given by the probability density function (pdf)
????(????) = 1 ????√(2????) ????????????{? 12????2 (???? ? ????)2}; ???? ∈ (?∞, ∞).
The vector ???? = (????, ????2) is known as the parameter vector and is estimated from a random
sample (????1, ????2, … , ????????). Consider the data named goundbeef, available with the package
fitdistrplus. Fit the following two distributions for this dataset (a) log-normal distribution
(b) Gamma distribution.
(i) Use the maximum likelihood (ML) method for the log-normal distribution and
method of moments (MM) for the Gamma distribution. Note that ???? is said to have
log-normal distribution if ???? = log ???? has a normal distribution and that the Gamma
pdf with shape parameter ???? and scale parameter ???? is given by
????(????) = 1 ????????Γ(????) ?????????1 exp{ ? ???????? }; ???? ≥ 0
Use a standard statistical text for explicit formulae in order to calculate these estimators
using your own defined function in R.
(ii) Use the package fitdistrplus to find the ML and MM estimators for the two
distributions.
(iii) One method of justifying a given distribution is to perform a Chi-square goodness-of?fit test. It is given by the test statistic
????2 = ?????????? ? ?????????2 ????????2 ????????=1
Here we assume that the data is grouped into k groups (???? = # ???????? ???????????????? ???????? ????????? ?????????????????????????????????) ,
???????? is the observed frequency in ????????? group and ???????? is the frequency in ????????? group under the fitted
model.
This has to be computed by the formula, ???????? = ????????????, ???????? is the probability of the observation
代做MAST 397B作业、代写R语言留学生作业
being in group ???? in the model. If the model fits, the test statistic ????2 has a Chi-square
distribution with df= ????=k-1-p where p= No. of estimated parameters.
Compute the ????2 statistic for the above data for a suitable value of ????; note that for the test to
be valid each group must have 5 or more observations. Find the upper 5% value of the
appropriate ????2 distribution and compare the computed value (for both the models) in
deciding if the models fit the data. [Note: The observed value of ????2 greater than 5% value of
χ2 with df= ???? indicates poor fit].
(iv) Quality of the fits may also be gauged by plotting the histogram with estimated
density super-imposed over it. Provide the histogram with the estimated density
super-imposed over it for both the methods for each of the log-normal and gamma
distributions and comment on the quality of the fit.
(v) Another qualitative method to judge the fit is the Q-Q plot of the data. Give the QQ
plots for both the methods for each of the log-normal and Gamma densities. Comment
on the quality of fit in each case. How does it compare with your conclusion in part
(iii).
Problem 2. [15 Points]
Problem 3 [10 Points]
Consider the following data from Example 7.12
(a)The objective is to determine a line ???? = ????0 + ????1???? such that the function
????(????0, ????1) = ? |???????? ? ????0 ? ????1????????| ????????=1
is minimized. Use optim( ) function of R with starting values obtained from lm( ).
(b) Plot the least square line and the line obtained in part (a) on the scatterplot and
comment on the fit of these lines to the data.
(c) Suppose another point (2.05,3.23) is added to the data. Compute the two lines again
and comment on the effect of the new point on the estimates.
因为专业,所以值得信赖。如有需要,请加QQ:99515681 或邮箱:99515681@qq.com
微信:codehelp
MAST 397B: Introduction to Statistical Computing的更多相关文章
- Brief introduction to Scala and Breeze for statistical computing
Brief introduction to Scala and Breeze for statistical computing 时间 2013-12-31 03:17:19 Darren Wilk ...
- Introduction to Parallel Computing
Copied From:https://computing.llnl.gov/tutorials/parallel_comp/ Author: Blaise Barney, Lawrence Live ...
- The R Project for Statistical Computing
[Home] Download CRAN R Project About R Contributors What’s New? Mailing Lists Bug Tracking Conferenc ...
- Introduction to statistical learning:with Applications in R (书,数据,R代码,链接)
http://faculty.marshall.usc.edu/gareth-james/ http://faculty.marshall.usc.edu/gareth-james/ISL/
- How-to: Do Statistical Analysis with Impala and R
sklearn实战-乳腺癌细胞数据挖掘(博客主亲自录制视频教程) https://study.163.com/course/introduction.htm?courseId=1005269003&a ...
- Evolutionary Computing: 5. Evolutionary Strategies(2)
Resource: Introduction to Evolutionary Computing, A.E.Eliben Outline recombination parent selection ...
- Evolutionary Computing: 4. Review
Resource:<Introduction to Evolutionary Computing> 1. What is an evolutionary algorithm? There ...
- A Gentle Introduction to the Gradient Boosting Algorithm for Machine Learning
A Gentle Introduction to the Gradient Boosting Algorithm for Machine Learning by Jason Brownlee on S ...
- A Statistical View of Deep Learning (V): Generalisation and Regularisation
A Statistical View of Deep Learning (V): Generalisation and Regularisation We now routinely build co ...
随机推荐
- C#中窗口关闭时没有取消事件订阅导致事件重复执行的解决方法
场景 C#中委托与事件的使用-以Winform中跨窗体传值为例: https://blog.csdn.net/BADAO_LIUMANG_QIZHI/article/details/100150700 ...
- Jenkins持续集成的应用--基础
1.测试工程师为什么要掌握持续集成 一个程序员如果想发布一个产品,他需要编码.编译.测试,发布的过程.对于一个企业来说,如果也想发布一个产品的话,同样的也是需要上述的过程,区别在于企业要发布的产品的需 ...
- 有两个CIDR地址块208.128/11和208.130.28/22。是否有那一个地址块包含了另一个地址?如果有,请指出,并说明理由。
有两个CIDR地址块208.128/11和208.130.28/22.是否有那一个地址块包含了另一个地址?如果有,请指出,并说明理由. 208.128/11的前缀为:11010000 100: 208 ...
- @property与@xxx.setter的用法
类中@property与@xxx.setter的方法介绍. 简单说,@property就是将定义的函数(方法)当作属性对象使用,不需要像调用函数那样去调用,而@xxx.setter是为@xxx的这样函 ...
- Java中级—转发和重定向的区别
在设计Web应用程序的时候,经常需要把一个系统进行结构化设计,即按照模块进行划分,让不同的Servlet来实现不同的功能,例如可以让其中一个Servlet接收用户的请求,另外一个Servlet来处理用 ...
- leetcode之求众数
求众数 给定一个大小为 n 的数组,找到其中的众数. 你可以假设数组是非空的,并且给定的数组总是存在众数. 示例 1: 输入: [3,2,3] 输出: 3 示例 2: 输入: [2,2,1,1,1,2 ...
- HeadFirst设计模式---观察者
表达公式 注册者 + 订阅者 = 观察者模式 设计气象站 气象站接口 /** ** 布告板 ** @author lollipop ** @since 2019/11/24 **/ public in ...
- apache主配置文件设置
主配置文件:httpd.conf #设置管理员邮箱地址ServerAdmin admin@example.com#定义apache安装根目录变量Define SRVROOT "F:\www\ ...
- JS高阶---浏览器内核
不同浏览器的内核,不太一样 360双核切换机制 一般涉及到金钱交易时,会切换到Trident内核,因为IE内核安全性较稳 不涉及金钱利益时,则会使用webkit内核 (1)内核是由很多模块构成 注意: ...
- HTML与CSS学习笔记(1)
1.web三大核心技术? HTML CSS JavaScript 2.HTML基本机构和属性 HTML:超文本 标记 语言 超文本:文本内容+非文本内容(图片.视频.音频等) 标记:<单词> ...