MAST 397B: Introduction to Statistical Computing
MAST 397B: Introduction to Statistical Computing
ABSTRACT
Notes: (i) This project can be done in groups. If it is done
in a group, you have to submit the copy for the group
(not individuals). In this case the cover page must have all
the group members with their ID numbers along with a
statement of contributions of each member of the group.
(ii) You should present references to all materials (online
or otherwise) in your report. (ii) All the codes should be
put in an appendix. (iii) Answers should be clearly stated;
a not-well written report will get only partial credit.
Instructor: Yogen Chaubey
MAST 397B
FINAL PROJECT
Due Date: December 2, 2019
MAST 397B: Introduction to Statistical Computing
Final Project
Due Date: December 2, 2019 [Hard Copies only]
Problem 1. [20 Points]
Fitting distributions to a given dataset is an important problem in statistical analysis. R
contains a package called fitdistrplus that facilitates fitting various known continuous
distributions. In general fitting a distribution requires the knowledge of the form of the
distribution such as the Gaussian distribution given by the probability density function (pdf)
????(????) = 1 ????√(2????) ????????????{? 12????2 (???? ? ????)2}; ???? ∈ (?∞, ∞).
The vector ???? = (????, ????2) is known as the parameter vector and is estimated from a random
sample (????1, ????2, … , ????????). Consider the data named goundbeef, available with the package
fitdistrplus. Fit the following two distributions for this dataset (a) log-normal distribution
(b) Gamma distribution.
(i) Use the maximum likelihood (ML) method for the log-normal distribution and
method of moments (MM) for the Gamma distribution. Note that ???? is said to have
log-normal distribution if ???? = log ???? has a normal distribution and that the Gamma
pdf with shape parameter ???? and scale parameter ???? is given by
????(????) = 1 ????????Γ(????) ?????????1 exp{ ? ???????? }; ???? ≥ 0
Use a standard statistical text for explicit formulae in order to calculate these estimators
using your own defined function in R.
(ii) Use the package fitdistrplus to find the ML and MM estimators for the two
distributions.
(iii) One method of justifying a given distribution is to perform a Chi-square goodness-of?fit test. It is given by the test statistic
????2 = ?????????? ? ?????????2 ????????2 ????????=1
Here we assume that the data is grouped into k groups (???? = # ???????? ???????????????? ???????? ????????? ?????????????????????????????????) ,
???????? is the observed frequency in ????????? group and ???????? is the frequency in ????????? group under the fitted
model.
This has to be computed by the formula, ???????? = ????????????, ???????? is the probability of the observation
代做MAST 397B作业、代写R语言留学生作业
being in group ???? in the model. If the model fits, the test statistic ????2 has a Chi-square
distribution with df= ????=k-1-p where p= No. of estimated parameters.
Compute the ????2 statistic for the above data for a suitable value of ????; note that for the test to
be valid each group must have 5 or more observations. Find the upper 5% value of the
appropriate ????2 distribution and compare the computed value (for both the models) in
deciding if the models fit the data. [Note: The observed value of ????2 greater than 5% value of
χ2 with df= ???? indicates poor fit].
(iv) Quality of the fits may also be gauged by plotting the histogram with estimated
density super-imposed over it. Provide the histogram with the estimated density
super-imposed over it for both the methods for each of the log-normal and gamma
distributions and comment on the quality of the fit.
(v) Another qualitative method to judge the fit is the Q-Q plot of the data. Give the QQ
plots for both the methods for each of the log-normal and Gamma densities. Comment
on the quality of fit in each case. How does it compare with your conclusion in part
(iii).
Problem 2. [15 Points]
Problem 3 [10 Points]
Consider the following data from Example 7.12
(a)The objective is to determine a line ???? = ????0 + ????1???? such that the function
????(????0, ????1) = ? |???????? ? ????0 ? ????1????????| ????????=1
is minimized. Use optim( ) function of R with starting values obtained from lm( ).
(b) Plot the least square line and the line obtained in part (a) on the scatterplot and
comment on the fit of these lines to the data.
(c) Suppose another point (2.05,3.23) is added to the data. Compute the two lines again
and comment on the effect of the new point on the estimates.
因为专业,所以值得信赖。如有需要,请加QQ:99515681 或邮箱:99515681@qq.com
微信:codehelp
MAST 397B: Introduction to Statistical Computing的更多相关文章
- Brief introduction to Scala and Breeze for statistical computing
Brief introduction to Scala and Breeze for statistical computing 时间 2013-12-31 03:17:19 Darren Wilk ...
- Introduction to Parallel Computing
Copied From:https://computing.llnl.gov/tutorials/parallel_comp/ Author: Blaise Barney, Lawrence Live ...
- The R Project for Statistical Computing
[Home] Download CRAN R Project About R Contributors What’s New? Mailing Lists Bug Tracking Conferenc ...
- Introduction to statistical learning:with Applications in R (书,数据,R代码,链接)
http://faculty.marshall.usc.edu/gareth-james/ http://faculty.marshall.usc.edu/gareth-james/ISL/
- How-to: Do Statistical Analysis with Impala and R
sklearn实战-乳腺癌细胞数据挖掘(博客主亲自录制视频教程) https://study.163.com/course/introduction.htm?courseId=1005269003&a ...
- Evolutionary Computing: 5. Evolutionary Strategies(2)
Resource: Introduction to Evolutionary Computing, A.E.Eliben Outline recombination parent selection ...
- Evolutionary Computing: 4. Review
Resource:<Introduction to Evolutionary Computing> 1. What is an evolutionary algorithm? There ...
- A Gentle Introduction to the Gradient Boosting Algorithm for Machine Learning
A Gentle Introduction to the Gradient Boosting Algorithm for Machine Learning by Jason Brownlee on S ...
- A Statistical View of Deep Learning (V): Generalisation and Regularisation
A Statistical View of Deep Learning (V): Generalisation and Regularisation We now routinely build co ...
随机推荐
- PhantomJS简单使用
PhantomJS下载地址: http://phantomjs.org/download.html 简单使用: from selenium import webdriver # 要想调用键盘按键操 ...
- 数据解析模块BeautifulSoup简单使用
一.准备环境: 1.准备测试页面test.html <html> <head> <title> The Dormouse's story </title> ...
- MQ选型对比ActiveMQ,RabbitMQ,RocketMQ,Kafka 消息队列框架选哪个?
最近研究消息队列,发现好几个框架,搜罗一下进行对比,说一下选型说明: 1)中小型软件公司,建议选RabbitMQ.一方面,erlang语言天生具备高并发的特性,而且他的管理界面用起来十分方便.不考虑r ...
- AOD.NET实现数据库事物Transaction
在开始介绍文章主要内容前先简单说一下事务 1.事务介绍 事务是一种机制.是一种操作序列,它包含了一组数据库操作命令,这组命令要么全部执行,要么全部不执行.因此事务是一个不可分割的工作逻辑单元.在数据库 ...
- 7 CentOS 7网卡配置
首先重中之重:修改前一定要进行系统备份,如果是虚拟机进行快照 查看虚拟机的网卡配置 注意桥接模式和NAT模式 桥接模式:网络层面,虚拟机和PC处于同级地位,虚拟机直接和路由器相连 NA ...
- [b0013] Hadoop 版hello word mapreduce wordcount 运行(三)
目的: 不用任何IDE,直接在linux 下输入代码.调试执行 环境: Linux Ubuntu Hadoop 2.6.4 相关: [b0012] Hadoop 版hello word mapred ...
- idea加载springboot 项目热加载失效
需要打开 help -> find action ->registry ->其中的compiler.automake.allow.when.app.running勾上
- 5.4 RDD编程---综合案例
一.求top值 任务描述:求出多个文件中数值的最大.最小值 二.求最大最小值 任务描述:求出多个文件中数值的最大.最小值 解题思路:通过一个人造的key,让所有的值都成为“key”的value-lis ...
- 【洛谷P4542】 [ZJOI2011]营救皮卡丘(费用流)
洛谷 题意: 给出\(n\)个点,\(m\)条边,现在有\(k,k\leq 10\)个人从\(0\)号点出发前往\(n\)点. 规定若某个人想要到达\(x\)点,则\(1\)~\(x-1\)号点都有人 ...
- P4677 山区建小学|区间dp
P4677 山区建小学 题目描述 政府在某山区修建了一条道路,恰好穿越总共nn个村庄的每个村庄一次,没有回路或交叉,任意两个村庄只能通过这条路来往.已知任意两个相邻的村庄之间的距离为di 为了提高山区 ...