

















1 Downloading and installing R



Windows mirror:  http://mirror.fcaglp.unlp.edu.ar/CRAN/






2 Installing and using R packages


First, once R has been installed as described in section 1, open it. Typing library() in the R console lists the packages that are currently installed.


abind Combine Multidimensional Arrays
assertthat Easy Pre and Post Assertions
base The R Base Package
BH Boost C++ Header Files
boot Bootstrap Functions (Originally by Angelo Canty for S)
car Companion to Applied Regression
carData Companion to Applied Regression Data Sets
cellranger Translate Spreadsheet Cell Ranges to Rows and Columns
class Functions for Classification
cli Helpers for Developing Command Line Interfaces
clipr Read and Write from the System Clipboard
cluster "Finding Groups in Data": Cluster Analysis Extended Rousseeuw et al.
codetools Code Analysis Tools for R
compiler The R Compiler Package
crayon Colored Terminal Output
curl A Modern and Flexible Web Client for R
data.table Extension of `data.frame`
datasets The R Datasets Package
ellipsis Tools for Working with ...
fansi ANSI Control Sequence Aware String Functions
forcats Tools for Working with Categorical Variables (Factors)
foreign Read Data Stored by 'Minitab', 'S', 'SAS', 'SPSS', 'Stata', 'Systat', 'Weka', 'dBase', ...
graphics The R Graphics Package
grDevices The R Graphics Devices and Support for Colours and Fonts
grid The Grid Graphics Package
haven Import and Export 'SPSS', 'Stata' and 'SAS' Files
hms Pretty Time of Day
KernSmooth Functions for Kernel Smoothing Supporting Wand & Jones (1995)
lattice Trellis Graphics for R
lme4 Linear Mixed-Effects Models using 'Eigen' and S4
magrittr A Forward-Pipe Operator for R
maptools Tools for Handling Spatial Objects
MASS Support Functions and Datasets for Venables and Ripley's MASS
Matrix Sparse and Dense Matrix Classes and Methods
MatrixModels Modelling with Sparse And Dense Matrices
methods Formal Methods and Classes
mgcv Mixed GAM Computation Vehicle with Automatic Smoothness Estimation
minqa Derivative-free optimization algorithms by quadratic approximation
nlme Linear and Nonlinear Mixed Effects Models
nloptr R Interface to NLopt
nnet Feed-Forward Neural Networks and Multinomial Log-Linear Models
openxlsx Read, Write and Edit XLSX Files
parallel Support for Parallel computation in R
pbkrtest Parametric Bootstrap and Kenward Roger Based Methods for Mixed Model Comparison
pillar Coloured Formatting for Columns
pkgconfig Private Configuration for 'R' Packages
prettyunits Pretty, Human Readable Formatting of Quantities
progress Terminal Progress Bars
quantreg Quantile Regression
R6 Encapsulated Classes with Reference Semantics
Rcpp Seamless R and C++ Integration
RcppEigen 'Rcpp' Integration for the 'Eigen' Templated Linear Algebra Library
readr Read Rectangular Text Data
readxl Read Excel Files
rematch Match Regular Expressions with a Nicer 'API'
rio A Swiss-Army Knife for Data I/O
rlang Functions for Base Types and Core R and 'Tidyverse' Features
rpart Recursive Partitioning and Regression Trees
sp Classes and Methods for Spatial Data
SparseM Sparse Linear Algebra
spatial Functions for Kriging and Point Pattern Analysis
splines Regression Spline Functions and Classes
stats The R Stats Package
stats4 Statistical Functions using S4 Classes
survival Survival Analysis
tcltk Tcl/Tk Interface
tibble Simple Data Frames
tools Tools for Package Development
translations The R Translations Package
utf8 Unicode Text Processing
utils The R Utils Package
zip Cross-Platform 'zip' Compression


>help(package="car")        # car stands for the name of whichever package you want help on






>update.packages()    # if no package is specified, all installed packages are updated by default
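The install-then-load workflow for a package can be sketched roughly as follows. This is only a minimal sketch: load_or_install() is a hypothetical helper written for this example (it is not part of R), and the real install.packages() call needs network access and a CRAN mirror such as the one listed in section 1.

```r
# Names of every installed package (the same set that library() lists)
installed <- rownames(installed.packages())

# load_or_install() is a hypothetical helper, not a built-in R function
load_or_install <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    install.packages(pkg)  # needs network access and a configured CRAN mirror
  }
  library(pkg, character.only = TRUE)  # attach the package, given its name as a string
}

load_or_install("stats")       # stats ships with R, so nothing is downloaded here
print("stats" %in% installed)  # check whether a package name is in the installed set
```

For a package that is not installed yet, for example load_or_install("car"), the helper would download it first and then attach it.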

3 Reusing results

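>head(mtcars)                      # mtcars is a built-in dataset
>lm(mpg~wt, data=mtcars)           # lm fits a linear model
>Result = lm(mpg~wt, data=mtcars)  # store the fitted model so it can be reused
>predict(Result, mynewdata)        # mynewdata is your own data frame of values to predict for (it needs a wt column)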
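To make the reuse step concrete, here is a small sketch in which the new data is spelled out. The data frame new_cars and its three weights are made up for illustration; the only requirement is that it has a column named wt, because that is the predictor the model was fitted on.

```r
# Fit once and keep the model object
fit <- lm(mpg ~ wt, data = mtcars)

# Made-up weights (in units of 1000 lbs, like mtcars$wt) to predict for
new_cars <- data.frame(wt = c(2.5, 3.0, 3.5))

# Reuse the stored model: no refitting is needed
predicted_mpg <- predict(fit, newdata = new_cars)

print(coef(fit))       # intercept and slope of the fitted line
print(predicted_mpg)   # one predicted mpg value per row of new_cars
```

Because the fitted slope for wt is negative, the predicted mpg drops as the weight goes up.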


4 Handling large datasets in R




If you are interested in databases, see: Which database was the most popular in 2019?     https://mp.weixin.qq.com/s/9fhPicVCjMpfMmjbhZUoFA

5 R data structures




(1) Vector  vector

>a <- c(1,3,5,7,2,-4)
>b <- c("one","two","three")
>d <- c(1,3,5,"ONE")     # mixing numbers and strings coerces every element to character
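A short sketch of what these vectors contain and how to pull elements out of them. It reuses the vectors a and d from above; the indexing calls are standard R, and the specific positions are picked only for illustration.

```r
a <- c(1, 3, 5, 7, 2, -4)
d <- c(1, 3, 5, "ONE")

print(class(a))  # "numeric"
print(class(d))  # "character": a vector holds a single type, so the numbers became strings

third  <- a[3]            # a single element, counted from 1
picked <- a[c(1, 3, 5)]   # several positions at once
middle <- a[2:4]          # a contiguous range

print(third)    # 5
print(picked)   # 1 5 2
print(middle)   # 3 5 7
```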



(2) Matrix  matrix

>y <- matrix(5:24, nrow=4, ncol=5)
>x <- c(2,45,68,94)
>rnames <- c("R1","R2")
>cnames <- c("C1","C2")
>newMatrix <- matrix(x, nrow=2, ncol=2, byrow=TRUE, dimnames=list(rnames,cnames)) # fill row by row
>newMatrix <- matrix(x, nrow=2, ncol=2,dimnames=list(rnames,cnames)) # filled column by column by default
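The difference between the two fill orders is easiest to see by looking at one cell. The sketch below uses the same x as above; the names by_row and by_col are chosen just for this example.

```r
x <- c(2, 45, 68, 94)

by_row <- matrix(x, nrow = 2, ncol = 2, byrow = TRUE)  # rows are (2, 45) and (68, 94)
by_col <- matrix(x, nrow = 2, ncol = 2)                # columns are (2, 45) and (68, 94)

print(by_row[1, 2])  # row 1, column 2 -> 45
print(by_col[1, 2])  # row 1, column 2 -> 68

# Filling by row is the transpose of filling by column
print(identical(t(by_row), by_col))  # TRUE
```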

(3) Array  array

>dim1 <- c("A1","A2", "A3")
>dim2 <- c("B1", "B2")
>dim3 <- c("C1","C2", "C3", "C4")
>d <- array(1:24, c(3,2,4), dimnames=list(dim1,dim2,dim3))
> d
, , C1

   B1 B2
A1  1  4
A2  2  5
A3  3  6

, , C2

   B1 B2
A1  7 10
A2  8 11
A3  9 12

, , C3

   B1 B2
A1 13 16
A2 14 17
A3 15 18

, , C4

   B1 B2
A1 19 22
A2 20 23
A3 21 24

> d[1,2,3]
[1] 16
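Why d[1,2,3] comes out as 16 can be checked by hand: an array is filled in column-major order, so the first index moves fastest. The sketch below rebuilds the same 3 x 2 x 4 array without dimnames (the names arr and offset are chosen for this example) and computes the position explicitly.

```r
arr <- array(1:24, dim = c(3, 2, 4))

# Element in row 1, column 2, layer 3
by_index <- arr[1, 2, 3]

# The same element by its position in the underlying vector:
# 1 + (column - 1) * nrow + (layer - 1) * nrow * ncol
offset <- 1 + (2 - 1) * 3 + (3 - 1) * 3 * 2
by_offset <- arr[offset]

print(by_index)         # 16
print(by_offset)        # 16
print(dim(arr[, , 3]))  # leaving an index empty keeps that whole dimension: 3 2
```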

(4) Data frame  data.frame()

>patientID <- c(1,2,3,4)
>age <- c(25,34,28,52)
>diabetes <- c("Type1", "Type2", "Type3", "Type2")
>status <- c("poor", "Improved", "Excellent", "poor")
>patientData <- data.frame(patientID, age, diabetes, status)
> patientData
  patientID age diabetes    status
1         1  25    Type1      poor
2         2  34    Type2  Improved
3         3  28    Type3 Excellent
4         4  52    Type2      poor
> diabetes <- factor(diabetes)
> diabetes
[1] Type1 Type2 Type3 Type2
Levels: Type1 Type2 Type3
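Once the data frame exists, columns and rows can be pulled out of it in a few standard ways. The sketch below rebuilds a three-column version of the patient data so that it runs on its own; the age cutoff of 30 and the name older are arbitrary choices for illustration.

```r
patientData <- data.frame(
  patientID = c(1, 2, 3, 4),
  age       = c(25, 34, 28, 52),
  diabetes  = factor(c("Type1", "Type2", "Type3", "Type2"))
)

print(patientData$age)  # one column, as a plain vector

# Rows where the condition is TRUE, all columns kept
older <- patientData[patientData$age > 30, ]
print(older)

# A factor stores categories, so it can be counted per level
type_counts <- table(patientData$diabetes)
print(type_counts)
```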

(5) List  list

> g <- "My first list"
> h <- c(12,23,34)
> j <- c("one","two","three")
> k <- matrix(1:10, nrow=2)
> mylist <- list(g,h,j,k)
> mylist
[[1]]
[1] "My first list"

[[2]]
[1] 12 23 34

[[3]]
[1] "one"   "two"   "three"

[[4]]
     [,1] [,2] [,3] [,4] [,5]
[1,]    1    3    5    7    9
[2,]    2    4    6    8   10
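Getting elements back out of a list is where [[ ]] and [ ] differ, which the printed output above hints at with its [[1]], [[2]] labels. In this sketch the element names title and ages are made up for illustration; the original list is unnamed.

```r
mylist <- list(title = "My first list", ages = c(12, 23, 34))

second_value <- mylist[[2]]   # the element itself: a numeric vector
same_by_name <- mylist$ages   # the same element, accessed by name
second_slot  <- mylist[2]     # single brackets return a list of length one

print(second_value)
print(is.list(second_slot))     # TRUE
print(is.numeric(second_value)) # TRUE
```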



6 Worked example

>age <- c(1,3,5,2,11,9,3,9,12,3)
>weight <- c(4.4, 5.3, 7.2, 5.2, 8.5, 7.3, 6.0, 10.4, 10.2, 6.1)
>mean(weight) # mean
>sd(weight) # standard deviation (use var() for the variance)
>cor(age, weight) # correlation between age and weight
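Since sd() and var() are easy to mix up, here is a quick check of how they relate, using the same age and weight vectors. The variable names avg, sd_weight, var_weight and r are chosen for this sketch.

```r
age    <- c(1, 3, 5, 2, 11, 9, 3, 9, 12, 3)
weight <- c(4.4, 5.3, 7.2, 5.2, 8.5, 7.3, 6.0, 10.4, 10.2, 6.1)

avg        <- mean(weight)  # 7.06
sd_weight  <- sd(weight)    # standard deviation
var_weight <- var(weight)   # variance

# The variance is the square of the standard deviation
print(all.equal(var_weight, sd_weight^2))  # TRUE

r <- cor(age, weight)  # Pearson correlation, always between -1 and 1
print(r)
```

A positive value of r here means weight tends to increase with age in this sample.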

7 Recommended reading

Recommendation 1: Data Analysis from Scratch in Practice | Basics  https://mp.weixin.qq.com/s/4ESKjlF4B63IveiIlfCdDA

Recommendation 2: 8 Tips for Getting Started in Data Analysis    https://mp.weixin.qq.com/s/FYQ192iwstn2J2QejDvNhA




