7. RNAseq Downstream Analysis
Created by Dennis C Wylie, last modified on Jun 29, 2015
Downstream analysis covers machine learning methods (including clustering, dimensionality reduction, classification and regression modeling, and resampling techniques), ANOVA modeling, and empirical Bayes analysis.
Unsupervised Analysis
Unsupervised methods support exploratory data analysis and give a big-picture view of your data: they can provide valuable QC information, help assess expected trends, and identify unexpected patterns.
- Deliverables:
- Plots in png and pdf format
- Results from any additional algorithms applied may be provided as tab-delimited or Excel-formatted tables, as appropriate
- Tools Used:
- Hierarchical Clustering: of both genes and samples.
- Principal Components Analysis (PCA): biplot of the data after centering on both the gene and sample axes (optionally also scaling the gene axis).
- Other methods (e.g., k-means clustering, self-organizing maps, multidimensional scaling) are available if desired.
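The double centering described for the PCA biplot can be sketched as follows. This is a minimal illustration with NumPy, not the facility's actual script: the function names, the genes-by-samples matrix layout, and the choice to put the singular values on the gene coordinates are all assumptions made for the example.

```python
import numpy as np

def double_center(x):
    """Remove gene (row) means and sample (column) means, adding back the grand mean."""
    x = np.asarray(x, dtype=float)
    return x - x.mean(axis=1, keepdims=True) - x.mean(axis=0, keepdims=True) + x.mean()

def pca_biplot_coords(x, scale_genes=False, n_components=2):
    """Return gene and sample coordinates for a PCA biplot of a genes x samples matrix.

    Assumed convention: singular values are absorbed into the gene coordinates,
    so gene_coords @ sample_coords.T approximates the centered matrix.
    """
    x = np.asarray(x, dtype=float)
    if scale_genes:
        # Optional scaling of the gene axis: divide each gene by its standard deviation.
        sd = x.std(axis=1, ddof=1, keepdims=True)
        sd[sd == 0] = 1.0  # leave constant genes unchanged
        x = x / sd
    centered = double_center(x)
    u, s, vt = np.linalg.svd(centered, full_matrices=False)
    k = n_components
    gene_coords = u[:, :k] * s[:k]        # genes x k
    sample_coords = vt[:k, :].T           # samples x k
    explained = (s ** 2 / np.sum(s ** 2))[:k]  # fraction of variance per component
    return gene_coords, sample_coords, explained

# Hypothetical toy input: 100 genes x 8 samples of log-scale expression values.
rng = np.random.default_rng(0)
expr = rng.normal(size=(100, 8))
gene_coords, sample_coords, explained = pca_biplot_coords(expr)
```

Plotting `sample_coords` as points and `gene_coords` as arrows (or a second point cloud) on the same axes gives the biplot; other biplot scaling conventions are equally valid.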
Empirical Bayes Differential Expression Analysis
RNAseq experiments yield simultaneous measurements of many intrinsically similar variables (gene expression levels), often with limited sample sizes. Empirical Bayes methods are designed for exactly this situation: they "borrow strength" across genes to increase statistical power and reduce false discoveries.
- Deliverables:
- Tables of model parameters, p-values, and FDR q-values (in tab-delimited and Excel formats)
- Boxplots (stratified by sample group) and pairs plots of top genes, provided in png and pdf format
- Tools Used:
- Limma: applies empirical Bayes methods when fitting linear models (e.g., t-tests, ANOVA) for a wide variety of experimental designs. Originally designed for microarray data analysis, Limma has since been substantially extended by its developers to handle RNAseq data as well.
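To make "borrowing strength" concrete, here is a small sketch of two ingredients behind the deliverables above: shrinking each gene's variance toward a shared prior, and converting p-values to Benjamini-Hochberg FDR-adjusted values. It is an illustration only, not Limma's implementation. In particular, Limma estimates the prior variance and prior degrees of freedom from the data, whereas this sketch takes them as given (hypothetical) inputs, and all function names are invented for the example.

```python
def shrink_variances(sample_vars, df_resid, prior_var, prior_df):
    """Moderated variance per gene: a degrees-of-freedom-weighted average of
    the gene's own variance and a prior variance shared across genes.

    prior_var and prior_df are assumed known here (hypothetical inputs).
    """
    return [
        (prior_df * prior_var + df_resid * s2) / (prior_df + df_resid)
        for s2 in sample_vars
    ]

def bh_qvalues(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that adjusted values
    # are non-decreasing in the rank of the raw p-value.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        q[i] = running_min
    return q

# Toy usage: a noisy low-variance gene is pulled up toward the prior,
# and a high-variance gene is pulled down.
moderated = shrink_variances([1.0, 4.0], df_resid=4, prior_var=2.0, prior_df=4)
adjusted = bh_qvalues([0.01, 0.04, 0.03, 0.005])
```

With few samples per group, `df_resid` is small, so the prior dominates and gene-wise variance estimates are stabilized; with many samples, each gene's own variance carries more weight.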
Supervised Analysis
Many methods are available for classification and regression, as appropriate to your analysis. Model performance may be assessed using standard metrics evaluated under cross-validation, or on independent test sets if available. Analysis will be conducted using R and/or Python scripts.
- Deliverables:
- Tables of results (in tab-delimited and Excel formats)
- Plots in png and pdf format
- R and/or Python source files
- Binary, JSON, or XML representations of R or Python objects can be made available if desired
- Further reports in the form of slides or text documents may be provided in standard formats (pdf, doc, ppt) if desired
- Methods Available:
- Diagonal linear discriminant analysis (DLDA, a form of linear naive Bayes classification)
- Linear and quadratic discriminant analysis
- Logistic regression including L1/lasso and/or L2/ridge regularization if desired
- Partial least squares (PLS) discriminant analysis and regression
- k-nearest neighbors (KNN)
- Support vector machines (SVM)
- Decision tree ensembles (Random Forests or AdaBoost)
- Other methods are available on request.
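As an illustration of how a classifier from this list might be assessed under cross-validation, the sketch below implements diagonal linear discriminant analysis (DLDA) with NumPy and estimates its accuracy with k-fold cross-validation. It is a simplified example under stated assumptions, not the facility's pipeline: the function names are hypothetical, the data layout is samples by genes, and the folds are random rather than stratified by class.

```python
import numpy as np

def dlda_fit(x, y):
    """Fit DLDA: per-class gene means plus one pooled per-gene variance."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y)
    classes = np.unique(y)
    means = np.vstack([x[y == c].mean(axis=0) for c in classes])
    # Residuals of each sample from its own class mean.
    resid = x - means[np.searchsorted(classes, y)]
    pooled_var = (resid ** 2).sum(axis=0) / (len(y) - len(classes))
    pooled_var = np.maximum(pooled_var, 1e-8)  # guard against zero variance
    priors = np.array([(y == c).mean() for c in classes])
    return classes, means, pooled_var, priors

def dlda_predict(model, x):
    """Assign each sample to the class with the highest discriminant score."""
    classes, means, pooled_var, priors = model
    x = np.asarray(x, dtype=float)
    # Variance-scaled squared distance of every sample to every class mean.
    d2 = (((x[:, None, :] - means[None, :, :]) ** 2) / pooled_var).sum(axis=2)
    scores = -0.5 * d2 + np.log(priors)
    return classes[np.argmax(scores, axis=1)]

def cross_validated_accuracy(x, y, n_folds=5, seed=0):
    """Estimate accuracy with k-fold cross-validation (random, unstratified folds)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    correct = 0
    for test_idx in folds:
        train_mask = np.ones(len(y), dtype=bool)
        train_mask[test_idx] = False
        # The model is refit on the training samples only in each fold.
        model = dlda_fit(x[train_mask], y[train_mask])
        correct += np.sum(dlda_predict(model, x[test_idx]) == y[test_idx])
    return correct / len(y)

# Hypothetical toy data: 40 samples x 10 genes, two well-separated groups.
rng = np.random.default_rng(1)
x = np.vstack([rng.normal(0, 1, (20, 10)), rng.normal(3, 1, (20, 10))])
y = np.array([0] * 20 + [1] * 20)
accuracy = cross_validated_accuracy(x, y)
```

Any gene filtering or feature selection should be repeated inside each training fold; selecting genes on the full data set before cross-validation leaks information from the held-out samples and inflates the estimate.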