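RNA-seq is a powerful tool. Most wet-lab PIs are sitting on large amounts of transcriptome data, so demand for RNA-seq analysis should be high (most people working in bioinformatics probably touch it at some point). Standard bulk transcriptome analysis has only a few routines: differentially expressed genes, enrichment analysis, WGCNA networks, and some fancy analyses of little practical use. DEG analysis is the foundation: find the up- and down-regulated genes, run an enrichment, and see which pathways actually changed after treatment. WGCNA mainly uses correlation to find co-expressed gene modules. Single-cell transcriptomics offers many more options; it can be thought of as bulk transcriptomics with a huge number of samples, bulk transcript…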
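The correlation idea behind co-expression modules can be sketched in a few lines. This is only a toy illustration of the first step (a correlation-based adjacency, here |r| raised to a power `beta`), not the WGCNA package itself; the function names, the `beta=6` default, the `cutoff` value, and the expression values are all assumptions made for the example.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

def coexpressed_pairs(expr, beta=6, cutoff=0.5):
    """Return gene pairs whose soft-thresholded adjacency |r|**beta >= cutoff.

    expr maps gene name -> list of expression values across samples.
    beta and cutoff are illustrative values, not recommendations.
    """
    genes = sorted(expr)
    pairs = []
    for i, g1 in enumerate(genes):
        for g2 in genes[i + 1:]:
            adjacency = abs(pearson(expr[g1], expr[g2])) ** beta
            if adjacency >= cutoff:
                pairs.append((g1, g2))
    return pairs

# Hypothetical expression values for three genes across four samples.
expr = {
    "geneA": [1, 2, 3, 4],
    "geneB": [2, 4, 6, 8],
    "geneC": [5, 1, 4, 2],
}
pairs = coexpressed_pairs(expr)
```

A real analysis would go on to cluster genes on a topological overlap measure; the sketch stops at pairwise adjacency to keep the correlation step visible.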
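RNA-Seq differential expression analysis: An extended review and a software tool. Correctly identifying differentially expressed genes (DEGs) between specific conditions is key to understanding phenotypic variation. High-throughput transcriptome sequencing (RNA-Seq) has become the main choice for these studies. As a result, the number of methods and software packages for differential expression analysis of RNA-Seq data has also grown rapidly. However, there is still no consensus on the most appropriate pipeline for identifying differentially expressed genes from RNA-Seq data…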
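Differential expression analysis for paired RNA-seq data. Abstract. Background: RNA-Seq technology measures transcript abundance by generating sequence reads and counting their frequencies across different biological conditions. To identify genes that are differentially expressed between two conditions, it is important to consider the experimental design as well as the distributional properties of the data. In many RNA-Seq studies, expression data are obtained as multiple pairs, for example pre- and post-treatment samples from the same individual. We seek to incorporate the paired structure into the analysis. Results: We present a Bayesian hierarchical mixture model for RNA-Seq data that separately accounts for the variability arising from the paired data structure…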
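The ASE work has reached another key step: generating a table that can decide whether there is differential expression. I plan to borrow from the output of cuffdiff and edgeR. How cuffdiff describes differentially expressed genes, in fourteen columns: Column 1, test_id: a unique identifier describing the transcript, gene, primary transcript, or CDS being tested, e.g. XLOC_000003. Column 2, gene_id, e.g. XLOC_000003. Column 3, gene. Column 4,…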
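Still notes from two years ago. 1. prepare-reference: If you use RSEM to quantify transcripts from an aligned BAM, make sure the index used during alignment was generated by rsem-prepare-reference. ~/software/rsem/rsem-prepare-reference \ --transcript-to-gene-map ~/project/RNA-seq/ref_cds/gene_transcript.txt \ # adds two columns, gene name and transcript name, to the later quantification result files; this…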
When computing expression levels from RNA-seq data with tophat and cufflinks, a gene that has multiple expression values in one sample needs those values merged. CollapseFPKM: This code is a solution to collapsing duplicate FPKMs for a gene. Problem/Issue: In the cufflinks output…
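A minimal sketch of the collapsing step, assuming the duplicates for a gene are merged by summing their FPKMs (the excerpt does not say which rule CollapseFPKM actually uses; taking the maximum is another common choice). The function name, the input shape, and the example values are hypothetical.

```python
from collections import defaultdict

def collapse_fpkm(rows):
    """Merge duplicate FPKM entries so each gene appears once.

    rows: iterable of (gene, fpkm) pairs, e.g. parsed from a cufflinks table.
    Duplicates are summed here; that rule is an assumption, not the
    documented behavior of any particular tool.
    """
    totals = defaultdict(float)
    for gene, fpkm in rows:
        totals[gene] += float(fpkm)
    return dict(totals)

# Hypothetical rows: TP53 is reported twice in the same sample.
rows = [("TP53", 1.5), ("TP53", 2.5), ("ACTB", 100.0)]
collapsed = collapse_fpkm(rows)
```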
Abstract This paper presents a fundus image analysis based computer aided system for automatic classification and grading of cataract, which provides great potentials to reduce the burden of well-experienced ophthalmologists (the scarce resources) an…
Systematic LncRNA Classification From: http://www.arraystar.com/Services/Services_main.asp?ID=307 Analyzing the genomic context of LncRNAs can help predict their functional role. According to the relationship between LncRNAs and their associated prot…
Created by Dennis C Wylie, last modified on Jun 29, 2015 Machine learning methods (including clustering, dimensionality reduction, classification and regression modeling, resampling techniques, etc.), ANOVA modeling, and empirical Bayes analysis. Uns…
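[protocol] GO enrichment analysis. Background: You can look up what enrichment analysis is on Baidu; I have yet to find an introduction that is easy to follow. Just think of it as a statistical method for checking significance. There are many kinds of enrichment analysis; the most common is GO enrichment, and there is also pathway enrichment. [I don't know how to do the pathway kind yet ::>_<:: ] Tools: there are many of these too; here I mainly use Ontologizer (links:http://compbio.charite.de/contao/index.php/cmdlin…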
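Nanjing University "Software Analysis": Data Flow Analysis @(static analysis) Contents: Overview of data flow analysis; Applications of data flow analysis: Reaching Definitions Analysis (may analysis), Live Variables Analysis (may analysis), Available Expressions Analysis (must analysis). Data flow analysis, related concepts. may analysis: the output may be true, so it over-approximates to stay sound, and can…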
Problem description: As a gene engineer of a gene engineering project, Enigma encountered a puzzle about gene recombination. It is well known that a gene can be considered as a sequence, consisting of four nucleotides, which are simply denoted by four letters, A, C,…
1.          Summary The document is about the general idea of the architecture design of the Bitizens game, the detail logic is more complicated than what is documented here, and the actual logic will be somewhat different from what is documented her…
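Problem links: http://acm.zju.edu.cn/onlinejudge/showProblem.do?problemCode=1027 http://poj.org/problem?id=1080 Solution report: 1. Similar to LCS. 2. gene[i][j] holds the score up to str1[i-1] and str2[j-1]; where one string has no character to pair (a gap), points are deducted. 3. Recurrence: temp1=gene[i-1][j-1]+score[_map[str1[i-1]]][_map[str2[j-1]]]; temp2=gene[i-1][j]…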
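The LCS-like recurrence can be sketched as a standard global alignment DP. The excerpt is cut off after temp2, so the gap terms and the boundary rows below follow the usual formulation and are an assumption, not the author's code. The `toy_score` function is a hypothetical stand-in for the problem's score table.

```python
def best_alignment_score(str1, str2, score, gap="-"):
    """Maximum alignment score of str1 and str2.

    gene[i][j] is the best score for aligning str1[:i] with str2[:j].
    score(a, b) returns the score of pairing characters a and b, where
    either may be the gap symbol.
    """
    n, m = len(str1), len(str2)
    gene = [[0] * (m + 1) for _ in range(n + 1)]
    # Boundary rows: one string is exhausted, the other is paired with gaps.
    for i in range(1, n + 1):
        gene[i][0] = gene[i - 1][0] + score(str1[i - 1], gap)
    for j in range(1, m + 1):
        gene[0][j] = gene[0][j - 1] + score(gap, str2[j - 1])
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            temp1 = gene[i - 1][j - 1] + score(str1[i - 1], str2[j - 1])  # pair both
            temp2 = gene[i - 1][j] + score(str1[i - 1], gap)              # gap in str2
            temp3 = gene[i][j - 1] + score(gap, str2[j - 1])              # gap in str1
            gene[i][j] = max(temp1, temp2, temp3)
    return gene[n][m]

def toy_score(a, b):
    """Hypothetical scoring: +1 for a match, -1 for a mismatch or a gap."""
    return 1 if a == b and a != "-" else -1

result = best_alignment_score("ACGT", "AGT", toy_score)
```

For the actual problem, `score` would look up the table given in the problem statement instead of `toy_score`.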
1605 - Gene recombination Time Limit: 2s Memory Limit: 64MB Submissions: 264 Solved: 46 DESCRIPTION As a gene engineer of a gene engineering project, Enigma encountered a puzzle about gene recombination. It is well known that a gene can be considered…
This article comes from HERE. ARS-L1: Learning Tuesday 10:30–12:30; Oral Session; Room: Leonard de Vinci 10:30 ARS-L1.1: GROUP STRUCTURED DIRTY DICTIONARY LEARNING FOR CLASSIFICATION Yuanming Suo, Minh Dao, Trac Tran, Johns Hopkins University, USA; Hojj…
[Compiled by 怪毛匠子] awesome-single-cell List of software packages (and the people developing these methods) for single-cell data analysis, including RNA-seq, ATAC-seq, etc. Contributions welcome... Software packages RNA-seq anchor - [Python] - ⚓ Find bimodal,…
https://github.com/Jialab-UCR/GDCRNATools GDCRNATools - An R package for downloading, organizing, and integrative analyzing lncRNA, mRNA, and miRNA data in GDC Introduction The Genomic Data Commons (GDC) maintains standardized genomic, clinical, and…
http://www.gene-quantification.de/liquid-biopsy.html Liquid Biopsy -- Definitions Liquid Biopsy -- reliable biomarkers Liquid Biopsy -- the role in cancer diagnostics Liquid Biopsy -- the role of Exosomes Biofluids Guidelines Liquid Biopsy Research P…
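Paper ID: 19Mar - 11. Third read, 2019-04-23, to grasp the essence: if we trust this method, what is its essence, and how is the core gene set assembled? First consider sample selection: is there obvious stratification among the samples? Second read, 2019-04-01, close reading: I have already found that my data shows no clear pattern in PCA. It should be possible to get valuable information directly from bulk RNA-seq, so what advantage does single cell actually have? Answer: single-cell data is necessary; it can anchor the core genes to case-control pseudotime,…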
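Note: It has been almost 8 years since I entered bioinformatics. Bioinformatics combines my three favorite subjects: biology, computer science, and mathematics. Yet if someone suddenly asked me what bioinformatics is, I still could not give an answer I am satisfied with. Hence this blog post. Origins: It is said that in 1970 the Dutch scientists Paulien Hogeweg and Ben Hesper first coined the term "bioinformatica" in Dutch, and the English "bioinformatics" was first used in 1978. The two scientists used the term at the time to mean: The study of…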
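The GO enrichment analysis tool on our cloud platform needs only simple input tables and parameters, but many users do not understand the underlying principle or how to read the results, so this post explains both in detail. 1. Introduction to GO enrichment: Gene Ontology (GO) is an internationally standardized classification system for gene function. It provides a dynamically updated controlled vocabulary for comprehensively describing the attributes of genes and gene products in an organism. GO has three ontologies, which describe a gene's molecular function, cellular compon…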
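The excerpt is cut off before it reaches the statistics. A common way to test whether a GO term is over-represented in a gene list is a one-sided hypergeometric test, sketched below under that assumption; whether the cloud platform's tool uses exactly this test is not stated in the excerpt. The function name, the parameter names, and the example counts are hypothetical.

```python
from math import comb

def enrichment_pvalue(N, K, n, k):
    """One-sided hypergeometric p-value P(X >= k).

    N: number of annotated background genes
    K: background genes annotated with the GO term
    n: genes in the list of interest (e.g. DEGs)
    k: genes in the list annotated with the GO term
    """
    upper = min(K, n)
    total = comb(N, n)
    # Sum the probability of drawing k, k+1, ..., upper annotated genes.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total

# Hypothetical counts: 15 of 500 DEGs carry a term that annotates
# 200 of 20000 background genes.
p = enrichment_pvalue(N=20000, K=200, n=500, k=15)
```

In practice one p-value is computed per GO term, and the set of p-values is then corrected for multiple testing.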
Some basic concepts: intracellular biochemical signaling pathways; extracellular mechanical cues. The key mechanical machinery consists of myosin motors that enable contractile force generation and assist in cytoskeletal crosslinking, adhesion comp…
Using support vector machines to predict whether gene expression samples are cancerous. As we mentioned earlier, gene expression analysis has a wide variety of applications, including cancer studies. In 1999, Uri Alon analyzed gene expression data for 2,000 genes from 40 colon tumor tissues and compared them with d…
1. What is KOBAS 3.0? KOBAS (KEGG Orthology Based Annotation System) is a web server for gene/protein functional annotation (Annotation module) and functional set enrichment (Enrichment module). Given a set of genes or protein, it can determine wheth…
Accepted Papers     Title Primary Subject Area ID 3D computer vision 93 UPnP: An optimal O(n) solution to the absolute pose problem with universal applicability 128 Video Registration to SfM Models 168 Image-based 4-d Modeling Using 3-d Change Detect…
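Translation of a CNN survey article. [2019 CVPR] A Survey of the Recent Architectures of Deep Convolutional Neural Networks (translation). A survey of deep convolutional neural network architectures: from basic components to structural innovations. Contents: Abstract    1. Introduction    2. Basic CNN components        2.1 Convolutional layer        2.2 Pooling layer        2.3 Activation function        2.4 Batch normalization        2.5 Dropout        2.6 Fully connected layer…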
http://exploredegrees.stanford.edu/coursedescriptions/cs/ CS 101. Introduction to Computing Principles. 3-5 Units. Introduces the essential ideas of computing: data representation, algorithms, programming "code", computer hardware, networking, s…
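Reference: product manual. What are the advantages of PacBio third-generation full-length transcriptome sequencing? In recent years, with the development of high-throughput sequencing, transcriptome sequencing has become the main approach for studying gene expression regulation. However, transcript reconstruction from second-generation data has low accuracy, whereas third-generation sequencing yields full-length transcripts directly, with no assembly needed. This can improve gene expression quantification, reveal new genes and transcript isoforms, and identify alternative splicing and gene fusion events. The first Google hit is the official introduction, which makes the literal meaning of Iso-Seq immediately clear. Iso-Seq stands for isoform sequencing (in Chinese, 同源异构体测序), and is in fact another RNA sequencing technology. The challeng…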