We provide here detailed Description about the files outputted from the somatic mutation annotators via ANNOVAR and SnpEff.

    • *_annoTable.txt from the annotator via ANNOVAR
Column Names Description
Chr Chromosome number
Start Start position
End End position
Ref Reference base(s)
Alt Alternate non-reference alleles called on at least one of the samples
COSMIC ID COSMIC ID
Func.refGene Regions (e.g., exonic, intronic, non-coding RNA)) that one variant hits; please click here for details.
Gene.refGene Gene name associated with one variant
ExonicFunc.refGene Exonic variant function, e.g., nonsynonymous, synonymous, frameshift insertion.please click here for details.
AAChange.refGene Amino acid change. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change.
SIFT_score SIFT score. See the dbNSFP information table for details.
SIFT_pred SIFT prediction. See the dbNSFP information table for details.
Polyphen2_HDIV_score Pholyphen2 score based on HDIV. See the dbNSFP information table for details.
Polyphen2_HDIV_pred Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details.
Polyphen2_HVAR_score Polyphen2 score based on HVAR. See the dbNSFP information table for details.
Polyphen2_HVAR_pred Polyphen2 prediction based on HVAR. See the dbNSFP information table for details.
LRT_score LRT score. See the dbNSFP information table for details.
LRT_pred LRT prediction. See the dbNSFP information table for details.
MutationTaster_score MutationTaster score. See the dbNSFP information table for details.
MutationTaster_pred MutationTaster prediction. See the dbNSFP information table for details.
MutationAssessor_score MutationTaster score. See the dbNSFP information table for details.
MutationAssessor_pred MutationTaster prediction. See the dbNSFP information table for details.
FATHMM_score FATHMM score. See the dbNSFP information table for details.
FATHMM_pred FATHMM prediction. See the dbNSFP information table for details.
PROVEAN_score PROVEAN score<. See the dbNSFP information table for details./td>
PROVEAN_pred PROVEAN prediction. See the dbNSFP information table for details.
VEST3_score VEST V3 score. See the dbNSFP information table for details.
CADD_raw CADD raw score. See the dbNSFP information table for details.
CADD_phred CADD phred-like score. See the dbNSFP information table for details.
DANN_score DANN score. See the dbNSFP information table for details.
fathmm-MKL_coding_score fathmm-MKL score for one coding variant. See the dbNSFP information table for details.
fathmm-MKL_coding_pred fathmm-MKL prediction for one coding variant. See the dbNSFP information table for details.
MetaSVM_score MetaSVM score. See the dbNSFP information table for details.
MetaSVM_pred MetaSVM prediction. See the dbNSFP information table for details.
MetaLR_score MetaLR score. See the dbNSFP information table for details.
MetaLR_pred MetaLR prediction. See the dbNSFP information table for details.
integrated_fitCons_score fitCons score<. See the dbNSFP information table for details./td>
integrated_confidence_value confidence level. See the dbNSFP information table for details.
GERP++_RS GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details.
phyloP7way_vertebrate Phylogenetic p-values for 7 vertebrate species. See the dbNSFP information table for details.
phyloP20way_mammalian Phylogenetic p-values for 20 mammalian species. See the dbNSFP information table for details.
phastCons7way_vertebrate PhastCons score for 7 vertebrate species. See the dbNSFP information table for details.
phastCons20way_mammalian phastCons p-values for 20 mammalian species. See the dbNSFP information table for details.
SiPhy_29way_logOdds SiPhy log odds score for 29 species. See the dbNSFP information table for details.
    • *_annoTable.txt from the annotator via SnpEff
Column Names Description
CHROM Chromosome number
POS Position
ID semi-colon separated list of unique identifiers where available. If this is a dbSNP variant it is encouraged to use the rs number(s).
REF Reference base(s)
ALT Alternate non-reference alleles called on at least one of the samples
EFFECT Functional consequences of one variant, e.g., missense_variant, synonymous_variant. please clickhere for details.
REGION Regions (e.g., exonic, intronic) that one variant hits
IMPACT Putative impact of the variant (e.g. HIGH, MODERATE or LOW impact).
GENE Gene name (usually HUGO)
GENEID Gene ID)
FEATURE The type of feature is in the next field (e.g. transcript, motif, miRNA, etc.)
FEATUREID Transcript ID (preferably using version number), Motif ID, miRNA, ChipSeq peak, Histone mark, depending on the annotation.
BIOTYPE Description on whether the transcript is {“Coding”, “Noncoding”}. Whenever possible, use ENSEMBL biotypes. .
HGVS_C Variant using HGVS notation (DNA level). For example, c.352A>G stands for A to G substitution of nucleotide 352. Click here for details.
HGVS_P Coding variant using HGVS notation (Protein level). For example, p.Ile118Val stands for Isoleucine at position number 66 substitution to Valine. p.Ile118Val can be also be represented by p.I118V using the 1-letter symbol here. Click here for details.
SIFT_score SIFT score. See the dbNSFP information table for details.
SIFT_pred SIFT prediction. See the dbNSFP information table for details.
Polyphen2_HDIV_score Pholyphen2 score based on HDIV. See the dbNSFP information table for details.
Polyphen2_HDIV_pred Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details.
Polyphen2_HVAR_score Polyphen2 score based on HVAR. See the dbNSFP information table for details.
Polyphen2_HVAR_pred Polyphen2 prediction based on HVAR. See the dbNSFP information table for details.
LRT_score LRT score. See the dbNSFP information table for details.
LRT_pred LRT prediction. See the dbNSFP information table for details.
MutationTaster_score MutationTaster score. See the dbNSFP information table for details.
MutationTaster_pred MutationTaster prediction. See the dbNSFP information table for details.
MutationAssessor_score MutationAssessor score. See the dbNSFP information table for details.
MutationAssessor_pred MutationAssessor prediction. See the dbNSFP information table for details.
FATHMM_score FATHMM score. See the dbNSFP information table for details.
FATHMM_pred FATHMM prediction. See the dbNSFP information table for details.
PROVEAN_score PROVEAN score<. See the dbNSFP information table for details./td>
PROVEAN_pred PROVEAN prediction. See the dbNSFP information table for details.
VEST3_score VEST V3 score. See the dbNSFP information table for details.
CADD_raw CADD raw score. See the dbNSFP information table for details.
CADD_phred CADD phred-like score. See the dbNSFP information table for details.
MetaSVM_score MetaSVM score. See the dbNSFP information table for details.
MetaSVM_pred MetaSVM prediction. See the dbNSFP information table for details.
MetaLR_score MetaLR score. See the dbNSFP information table for details.
MetaLR_pred MetaLR prediction. See the dbNSFP information table for details.
GERP++_NR GREP++ conservation score. See the dbNSFP information table for details.
GERP++_RS GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details.
phyloP100way_vertebrate Phylogenetic p-values for 100 vertebrate species. See the dbNSFP information table for details.
phastCons100way_vertebrate PhastCons score for 7 vertebrate species. See the dbNSFP information table for details.
SiPhy_29way_logOdds SiPhy log odds score for 29 species. See the dbNSFP information table for details.
    • *_genelist.txt from the annotators via ANNOVAR and SnpEff
Column Names Description
Gene Gene name associated with each variant; one gene name may correspond to several variants.
Mutations Amino acid change information. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change..
    • dbNSFP Information
Columns of Annotations from dbNSFP Database Pediction Algorithm/Conservation Score Description Method Categorical Prediction Author(s)
SIFT_pred 
SIFT_score
SIFT Sort intolerated from tolerated P(An amino acid at a position is tolerated | The most frequentest amino acid being tolerated) D: Deleterious (sift<=0.05);
T: tolerated (sift>0.05)
Pauline Ng, Fred Hutchinson 
Cancer Research Center, Seattle, Washington
Polyphen2_HDIV_pred 
Polyphen2_HDIV_score
Polyphen v2 Polymorphism phenotyping v2 D: Probably damaging (>=0.957), 
P: possibly damaging (0.453<=pp2_hdiv<=0.956), 
B: benign (pp2_hdiv<=0.452)
Probablistic Classifier Training sets: HumDiv Havard Medical School/td>
Polyphen2_HVAR_pred
Polyphen2_HVAR_score
Polyphen v2 Polymorphism phenotyping v2 Machine learning Training sets: HumVar D: Probably damaging (>=0.957), 
P: possibly damaging (0.453<=pp2_hdiv<=0.956);
B: benign (pp2_hdiv<=0.452)
Shamil Sunyaev
Havard Medical School
LRT_pred 
LRT_score
LRT Likelihood ratio test LRT of H0: each codon evolves neutrally vs H1: the codon evovles under negative selection D: Deleterious; 
N: Neutral;
U: Unknown
Lower scores are more deleterious
Sung Chung, Justin Fay Washington University
MutationTaster_pred 
MutationTaster_score
MutationTaster Bayes Classifier A: (""disease_causing_automatic""); 
D: (""disease_causing""); 
N: (""polymorphism [probably harmless]""); 
P: (""polymorphism_automatic[known to be harmless]"
higher values are more deleterious"
  Markus Schuelke
the Charité - Universitätsmedizin Berlin
MutationAssessor_pred 
MutationAssessor_score
MutationAssessor Entropy of multiple sequence alighnment H: high; 
M: medium; 
L: low; 
N: neutral. 
H/M means functional and L/N means non-functional higher values are more deleterious
  Reva Boris
Computation Biology Center Memorial Sloan Kettering Cancer Center
FATHMM_pred 
FATHMM_score
FATHMM HMM Functional analysis through hidden markov model HMM D: Deleterious; 
T: Tolerated;
lower values are more deleterious
Shihab Hashem
University of Bristol, UK
PROVEAN_pred 
PROVEAN_score
  Protein Variation Effect Analyzer Clustering of homologus sequences D: Deleterious; 
N: Neutral
higher values are more deleterious
Choi Y J. Craig Venter Institute
VEST3_score VEST V3 Variant effect scoring tool Random forest classifier higher values are more deleterious Rachel Karchin John Hopkins University
CADD_raw CADD_phred CADD Combined annotation dependent depletion Linear kernel SVM higher values are more deleterious   Jay Shendure, Xiaohui Xie University of California - Irvine
DANN_score DANN Deleterious Annotation of genetic variants using Neural Networks Neural network higher values are more deleterious Jay Shendure, Xiaohui Xie
University of California - Irvine
fathmm-MKL_coding_pred FATHMM-MKL predicting the effects of both coding and non-coding variants using nucleotide-based HMMs Classifier based on multiple kernel learning D: Deleterious; 
T: Tolerated
Score >= 0.5: D; 
Score < 0.5: T
Shihab Hashem
University of Bristol, UK
MetaSVM_pred 
MetaSVM_score
MetaSVM Support vector machine D: Deleterious; T: Tolerated;
higher scores are more deleterious
  Coco Dong
USC Biostatiscs Department
MetaLR_pred 
MetaLR_score
MetaLR Logistic regression D: Deleterious; 
T: Tolerated; 
higher scores are more deleterious
  Coco Dong 
USC Biostatiscs Department
integrated_fitCons_score 
integrated_confidence_value
FitCons Fitness consequences of functional annotation Integrate functional assays like ChIP-Seq with conservation measure of transcription factor binding sites higher scores are more deleterious Abriza
Cold Spring Harbor Lab
GERP++_RS
GERP++_NR
Genome Evolutionary Rate Profiling ++ maximum likelihood estimation procedure higher scores are more deleterious   Eugne Davydov
Stanford University, CS Department
phyloP7way_vertebrate PhyloP Phylogentic p-values Phylogentic p-values calculated from a LRT, score-based test, GERP test Use 7 species higher scores are more deleterious Adam Siepel 
UCSC
phyloP20way_mammalian PhyloP Phylogentic p-values a phylogenetic hidden Markov model (phylo-HMM) Use 20 species higher scores are more deleterious Adam Siepel
UCSC
phastCons7way_vertebrate phastCons A phylogenetic hidden Markov model (phylo-HMM) Use 7 species higher scores are more deleterious   Adam Siepel
UCSC
phastCons20way_mammalian phastCons a phylogenetic hidden Markov model (phylo-HMM) Use 20 species higher scores are more deleterious   Adam Siepel
UCSC
SiPhy_29_way SiPhy Probablistic framework, HMM Use 29 species higher scores are more deleterious   Manual Garber
Broad Institute of MIT & Harvard

>

Questions? Kindly contact arraytools [at] emmes.com using the subject heading detailed information for outputted files from somatic mutation annotators.

原文链接地址:https://brb.nci.nih.gov/seqtools/colexpanno.html

Detailed Information for Outputted Files from Somatic Mutation Annotators(annovar 注释文件条目详细解释)的更多相关文章

  1. Debugging Information in Separate Files

    [Debugging Information in Separate Files] gdb allows you to put a program's debugging information in ...

  2. MCP|DYM|Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenocarcinoma and validate a novel somatic mutation CDK12-G879V (利用定量质谱探究转移性肺腺瘤的蛋白质组异质性及验证新体细胞突变)

    文献名:Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenoca ...

  3. somatic mutation体细胞变异检测文献分享--转载

    转载 :http://blog.sina.com.cn/s/blog_83f77c940102xuro.html Kalatskaya I, Trinh Q M, Spears M, et al. I ...

  4. The absolute uri: [http://java.sun.com/jsp/jstl/core] cannot be resolved in either web.xml or the jar files deployed with this application] with root cause异常处理及解释

    1.问题描述: 在web的jsp文件中想用jstl这个标准库,在运行的时候很自然的引用jar包如下: <dependency> <groupId>javax.servlet.j ...

  5. How to check type of files without extensions in python? 不通过文件扩展名,怎样知道文件类型?

    有一个命令 file 可以用 $ file fuck fuck.png: PNG image data, 1122 x 750, 8-bit colormap, non-interlaced pyth ...

  6. BlackArch-Tools

    BlackArch-Tools 简介 安装在ArchLinux之上添加存储库从blackarch存储库安装工具替代安装方法BlackArch Linux Complete Tools List 简介 ...

  7. xsltproc docbook 转 html

    /etc/xml/catalog <?xml version="1.0" encoding="UTF-8"?> <catalog xmlns= ...

  8. curl-手册

    Manual -- curl usage explained Related: Man Page FAQ LATEST VERSION   You always find news about wha ...

  9. 【ANT】Ant常用的内置task

    ant 例如: <target name="callProjectB"> <echo message="In projectA calling proj ...

随机推荐

  1. AOP中的ASPECTJ

    一.准备 1.架包 2.配置文件 二.注解的形式 UserDao.java package cn.itcast.spring.aspectj.annocation; public class User ...

  2. struts2 之 struts2数据校验

    1. 数据校验一般分为2类:前端的校验(js校验),后端的校验(java代码):实际开发中大部分情况下都是采用js校验.在对数据安全要求较高的情况下可能会采用后端验证. 2.  Struts2提供了后 ...

  3. 以防忘记,建立stm32工程的过程

    然后会弹出manage run -time environment的运行环境界面,必须选择的有CMSIS下的core和device下的startup,至于其它的根据你需要的外设, 在stdperiph ...

  4. 初学strurs基础

    Struts2基础学习总结 Struts 2是在WebWork2基础发展而来的. 注意:struts 2和struts 1在代码风格上几乎不一样. Struts 2 相比Struts 1的优点: 1. ...

  5. 自定义TextView跑马灯

    本篇主要介绍TextView的可控制跑马灯效果实现. Android自带的TextView添加几个属性就可以实现跑马灯效果,大概是这样 android:ellipsize="marquee& ...

  6. DirectFB学习笔记三

    本篇目的,通过键盘的esc键控制程序退出.学习输入设备产生事件,接收事件,产生反应. 首先获取输入设备 IDirectFBInputDevice *keyboard = NULL; dfb->G ...

  7. Linux-进程描述(5)之进程环境

    main函数和启动例程 当内核使用一个exec函数执行C程序时,在调用main函数之前先调用一个特殊的启动例程,可执行程序将此例程指定为程序的起始地址.启动例程从内核获取命令行参数和环境变量,然后为调 ...

  8. HttpClient研究学习总结

    Http协议非常的重要,HttpClient相比传统JDK自带的URLConnection,增加了易用性和灵活性(具体区别,日后我们再讨论),它不仅是客户端发送Http请求变得容易,而且也方便了开发人 ...

  9. C语言精要总结-内存地址对齐与struct大小判断篇

    在笔试时,经常会遇到结构体大小的问题,实际就是在考内存地址对齐.在实际开发中,如果一个结构体会在内存中高频地分配创建,那么掌握内存地址对齐规则,通过简单地自定义对齐方式,或者调整结构体成员的顺序,可以 ...

  10. 蓝桥杯-分小组-java

    /* (程序头部注释开始) * 程序的版权和版本声明部分 * Copyright (c) 2016, 广州科技贸易职业学院信息工程系学生 * All rights reserved. * 文件名称: ...