bioawk

More articles related to bioawk

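https://github.com/lh3/bioawk 1. Basic idea and usage: usage: bioawk [-F fs] [-v var=value] [-c fmt] [-tH] [-f progfile | 'prog'] [file ...] The basic idea of bioawk is to expose the basic elements of different file types (sam, bam, fasta, fastq, vcf) as variables that can be used directly. The names listed above can be used to reference the corresponding variables. 2. Practical example: print the FASTA sequence ID, sequence, length, and GC content: bioawk -c f…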
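For comparison, the per-record length and GC content mentioned in this excerpt can be sketched in plain awk, which makes it clearer what bioawk's named fields save you from writing by hand. This is not the command from the article (that one is truncated above); the function name `fasta_stats` is hypothetical, the sequence itself is left out of the output for brevity, and the GC fraction simply counts G and C over all characters in the record.

```shell
# Hypothetical helper: print ID, length and GC fraction for each FASTA record.
# Plain awk only; multi-line (wrapped) sequences are joined before counting.
fasta_stats() {
    awk '
        function report() {
            if (id == "") return               # nothing collected yet
            n = length(seq)                    # total sequence length
            g = gsub(/[GCgc]/, "", seq)        # gsub returns the number of G/C hits
            printf "%s\t%d\t%.2f\n", id, n, (n > 0 ? g / n : 0)
        }
        /^>/ { report(); id = substr($1, 2); seq = ""; next }   # new header line
        { seq = seq $0 }                       # append a sequence line
        END { report() }                       # flush the last record
    ' "$1"
}
```

Calling `fasta_stats some.fa` writes one tab-separated line per record; only the first word of each header is kept as the ID.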

Sorting genome FASTA sequence IDs (chromosome/scaffold names) with bioawk?

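Contents: Requirement, Implementation. Requirement: given a genome sequence whose chromosome or scaffold IDs are in no particular order, sort the records numerically by ID. Original order: Desired order: Implementation: use bioawk; if it is not installed, install it directly with conda. bioawk -c fastx '{print}' old.genome.fa | \ sort -k1,1V | awk '{print ">"$1;print $2}' >new.genome.fa https://www.biostars.org/p/494201/…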
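Where bioawk is not available, the same linearize, sort, rebuild idea can be sketched with plain awk. This is an illustrative variant, not the article's command: the function name `sort_fasta_by_id` is hypothetical, and it assumes GNU sort (for the `V` version-sort key) and IDs that contain no tab characters. As with the bioawk pipeline, anything after the first word of the header is dropped and each sequence is written on a single line.

```shell
# Hypothetical helper: reorder FASTA records by ID in natural (version) order.
sort_fasta_by_id() {
    awk '
        /^>/ {                                 # header line starts a new record
            if (id != "") print id "\t" seq    # flush the previous record
            id = substr($1, 2)                 # first word after ">"
            seq = ""
            next
        }
        { seq = seq $0 }                       # join wrapped sequence lines
        END { if (id != "") print id "\t" seq }
    ' "$1" |
    sort -t "$(printf '\t')" -k1,1V |          # version sort on the ID column (GNU sort)
    awk -F '\t' '{ print ">" $1; print $2 }'   # rebuild two-line FASTA records
}
```

For example, `sort_fasta_by_id old.genome.fa > new.genome.fa` places chr2 before chr10, which a plain lexical sort would not do.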

Linux command line exercises for NGS data processing

by Umer Zeeshan Ijaz The purpose of this tutorial is to introduce students to the frequently used tools for NGS analysis as well as giving experience in writing one-liners. Copy the required files to your current directory, change directory (cd) to t…