bioawk】的更多相关文章

https://github.com/lh3/bioawk 1.基本思想 使用: usage: bioawk [-F fs] [-v var=value] [-c fmt] [-tH] [-f progfile | 'prog'] [file ...] bioawk基本思想是把组成不同类型的文件(sam.bam.fasta.fastq.vcf)的基本元素封装成变量,直接调用即可. 上面出现的名称即可引用其变量. 2.实际例子 打印fasta序列ID.序列.长度.GC含量: bioawk -c f…
目录 需求 实现 需求 已知某基因组序列,染色体或scaffold ID顺序不定,想要对其按数字排序. 原顺序: 想要的排序结果: 实现 使用bioawk,没有的话conda直接安装. bioawk -c fastx '{print}' old.genome.fa | \ sort -k1,1V | awk '{print ">"$1;print $2}' >new.genome.fa https://www.biostars.org/p/494201/…
by Umer Zeeshan Ijaz The purpose of this tutorial is to introduce students to the frequently used tools for NGS analysis as well as giving experience in writing one-liners. Copy the required files to your current directory, change directory (cd) to t…