Viral-genome-assembly
文章目录
目前一般的方法有VICUNA, PRICE, IVA, 因此,我们尝试使用这3种方法进行组装。
本地工具
VICUNA
VICUNA出自大名鼎鼎的borad研究所,但在2012年后,就停止了开发。
PRICE
DeRisi实验室出品,2014年停止开发。组装需提供一个seed sequence
IVA
总结之前的工具,发表于Gigascience
网页工具
VirAmp
默认使用velvet
进行组装
VirAmp workflow
Virus Assembly Pipeline contains a set of programs to assemble the the virus genome from shotgun sequence in a fast and accurate way with a related genome as guide. In general, it contains the following steps:
- Quality Control
This step trims out the low quality bases in reads from both end using seqtk toolkit
- Diginal Normalization
Digital normalization (diginorm) uses a single-pass computational algorithm to reduce the high and redundant coverage in shotgun sequence dataset, thus significantly decreasing time and memory requirements for the de novo assembly while retaining the accuracy.
Notice Since Digital Normalization is designed to deal with extreme high coverage data (x1000 or above), by default it’s not turned on. One can choose to turn on at the input panel.
- De novo assembly (velvet)
This step uses the velvet program to do a de novo assembly from the shotgun sequence. This step results in a set of contigs of various lengths.
- Reference-guided Scaffolding (AMOScmp)
Contigs generated from velvet in step 3 are oriented and connected into scaffolds by the reference-guided assembly tool AMOScmp
- Scaffolding using paired-end information (SSPACE)
Uses paired-end reads from the original dataset to extend and scaffold contigs
- Assembly assessment
QUAST (Quality Assessment Tool for Genome Assemblies) program is used for the statistical assessment of the assemblies resulting from step 4
- SNV generation
MUMmer is used to align the assembly results to the reference genome and report SNVs.
- Draft and reference genome comparison
MUMmer is used to compare the assembly results (draft genome) with the reference genome.
VirusTAP
https://www.frontiersin.org/articles/10.3389/fmicb.2016.00032/full
需注册