Identifying horizontal gene transfer with Alienness
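Official description
Alienness takes as input the result file produced by aligning protein sequences against the nr database.
Its requirements for blastp are as follows: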
-option value | Description |
---|---|
-outfmt 7 | 7 = tabular with comment lines !MANDATORY |
-db nr | BLAST database name. For a better coverage of the biodiversity, NCBI's nr library is recommended but not necessary. The protein library must have gi or accession numbers that exist in the NCBI database. |
-seg no | The SEG program is used to mask or filter low complexity regions in amino acid queries |
-evalue 1e-3 | Expect value (E) for saving hits |
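
The options in the table can be assembled into a single blastp call. The following is a minimal sketch, assuming BLAST+ is installed and a local copy of nr is available. The helper `build_blastp_command` and the file names `proteins.faa` and `blastp_vs_nr.tsv` are hypothetical, and `-query` and `-out` are the usual BLAST+ input and output options rather than part of the table above.

```python
import shlex


def build_blastp_command(query, out, db="nr"):
    """Return the blastp argument list with the options listed in the table."""
    return [
        "blastp",
        "-query", query,    # protein FASTA file (hypothetical name)
        "-db", db,          # nr is recommended but not required
        "-outfmt", "7",     # tabular with comment lines (mandatory)
        "-seg", "no",       # SEG low-complexity filtering switched off
        "-evalue", "1e-3",  # E-value threshold for saving hits
        "-out", out,        # result file to give to Alienness (hypothetical name)
    ]


if __name__ == "__main__":
    cmd = build_blastp_command("proteins.faa", "blastp_vs_nr.tsv")
    # Print the command so it can be inspected or pasted into a shell.
    print(shlex.join(cmd))
```

Building the command as a list keeps each option next to its value, and the same list could be passed to `subprocess.run` instead of being printed.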
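Notes
In practice, Alienness needs column 2 (subject id) of the result file to contain gi or accession numbers, so that it can identify the source species of each subject. The format should be as follows: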
|
|
Sometimes, however, column 2 of the BLAST output is subject acc.ver, which contains only part of the accession number:
|
|
In that case, we do not get the result we need.
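
One way to spot this before submitting a file is to read the `# Fields:` comment line that `-outfmt 7` writes and check how column 2 is labelled. The following is a minimal sketch, assuming the comment line has the usual BLAST+ layout (`# Fields:` followed by comma-separated column names). The function name `second_column_label` and the sample lines are hypothetical.

```python
def second_column_label(lines):
    """Return the label of column 2 from the first '# Fields:' line, or None."""
    prefix = "# Fields:"
    for line in lines:
        if line.startswith(prefix):
            # Column names are comma-separated after the prefix.
            fields = [name.strip() for name in line[len(prefix):].split(",")]
            return fields[1] if len(fields) > 1 else None
    return None


# Illustrative header lines, not taken from a real run.
sample = [
    "# BLASTP",
    "# Fields: query acc.ver, subject acc.ver, % identity, alignment length",
]

label = second_column_label(sample)
if label == "subject id":
    print("column 2 is labelled 'subject id'")
else:
    print(f"column 2 is labelled {label!r}; check the identifiers before running Alienness")
```

For a real result file, an open file handle can be passed in place of `sample`, since the function only iterates over lines.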