linux 中blast序列比对

001、对数据库构建索引

[root@PC1 001_protein_database]# ls
protein.faa                         ## 构建索引
[root@PC1 001_protein_database]# makeblastdb -in protein.faa -dbtype prot -title xxx -parse_seqids -hash_index -out index -logfile log.txt
[root@PC1 001_protein_database]# ls
index.pdb  index.phi  index.pin  index.pog  index.pot  index.ptf  log.txt
index.phd  index.phr  index.pjs  index.pos  index.psq  index.pto  protein.faa
[root@PC1 001_protein_database]# cat log.txt


Building a new DB, current time: 07/15/2023 20:05:09
New DB name:   /home/blast_test/001_protein_database/index
New DB title:  xxx
Sequence type: Protein
Keep MBits: T
Maximum file size: 3000000000B
Adding sequences from FASTA; added 48265 sequences in 0.836226 seconds.

makeblastdb：构建索引的软件

-in protein.faa：要构建索引的数据库（这里是拟南芥的蛋白质序列）

-dbtype : 指定数据类型，prot为蛋白质、nucl为核酸；这里是蛋白质，因此选prot

-parse_seqids：帮助我们解析fa文件中，“>”后面的id信息

-parse_seqids, -hash_index: 两个参数一般都带上，主要是为blastdbcmd取子序列时使用；

-title：给数据库起个名(不能用在后面搜索时-db的参数)

-out：后接数据库名

-logfile：输出日志文件。

02、可以直接使用简洁命令：

makeblastdb -in input_file -dbtype prot/nucl -out index

002、balst软件的主要程序

01、blastn：核酸比对到核酸

02、blastp：蛋白质比对到蛋白质

03、blastx：核酸比对到蛋白质

04、tbalstn：蛋白质比对到核酸

05、tblastx: 核酸比对到核酸；（核酸首先翻译成蛋白质，然后进行比对）。

003、NCBI中的几个主要数据库

01、nt：核苷酸库

02、nr：蛋白质库

03、swissprot：蛋白质库。

。

posted @ 2023-07-15 12:04 小鲨鱼2018 阅读(426) 评论(0) 编辑收藏举报

会员力量，点亮园子希望

刷新页面返回顶部

小鲨鱼2018

linux 中blast序列比对

公告