python 中统计fasta文件中每条序列的长度

 

001、

(base) root@PC1:/home/test2# ls
a.fasta  test.py
(base) root@PC1:/home/test2# cat a.fasta       ## 测试fasta文件
>gene1 myc
AGCTGCCTAAGC
GGCATAGCTAATCG
>gene2 jun
ACCGAATCGGAGCGATG
GGCATTAAAGATCTAGCT
>gene3 malat1
AGGCTAGCGAG
GCGCGAG
GATTAGGCG
(base) root@PC1:/home/test2# cat test.py       ## 测试程序
#!/usr/bin/python
in_file = open("a.fasta", "r")
dict1 = dict()

for i in in_file:
    i = i.strip()
    if i.startswith(">"):
        key = i
        dict1[key] = ""
    else:
        dict1[key] += i

for i,j in dict1.items():
    print(i, ":", len(j))

in_file.close()
(base) root@PC1:/home/test2# python test.py      ## 执行程序
>gene1 myc : 26
>gene2 jun : 35
>gene3 malat1 : 27

 

参考:https://mp.weixin.qq.com/s?__biz=MzkyMTI1MTYxNA==&mid=2247493739&idx=1&sn=f690c93761307e6ec9bb77cca2eb4619&chksm=c184d21af6f35b0cda1d964ed896adee1091e1f615b7f6be0caf2508105275ca3ae66889c58e&mpshare=1&scene=23&srcid=0811LY0ghlyV0yNXki8WcW6m&sharer_sharetime=1660215059305&sharer_shareid=50b75c6a886e09824b582fb782a7678b#rd

 

posted @ 2022-08-11 19:46  小鲨鱼2018  阅读(225)  评论(0编辑  收藏  举报