如何解决追加时取出特定字符串
我正在尝试将某些DNA序列翻译成蛋白质序列,但遇到了一些核酸不完整的DNA序列。到目前为止,我已经通过使用它们在每次出现时增加一个计数器来克服它们,但是在添加它们时,我想将这些序列从我的列表中删除。 .strip()函数仅在出现特定字母时才将其取出,但是如果发现不完整的核酸,我希望从列表中删除整个字符串。 作为参考,我将大约100个这些序列(请参见下文)并全部翻译。综上所述,当仅找到一个不是ACTG的字母时,如何删除序列之一?
AAAATAAAAACAACCAAAATGAAGGCAAGACTACTGGTCCTGTTATGTGCACTTGCAGCTACAGATGCAG
ACACAATATGTATAGGCTACCATGCGAACAACTCAACCGATACTGTTGACACAGTACTCGAAAAGAATGT
GACAGTGACACACTCTGTCAACCTACTCGAAGACAGCCACAACGGGAAATTATGTAGATTAAAAGGAATA
GCCCCACTACAATTGAGGAAATGTAACATTGCTGGATGGATCCTGGGAAACCCAGAATGCGAATCACTGC
TTTCAGAGAGATCATGGTCCTACATTGTTGAAACACCAAACTCTGAGAATGGAACATGTTACCCAGGAGA
TTTTACCAACTATGAGGAACTGAGGGAGCAATTGAGCTCTGTATCATCATTCGAAAGATTCGAAATATTC
CCCAAGGAAAGCTCATGGCCCAAACACAACACAACCAGAGGAGTAACGGCAGCATGCTCCCATGCGGGAA
AAAGCAGTTTTTACAGAAATTTGCTATGGCTGACGGAGAAGGATGGCTCATATCCGAATCTGAACAATTC
CTATGTGAACAAGAAAGGGAAAGAAGTCCTTGTACTATGGGGTGTTCATCACCCGTCCAACATAAAGGAT
CAACAGACCCTCTATCAGAAAGAAAATGCTTATGTCTCTGTAGTGTCTTCAAACTATAACAGGAGATTCA
CCCCGGAAATAGCAGAAAGACCCAAAGTAAGAGGTCAAGCAGGGAGGATGAACTATTACTGGACCTTGCT
AAAACCCGGAGACACAATAATGTTTGAGGCAAATGGAAATCTAATAGCGCCATGGTATGCTTTCGCACTA
AGTAGAGGCTTTGGGTCAGGCATCATCACCTCAAACGCATCGATGCATGAGTGTGACACGAAGTGTCAAA
CACCCCAGGGAGCTATAAACAGCAGTCTCCCTTTCCAGAATATCCACCCAGTCACAATAGGAGAGTGCCC
AAAATACGTCAGGAGTACCAAATTGAGGATGGTTACAGGACTACGGAACATCCCATCCATTCAATCCAGA
GGTCTATTTGGAGCCATTGCCGGTTTCATTGAAGGGGGATGGACTGGAATGATAGATGGATGGTATGGTT
ATCATCATCAGAATGAACAGGGATCAGGCTATGCTGCGGATCAAAAAAGCACACAAAATGCCATTAACGG
GATTACAAACAAGGTGAACTCTGTTATCGAGAAGATGAACACTCAATTCACTGCTGTGGGTAAAGAATTC
AACAACTTAGAAAAAAGGATGGAAAACTTAAACAAAAAAGTTGATGATGGATTTCTGGACATTTGGACAT
ATAATGCAGAATTGTTGATTCTACTGGAAAATGAAAGGACTCTGGATTTCCATGACTCAAATGTGAAGAA
TCTGTATGAAAAAGTTAAAAGCCAATTAAGGAATAATGCCAAAGAAATAGGAAACGGGTGTTTTGAGTTC
TACCACAAGTGTAACAATGAATGCATGGAAAGTGTAAAAAATGGAACTTATGATTATCCAAAATATTCAG
AAGAATCAAAGTTGAACAGGGAAAAAATAGATGGAGTGAAATTGGAATCAATGGGGGTCTATCAGATTCT
GGCGATCTACTCAACTGTCGCCAGTTCACTGGTGCTTCTAGTCTCCCTGGGGGCAATCAGCTTCTGGATG
TGTTCTAATGGGTCTTTGCAGTGCAGAATATGCATCTGAGATCAGAATTTCAGAAATATAAGGAAAAAAA
C
到目前为止,正如我之前提到的,我的代码只是通过计算计数器来绕过任何错误。
protein = ''
protein_list = []
mutation = False
completed_HAseq = []
for sequence in selected_HAseqs:
for i in range(0,len(sequence)-2,3):
codon = sequence[i:i+3]
if codon not in FASTAdict:
total_mutation_count += 1
elif codon in FASTAdict:
protein += FASTAdict[codon]
completed_HAseq.append(sequence)
protein_list.append(protein)
protein = ''
除了检查RYMKSWHBVDN的每个字符串外,我什么都没想到,如果有,请删除它。如果没有,我将其附加到翻译序列的最终列表中
解决方法
您可以检查并替换子字符串
[https @ 0xe5b5c0] Cannot reuse HTTP connection for different host: r2---sn-25ge7nsl.googlevideo.com:-1 != r4---sn-25ge7nsl.googlevideo.com:-1
[hls,applehttp @ 0xc06250] keepalive request failed for 'https://r4---sn-25ge7nsl.googlevideo.com/videoplayback/id/DWcJFNfaw9c.2/itag/96/source/yt_live_broadcast/expire/1603923091/ei/M5iZX4_VIMLUW57fiOAO/ip/2a01:cb00:39c:2900:835b:2f40:d2bf:7a5d/requiressl/yes/ratebypass/yes/live/1/goi/160/sgoap/gir%3Dyes%3Bitag%3D140/sgovp/gir%3Dyes%3Bitag%3D137/hls_chunk_host/r4---sn-25ge7nsl.googlevideo.com/playlist_duration/30/manifest_duration/30/vprv/1/playlist_type/DVR/initcwndbps/6610/mh/Ms/mm/44/mn/sn-25ge7nsl/ms/lva/mv/m/mvi/4/pl/30/keepalive/yes/mt/1603901358/disable_polymer/true/sparams/expire,ei,ip,id,itag,source,requiressl,ratebypass,live,goi,sgoap,sgovp,playlist_duration,manifest_duration,vprv,playlist_type/sig/AOq0QJ8wRAIgUQGg9L0Aj_FXl9CqYP0o8t-Dn23Jh0y93xgYGouOHZMCIExicbcOpmWTdWCrUExX8dHFMcPEDXtkXXRplyCl49Y1/lsparams/hls_chunk_host,initcwndbps,mh,mm,mn,ms,mv,mvi,pl/lsig/AG3C_xAwRQIhAKVCR3lS5k-ryagc8fHuvIzMuMDccwvOXmxj8fT7PjUyAiB8D1MwZ0tmEoJidXuttZSKuxCDlc-J5gV43lRScjepXQ%3D%3D/playlist/index.m3u8/sq/3831774/goap/clen%3D81133%3Blmt%3D1603879520260124/govp/clen%3D258451%3Blmt%3D1603879520260122/dur/5.000/file/seg.ts',retrying with new connection: Invalid argument
[https @ 0xf64dd0] Cannot reuse HTTP connection for different host: r2---sn-25ge7nsl.googlevideo.com:-1 != r4---sn-25ge7nsl.googlevideo.com:-1
[hls,pl/lsig/AG3C_xAwRQIhAKVCR3lS5k-ryagc8fHuvIzMuMDccwvOXmxj8fT7PjUyAiB8D1MwZ0tmEoJidXuttZSKuxCDlc-J5gV43lRScjepXQ%3D%3D/playlist/index.m3u8/sq/3831775/goap/clen%3D81296%3Blmt%3D1603879520260134/govp/clen%3D224812%3Blmt%3D1603879520260132/dur/5.000/file/seg.ts',retrying with new connection: Invalid argument
[https @ 0xe48430] Cannot reuse HTTP connection for different host: r2---sn-25ge7nsl.googlevideo.com:-1 != r4---sn-25ge7nsl.googlevideo.com:-1
[hls,pl/lsig/AG3C_xAwRQIhAKVCR3lS5k-ryagc8fHuvIzMuMDccwvOXmxj8fT7PjUyAiB8D1MwZ0tmEoJidXuttZSKuxCDlc-J5gV43lRScjepXQ%3D%3D/playlist/index.m3u8/sq/3831776/goap/clen%3D81030%3Blmt%3D1603879520260144/govp/clen%3D259194%3Blmt%3D1603879520260142/dur/5.000/file/seg.ts',retrying with new connection: Invalid argument
[https @ 0xf82e80] Cannot reuse HTTP connection for different host: r2---sn-25ge7nsl.googlevideo.com:-1 != r4---sn-25ge7nsl.googlevideo.com:-1
[hls,pl/lsig/AG3C_xAwRQIhAKVCR3lS5k-ryagc8fHuvIzMuMDccwvOXmxj8fT7PjUyAiB8D1MwZ0tmEoJidXuttZSKuxCDlc-J5gV43lRScjepXQ%3D%3D/playlist/index.m3u8/sq/3831777/goap/clen%3D81157%3Blmt%3D1603879520260154/govp/clen%3D236880%3Blmt%3D1603879520260152/dur/5.000/file/seg.ts',retrying with new connection: Invalid argument
[https @ 0x112a390] Cannot reuse HTTP connection for different host: r2---sn-25ge7nsl.googlevideo.com:-1 != r4---sn-25ge7nsl.googlevideo.com:-1
[hls,pl/lsig/AG3C_xAwRQIhAKVCR3lS5k-ryagc8fHuvIzMuMDccwvOXmxj8fT7PjUyAiB8D1MwZ0tmEoJidXuttZSKuxCDlc-J5gV43lRScjepXQ%3D%3D/playlist/index.m3u8/sq/3831778/goap/clen%3D81571%3Blmt%3D1603879520260164/govp/clen%3D239140%3Blmt%3D1603879520260162/dur/5.000/file/seg.ts',retrying with new connection: Invalid argument
^Cwebsocket connection is closing.
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。