删除 fasta 文件中的单词列表

问题描述 投票:0回答:0

我有一个这样的fasta文件:

solper_LA1333_DN23684_c0_g1_i1.p1>solper_LA1333_DN23684_c0_g1_i4.p1>solper_LA1333_DN23684_c0_g1_i6.p1>solper_LA1333_DN23684_c0_g1_i8.p1 HLPDRHSNLVTDEEVVGFENKAEELIDYLIRGTNELDVVPIVGMGGQGKTTIARKLYYNDIIVSRFDVRAWCIISQTYNRRELLQDIFSQVTGVNDNGATVDDVADMLRRKLMGKRYLIVLDDMWDCMVWDDLRLSFPDSGNRSRIVVTTRLEEVGKQVKYHTDPYSLPFLTTEESCQLLQKKVFQKDDCPP ELQDVSQAVAEKCKGLLSLVVVLVAGIIKKRKMEESWWNEVKDALFDYIDSEFEEYSLATMQLSFDNLPHCLKPCLLYMGMFSEDARIPASTLISLWIAEGFVENTESGILMEEAAEGYLMDLISSNLVMLSKRSYKGKVKYCQVHDVVHHFCLEKSREAKFM solper_LA1333_DN10584_c0_g1_i1.p1 HSSRKSTIEEKTVVGMKDDPNSILNCINAQTKELIVISVVGMGGIGKTTLASKVFDDSMIRSQFDKHAWVTISQDYNKRQMLLEIVSSITGINQENMSNDKLLDTVYKGLKGRRFLIVIDDLWSTEALDLMRRIFPNDHNKSRIILTTRLKTVADYASSPDFPPHDMSFLSLDDSWNLFTERLFKKDPCPPQLEVIGK HIIQ solper_LA1333_DN17995_c0_g1_i1.p1>solper_LA1333_DN17995_c0_g1_i10.p1>solper_LA1333_DN17995_c0_g1_i12.p1>solper_LA1333_DN17995_c0_g1_i13.p1>solper _LA1333_DN17995_c0_g1_i14.p1>solper_LA1333_DN17995_c0_g1_i4.p1>solper_LA1333_DN17995_c0_g1_i5.p1>solper_LA1333_DN17995_c0_g1_i6.p1 HSSRNVAKLNPENIVVGLDDDLERIIRRLKGPTLSREIIPILGMGGIGKTTLARKAFDDFETRNRFDIHIWVTVSQEYRIRGMLLDILRSTSEETNESNIDRLMDMIYKKLKGWRYLVVMDDIWSSEVWDLMTRTFPDDNNGSRIILTSRQEEVASHADPDSNPHKMNLLNSDNSWKLIRDRVFGVEHACPPELEDIGEQIAQRC QGLPLALLVVAGHLSKISRTRESWNDVSKSVSKVVADESDICLGVLAMSYNYLPDHLKPCFLYMGVFPEDSVVNIVRLINLWISEGFISDELVGRDFMEDLVSRNLVMVRNRSFNGEAKTCGVHDLIRDLILREAEKEKFL

另一个包含我想删除的 ID 的文件

我已经尝试了这个问题“https://stackoverflow.com/questions/55636069/remove-multiple-sequences-from-fasta-file”中建议的一些解决方案,但它们没有用。我认为这是因为我的文件有多个 ID 连接在同一个 ID 中。

linux filter sequence fasta
© www.soinside.com 2019 - 2024. All rights reserved.