About Sequence Name Reformatter

Sequence Name Reformatter developed in Dr. Mullins Lab at University of Washington is a web interface for users to extract information they want in sequence name and output a sequence fasta file with newly formatted sequence names only containing the extracted information.

Input

  • A sequence fasta file.

    Example:
    >gi|672892375|gb|AIK03017.1| nef protein [Human immunodeficiency virus 1]
    MGGKWSKSSLVGWPTVRERMKRTEPAADGVGAVSRDLEKHGAVTSSNTAATNADCAWLEAQEDEEVGFPV
    RPQVPLRPMTYKGALDLSHFLKEKGGLEGLIYSQRRQDILDLWVYHTQGYFPDWQNYTPGPGTRYPLTFG
    WCFKLVPVEPQKVEEANEGENNRLLHPMSLHGMDDPEREVLEWRFDSRLAFHHMARELHPEYYKDC
    >gi|672892373|gb|AIK03016.1| nef protein [Human immunodeficiency virus 1]
    MGGKWSKRVGWSTVRERMRRAEPAAVGVGAVSQDLEKHGAITSSNTAANNADCAWLEAQEKEEVGFPVRP
    QVPLRPMTYKAAIDLSHFLKEEGGLEGLIHSQQRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGWC
    FKLVPVEPEKVEEANEGENNSLLHPMSQHGMEDPEKEVLEWRFDSRLAFHHMARELHPEYYKNC
    >gi|672892371|gb|AIK03015.1| nef protein [Human immunodeficiency virus 1]
    MGGKWSKXSXVGWPAVRZRXXRAEPAAXGVGAVSRXLENRGAXTSSNTXANNAACAWLEAQEEEEVGFPV
    RPQVPLRPMTYKAXXDJSHXLXXXGGLXGJVWSQRRQDILDLWXYHTQGYFPDWQNYTPGPGTRFPLTFG
    WCFKLVPLDPEQVEKANEGENNSLLHPMSQHGTDDPEKEVLIWKFDSRLAFHHMARELHPEYYKDC
    >gi|672892369|gb|AIK03014.1| nef protein [Human immunodeficiency virus 1]
    MGGKWSKRTSGWSTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTATNNADCAWLEAQEEEEVGFPVR
    PQVPLRPMTYKGAVDLSHFLKEKGGLEGLIHSQRRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGW
    CFKLVPVEPEKIEEANEGENNSLLHPMSLHGIEDPEREVLVWKFDSRLAFHHMARELHPEYYKNC
    >gi|672892367|gb|AIK03013.1| nef protein [Human immunodeficiency virus 1]
    MGGKWSKRSGAGWPTVRERMKRAEPAAEGVGAVSRDLEKHGAITSSNTPTNNAACAWLEAQEEEEVGFPV
    RPQVPLRPMTYKGAVDLSHFLKEKGGLEGLVHSQKRQDILDLWVYNTQGYFPDWQNYTPGPGIRYPLTFG
    WCFKLVPAEPEQVEEANEGENNSLLHPMSLHGMEDPEREVLVWKFDSRLAFHHMARELHPEYYKDC
    

  • Delimiter(s) that separate information in sequence name. If there are more than one delimiter, please put all delimiter together in the text box. e.g. "-/,".

    Example 1: If choosing "|" as delimiter for the example input file, all information in sequence name separated by the delimiter are listed in the left box. If the fourth item "AIK03017.1" in the left box was picked and moved to the right box, by submitting you will get sequence fasta file with new sequence names (output example 1).


    Example 2: If choosing "| " as delimiter for the example input file, all information in sequence name separated by the delimiter are listed in the left box. If the fourth and fifth items ("AIK03017.1" ang "nef") in the left box were picked and moved to the right box, by submitting you will get sequence fasta file with new sequence names (output example 2).

    From the interface above user can select information separated by delimiter(s) from the first sequence name in the left box and move them into the right box.

Output

  • A sequence fasta file with new sequence names only containing the extracted information.

    Example 1:
    >AIK03017.1
    MGGKWSKSSLVGWPTVRERMKRTEPAADGVGAVSRDLEKHGAVTSSNTAATNADCAWLEAQEDEEVGFPV
    RPQVPLRPMTYKGALDLSHFLKEKGGLEGLIYSQRRQDILDLWVYHTQGYFPDWQNYTPGPGTRYPLTFG
    WCFKLVPVEPQKVEEANEGENNRLLHPMSLHGMDDPEREVLEWRFDSRLAFHHMARELHPEYYKDC
    >AIK03016.1
    MGGKWSKRVGWSTVRERMRRAEPAAVGVGAVSQDLEKHGAITSSNTAANNADCAWLEAQEKEEVGFPVRP
    QVPLRPMTYKAAIDLSHFLKEEGGLEGLIHSQQRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGWC
    FKLVPVEPEKVEEANEGENNSLLHPMSQHGMEDPEKEVLEWRFDSRLAFHHMARELHPEYYKNC
    >AIK03015.1
    MGGKWSKXSXVGWPAVRZRXXRAEPAAXGVGAVSRXLENRGAXTSSNTXANNAACAWLEAQEEEEVGFPV
    RPQVPLRPMTYKAXXDJSHXLXXXGGLXGJVWSQRRQDILDLWXYHTQGYFPDWQNYTPGPGTRFPLTFG
    WCFKLVPLDPEQVEKANEGENNSLLHPMSQHGTDDPEKEVLIWKFDSRLAFHHMARELHPEYYKDC
    >AIK03014.1
    MGGKWSKRTSGWSTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTATNNADCAWLEAQEEEEVGFPVR
    PQVPLRPMTYKGAVDLSHFLKEKGGLEGLIHSQRRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGW
    CFKLVPVEPEKIEEANEGENNSLLHPMSLHGIEDPEREVLVWKFDSRLAFHHMARELHPEYYKNC
    >AIK03013.1
    MGGKWSKRSGAGWPTVRERMKRAEPAAEGVGAVSRDLEKHGAITSSNTPTNNAACAWLEAQEEEEVGFPV
    RPQVPLRPMTYKGAVDLSHFLKEKGGLEGLVHSQKRQDILDLWVYNTQGYFPDWQNYTPGPGIRYPLTFG
    WCFKLVPAEPEQVEEANEGENNSLLHPMSLHGMEDPEREVLVWKFDSRLAFHHMARELHPEYYKDC
    
    Example 2:
    >AIK03017.1 nef
    MGGKWSKSSLVGWPTVRERMKRTEPAADGVGAVSRDLEKHGAVTSSNTAATNADCAWLEAQEDEEVGFPV
    RPQVPLRPMTYKGALDLSHFLKEKGGLEGLIYSQRRQDILDLWVYHTQGYFPDWQNYTPGPGTRYPLTFG
    WCFKLVPVEPQKVEEANEGENNRLLHPMSLHGMDDPEREVLEWRFDSRLAFHHMARELHPEYYKDC
    >AIK03016.1 nef
    MGGKWSKRVGWSTVRERMRRAEPAAVGVGAVSQDLEKHGAITSSNTAANNADCAWLEAQEKEEVGFPVRP
    QVPLRPMTYKAAIDLSHFLKEEGGLEGLIHSQQRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGWC
    FKLVPVEPEKVEEANEGENNSLLHPMSQHGMEDPEKEVLEWRFDSRLAFHHMARELHPEYYKNC
    >AIK03015.1 nef
    MGGKWSKXSXVGWPAVRZRXXRAEPAAXGVGAVSRXLENRGAXTSSNTXANNAACAWLEAQEEEEVGFPV
    RPQVPLRPMTYKAXXDJSHXLXXXGGLXGJVWSQRRQDILDLWXYHTQGYFPDWQNYTPGPGTRFPLTFG
    WCFKLVPLDPEQVEKANEGENNSLLHPMSQHGTDDPEKEVLIWKFDSRLAFHHMARELHPEYYKDC
    >AIK03014.1 nef
    MGGKWSKRTSGWSTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTATNNADCAWLEAQEEEEVGFPVR
    PQVPLRPMTYKGAVDLSHFLKEKGGLEGLIHSQRRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGW
    CFKLVPVEPEKIEEANEGENNSLLHPMSLHGIEDPEREVLVWKFDSRLAFHHMARELHPEYYKNC
    >AIK03013.1 nef
    MGGKWSKRSGAGWPTVRERMKRAEPAAEGVGAVSRDLEKHGAITSSNTPTNNAACAWLEAQEEEEVGFPV
    RPQVPLRPMTYKGAVDLSHFLKEKGGLEGLVHSQKRQDILDLWVYNTQGYFPDWQNYTPGPGIRYPLTFG
    WCFKLVPAEPEQVEEANEGENNSLLHPMSLHGMEDPEREVLVWKFDSRLAFHHMARELHPEYYKDC