ISArsp15
- Family IS30
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| MH067967 | ND | Arthrobacter sp. | Arthrobacter sp. Arthrobacter sp. ANT_H19B pA19BH1 |
DNA section
IS Length : 1862 bp
Ends
IR Length : 19/26
IRL : GGATTCTATTGATCGAAGCAACGCCTATTTCTAGGTGGTTTGCAATGTTT
IRR : GGATTTCAGTGGTCGACGCAACGGATGAGTGTTTTTCGGGGGCGTGTGCC
Insertion site
| Left flank | Direct repeat | Right flank | DR Length |
|---|---|---|---|
| GGATTCTATTGATCGAAGCAACG | GACCCACGCG | GGATTTCAGTGGTCGACGCAACG | 10 |
DNA sequence
GGATTCTATTGATCGAAGCAACGCCTATTTCTAGGTGGTTTGCAATGTTTCGTGCTTCTTCGTCAGCAGCAGGCGTCCGGTTTTTCGCCTCAATCAAGGA
AGGGCGCGGTCTCAAACCCTCCGCCCGCGACGCCGGCATCGACAAGGAGGTCGGGTATCGCTGGCTTCGCGAGAAGTACCTGCACCTGCGCCGGGCCGGC
AAGACGCCCGCCGAGACAACCGCCGTACTCGGGTTCACCACATCCCGGTTGCTGGCTTGGGAGGCCGACGTCGATCGCAGTGATGATCGGCACCATCTGC
GTGTCGACCGCGACGAAGAGGCCGCGTTCTGGGCGTGCTTCGAGGACAGCCAGGGCACGAAGGAAGCAGTGATAGCTGCGGGCGTGAGCCGGTCGACCGG
GTATCGGTGGATCGACAAACGCTTCAACCAGCTGCGGCGCGCGGGCGTCACTCTCCGGCGATGCCAAACCCAGTTGCAGCTCACGGATGACCGCACACAG
AGCCTCGAAGAACGACGGCTGCTCCGGCTGCGAAGAGACGCGGCTGCCGCCGCGGCCGCCCGACGTGAAGCCGCGCGGTCCTCCGGGCGCTACGCCGACC
GGGTCCTGGTGGGCGAGCTAACAGCGGGCCGGCAGCGGCTGAGGCTGCGCAATGAGAGGTATTGGCAGCTGATGCGTGACGGGCTGAGCAACGCGGAGGC
TTGCAGACTATTGGGCATGCATCGAGCCTCTGGCACCCAGATTCGTCAAGCCACCAAGTACCAGATCCCTCGTCTTCCCGGCCCGCGGGAAACACTCGGG
CGCTACCTGGACGCGCGCGAGCGGCTGCAGATCGCGGACTTGTTGCGGCTGGGGCACTCAATGCGCCAGATCGCGGCTGAACTGGGACGACAACCGTCGA
CCATCTCTCGGGAGCTGGGCCGGCACCGAAAAGCCGGGGGTCACTACCTGCCCGCGACGGCCGACCACGACGCGCGCCTGCAACGCGCCCGACCCAAAAT
GCCCAAGCTGGTTGCCAGCGCGAAACTGCGGCTTCTGGTGCAGCGAAAACTGAACCGGTGCTGGTCACCAGACGAGATCTGCGGCTGGATGAGGAAGGAG
TTCCCTGATGATCAGACGATGCGGCTCTGCCCGGAGACGATCTACCGGGCTCTGCTGCTCCGCGAGGGCCAGGGCCTGCACAAACGCTTCTCCGTGAAGC
TGCGCACCGGTCGGCGCATCCGCAAGAGCCGCTGGCGCCGACGAATCGGACGCGGATCAGCGATCATCAACATGACGATGATCGATCAGCGCCCCGCCGA
GGTCGAAGACCGGGAACAGGCCGGCCACTGGGAAGGCGACCTCATCGTCGGTCTCGGATCCGTCTCCGCGATGATGACTCTCCGCGAACGAAAGACCCAG
TACGGCATCATCGTGAACCTGCCCCTGGACCACACCGCCGCGAGCGTCAACGCGGCCGCCATCGCTGCGTTCGCAACCCTGCCGCCGCACCTGAAGCGAA
CCCTGACCTGGGACCAGGGAGTCGAGATGGCCTGGCACGAGAAGCTCACCCTCGCCACCGGAGTCCCGGTCTACTTCGCCGAACGCTCCAGCCCCTGGCA
GCGCGGCGCCAACGAGAACTTCAACGGGCTGGCCCGCCAGTACTTCCCCAAGGGCACCAACCTCGCCGTTCACAGCAGCGAGCACGTCGCCCATGTCATG
CGCGAGCTCAACGAACGGCCTCGGAAAACCCTGGGTTACGACACCCCCGCAGCCCGCCTACAGGCCGAACGCGACGCGCCGTCCGCCGCCGTGCGATAGC
CTCCAAACAGCGGGCACACGCCCCCGAAAAACACTCATCCGTTGCGTCGACCACTGAAATCC
AGGGCGCGGTCTCAAACCCTCCGCCCGCGACGCCGGCATCGACAAGGAGGTCGGGTATCGCTGGCTTCGCGAGAAGTACCTGCACCTGCGCCGGGCCGGC
AAGACGCCCGCCGAGACAACCGCCGTACTCGGGTTCACCACATCCCGGTTGCTGGCTTGGGAGGCCGACGTCGATCGCAGTGATGATCGGCACCATCTGC
GTGTCGACCGCGACGAAGAGGCCGCGTTCTGGGCGTGCTTCGAGGACAGCCAGGGCACGAAGGAAGCAGTGATAGCTGCGGGCGTGAGCCGGTCGACCGG
GTATCGGTGGATCGACAAACGCTTCAACCAGCTGCGGCGCGCGGGCGTCACTCTCCGGCGATGCCAAACCCAGTTGCAGCTCACGGATGACCGCACACAG
AGCCTCGAAGAACGACGGCTGCTCCGGCTGCGAAGAGACGCGGCTGCCGCCGCGGCCGCCCGACGTGAAGCCGCGCGGTCCTCCGGGCGCTACGCCGACC
GGGTCCTGGTGGGCGAGCTAACAGCGGGCCGGCAGCGGCTGAGGCTGCGCAATGAGAGGTATTGGCAGCTGATGCGTGACGGGCTGAGCAACGCGGAGGC
TTGCAGACTATTGGGCATGCATCGAGCCTCTGGCACCCAGATTCGTCAAGCCACCAAGTACCAGATCCCTCGTCTTCCCGGCCCGCGGGAAACACTCGGG
CGCTACCTGGACGCGCGCGAGCGGCTGCAGATCGCGGACTTGTTGCGGCTGGGGCACTCAATGCGCCAGATCGCGGCTGAACTGGGACGACAACCGTCGA
CCATCTCTCGGGAGCTGGGCCGGCACCGAAAAGCCGGGGGTCACTACCTGCCCGCGACGGCCGACCACGACGCGCGCCTGCAACGCGCCCGACCCAAAAT
GCCCAAGCTGGTTGCCAGCGCGAAACTGCGGCTTCTGGTGCAGCGAAAACTGAACCGGTGCTGGTCACCAGACGAGATCTGCGGCTGGATGAGGAAGGAG
TTCCCTGATGATCAGACGATGCGGCTCTGCCCGGAGACGATCTACCGGGCTCTGCTGCTCCGCGAGGGCCAGGGCCTGCACAAACGCTTCTCCGTGAAGC
TGCGCACCGGTCGGCGCATCCGCAAGAGCCGCTGGCGCCGACGAATCGGACGCGGATCAGCGATCATCAACATGACGATGATCGATCAGCGCCCCGCCGA
GGTCGAAGACCGGGAACAGGCCGGCCACTGGGAAGGCGACCTCATCGTCGGTCTCGGATCCGTCTCCGCGATGATGACTCTCCGCGAACGAAAGACCCAG
TACGGCATCATCGTGAACCTGCCCCTGGACCACACCGCCGCGAGCGTCAACGCGGCCGCCATCGCTGCGTTCGCAACCCTGCCGCCGCACCTGAAGCGAA
CCCTGACCTGGGACCAGGGAGTCGAGATGGCCTGGCACGAGAAGCTCACCCTCGCCACCGGAGTCCCGGTCTACTTCGCCGAACGCTCCAGCCCCTGGCA
GCGCGGCGCCAACGAGAACTTCAACGGGCTGGCCCGCCAGTACTTCCCCAAGGGCACCAACCTCGCCGTTCACAGCAGCGAGCACGTCGCCCATGTCATG
CGCGAGCTCAACGAACGGCCTCGGAAAACCCTGGGTTACGACACCCCCGCAGCCCGCCTACAGGCCGAACGCGACGCGCCGTCCGCCGCCGTGCGATAGC
CTCCAAACAGCGGGCACACGCCCCCGAAAAACACTCATCCGTTGCGTCGACCACTGAAATCC
Protein section
ORF number : 1
ORF 1
| Length | Begin | End | Strand | Fusion ORF | |
|---|---|---|---|---|---|
| 1755 bp | 584 aa | 45 | 1799 | + | No |
Chemistry : DDE
ORF sequence :
MFRASSSAAGVRFFASIKEGRGLKPSARDAGIDKEVGYRWLREKYLHLRRAGKTPAETTAVLGFTTSRLLAWEADVDRSDDRHHLRVDRDEEAAFWACFE
DSQGTKEAVIAAGVSRSTGYRWIDKRFNQLRRAGVTLRRCQTQLQLTDDRTQSLEERRLLRLRRDAAAAAAARREAARSSGRYADRVLVGELTAGRQRLR
LRNERYWQLMRDGLSNAEACRLLGMHRASGTQIRQATKYQIPRLPGPRETLGRYLDARERLQIADLLRLGHSMRQIAAELGRQPSTISRELGRHRKAGGH
YLPATADHDARLQRARPKMPKLVASAKLRLLVQRKLNRCWSPDEICGWMRKEFPDDQTMRLCPETIYRALLLREGQGLHKRFSVKLRTGRRIRKSRWRRR
IGRGSAIINMTMIDQRPAEVEDREQAGHWEGDLIVGLGSVSAMMTLRERKTQYGIIVNLPLDHTAASVNAAAIAAFATLPPHLKRTLTWDQGVEMAWHEK
LTLATGVPVYFAERSSPWQRGANENFNGLARQYFPKGTNLAVHSSEHVAHVMRELNERPRKTLGYDTPAARLQAERDAPSAAVR
DSQGTKEAVIAAGVSRSTGYRWIDKRFNQLRRAGVTLRRCQTQLQLTDDRTQSLEERRLLRLRRDAAAAAAARREAARSSGRYADRVLVGELTAGRQRLR
LRNERYWQLMRDGLSNAEACRLLGMHRASGTQIRQATKYQIPRLPGPRETLGRYLDARERLQIADLLRLGHSMRQIAAELGRQPSTISRELGRHRKAGGH
YLPATADHDARLQRARPKMPKLVASAKLRLLVQRKLNRCWSPDEICGWMRKEFPDDQTMRLCPETIYRALLLREGQGLHKRFSVKLRTGRRIRKSRWRRR
IGRGSAIINMTMIDQRPAEVEDREQAGHWEGDLIVGLGSVSAMMTLRERKTQYGIIVNLPLDHTAASVNAAAIAAFATLPPHLKRTLTWDQGVEMAWHEK
LTLATGVPVYFAERSSPWQRGANENFNGLARQYFPKGTNLAVHSSEHVAHVMRELNERPRKTLGYDTPAARLQAERDAPSAAVR
Blast result :
Comments
ISArsp15 is 63% aa similar to ISLxc3.
References
1] Romaniuk, K. (2018) Direct submission.