ISNarch1
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
LKNF01000204 | ND | Natrialbaceae archaeon | Natrialbaceae archaeon Tc-Br11_E2g28 |
DNA section
IS Length : 1675 bp
Ends
Left end : ACGAAACCTCGTTCTGACTCCGTAGTACAGTGATGTTTTTGAGCGCCTGCAAAGGATGCAGGCGCAGTCGATTGTGCCTGTACAGCACAGGTGATCCGAG II struct. : No
Right end : GTCCAGTCCTGAAACCTCAGCAACGCCTCCTGTGTGGGGCCCCGATGCGGAGAGGATGTCACTCCGGGTTGTCTGCCTACGACTGTAGGTAGGCATTCAA II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GTAACAGCTG | TTCT | CACAGGGCGA | TCAA |
DNA sequence
ACGAAACCTCGTTCTGACTCCGTAGTACAGTGATGTTTTTGAGCGCCTGCAAAGGATGCAGGCGCAGTCGATTGTGCCTGTACAGCACAGGTGATCCGAG
ATGCGACACTACCACCAGCGACAGGTGTACCATCAGCCGGAGGTGCGATGACCGATGGCGGTCGAACTGAACGAGATCGAAGAGGCTGAGGTTGAACTGG
AACAGTGGTTGGTTGAGCAGGCCGAGAACGGTATCCCGGAGACCGTCCTCGTAGGGATCCTTCGGGACTACGCAGACGACATCGAACAGCTCGGCTACGT
GCCGCGGATGTGGGGACACACCGACCACTGATTCACTGACTCTTCAGTCCTATGTCCAACGAACGTATCCGGACAGTTGTTGCGATACTCGTCGAGCCAA
CGGCGCACAAGGAACGGAAACTTCGCCGTCTGCAGTCGGCCTACCGCGAGGCTCTCGAAGCCGCCGTCAAGGCCGGCGCGACGACGATGACAGCGGTCAA
CGACATCGTGACGCCGTACGACCTCCCGTATCAGGCGAAAGACGCGTTGAAGCGTCACGTTCCGCAGCTTCTCGAGGACGGCACGCCGGACCTCGACGCG
GCGCAGCCGGTCCGGTTCACCAACCGTGCCGCCACGTTCGATCACTCGGCAGCGCGCACCCACGAGTTCTGCTGGGAAATCCCCCAACCGGGCCGCGGGA
CGAATTTCTGGATTCCGCTCGCAATCAACCCAGCTCAACGAGACTGGTGGGATCAGTTGCTCGCCGGCGACGCCACCGCCGGACAGTTGCAACTCGTCCA
CCAGCCGCGACACGACCGCTGGGAACTTCACGTCCCGCTGACGCTCCCAACCCCCGACACCGATGTCGACCACGATTCCTGTACGCCAATCGGGTTCGAC
GTCGGCGAGGCGATCCTGCTGACCGGCTGTGCGCTCCGGAACGGCCGGCCAGTCGACCCGTTGCTGATCGATGGTGGTCGGGCCCGCGACCTGCATCAGA
CGCTCCAGACGACGCTCCAGCGGCTGCAGGAACGCGACGCCGCCGCGTGGCGGATCGACCAACAGGCCGAATATTTCCGGAACGCGCTCGGCGACGAGAT
CGAGACGGCGACACGCCGGGCCGTCGACTACGCCGCCGGCTTCGAGCAGCCGATGATCGTCTTGGAGGCCCTCGAGTCGCTCCAGGAGGACCTCGACGTC
GGGCCCCATCTGACCCGGCGGCTTCACGCGTGGGCGTTCGCCCGACTACAGACGCGGTTCGCGGACAAGGCTGCCGACGCCGGGATTCCCGTCCGGTACG
TCGACCCGGCGTACACGTCCCAGATCTGCCACGCCTGCGGACACATCGGGACGCGGCCTGCACAGGCGGAGTTCCGTTGCACGAACGATGACTGCTGGGT
GTCGGTCTATCAGGCCGATATTAACGCGGCGGCCAATATCGCCAGCCGCCTCGATCCGTGGGGTGAGAGCTGCCCTTGGGAACCGGCCAGCGATGACACG
CTACGGAGTGGGCGCACCCGTGACAGTGCCACAGGACCTCGGGAGCAGAGCCGATCACAGTGACGACGCTCCCGCGTCCAGTCCTGAAACCTCAGCAACG
CCTCCTGTGTGGGGCCCCGATGCGGAGAGGATGTCACTCCGGGTTGTCTGCCTACGACTGTAGGTAGGCATTCAA
ATGCGACACTACCACCAGCGACAGGTGTACCATCAGCCGGAGGTGCGATGACCGATGGCGGTCGAACTGAACGAGATCGAAGAGGCTGAGGTTGAACTGG
AACAGTGGTTGGTTGAGCAGGCCGAGAACGGTATCCCGGAGACCGTCCTCGTAGGGATCCTTCGGGACTACGCAGACGACATCGAACAGCTCGGCTACGT
GCCGCGGATGTGGGGACACACCGACCACTGATTCACTGACTCTTCAGTCCTATGTCCAACGAACGTATCCGGACAGTTGTTGCGATACTCGTCGAGCCAA
CGGCGCACAAGGAACGGAAACTTCGCCGTCTGCAGTCGGCCTACCGCGAGGCTCTCGAAGCCGCCGTCAAGGCCGGCGCGACGACGATGACAGCGGTCAA
CGACATCGTGACGCCGTACGACCTCCCGTATCAGGCGAAAGACGCGTTGAAGCGTCACGTTCCGCAGCTTCTCGAGGACGGCACGCCGGACCTCGACGCG
GCGCAGCCGGTCCGGTTCACCAACCGTGCCGCCACGTTCGATCACTCGGCAGCGCGCACCCACGAGTTCTGCTGGGAAATCCCCCAACCGGGCCGCGGGA
CGAATTTCTGGATTCCGCTCGCAATCAACCCAGCTCAACGAGACTGGTGGGATCAGTTGCTCGCCGGCGACGCCACCGCCGGACAGTTGCAACTCGTCCA
CCAGCCGCGACACGACCGCTGGGAACTTCACGTCCCGCTGACGCTCCCAACCCCCGACACCGATGTCGACCACGATTCCTGTACGCCAATCGGGTTCGAC
GTCGGCGAGGCGATCCTGCTGACCGGCTGTGCGCTCCGGAACGGCCGGCCAGTCGACCCGTTGCTGATCGATGGTGGTCGGGCCCGCGACCTGCATCAGA
CGCTCCAGACGACGCTCCAGCGGCTGCAGGAACGCGACGCCGCCGCGTGGCGGATCGACCAACAGGCCGAATATTTCCGGAACGCGCTCGGCGACGAGAT
CGAGACGGCGACACGCCGGGCCGTCGACTACGCCGCCGGCTTCGAGCAGCCGATGATCGTCTTGGAGGCCCTCGAGTCGCTCCAGGAGGACCTCGACGTC
GGGCCCCATCTGACCCGGCGGCTTCACGCGTGGGCGTTCGCCCGACTACAGACGCGGTTCGCGGACAAGGCTGCCGACGCCGGGATTCCCGTCCGGTACG
TCGACCCGGCGTACACGTCCCAGATCTGCCACGCCTGCGGACACATCGGGACGCGGCCTGCACAGGCGGAGTTCCGTTGCACGAACGATGACTGCTGGGT
GTCGGTCTATCAGGCCGATATTAACGCGGCGGCCAATATCGCCAGCCGCCTCGATCCGTGGGGTGAGAGCTGCCCTTGGGAACCGGCCAGCGATGACACG
CTACGGAGTGGGCGCACCCGTGACAGTGCCACAGGACCTCGGGAGCAGAGCCGATCACAGTGACGACGCTCCCGCGTCCAGTCCTGAAACCTCAGCAACG
CCTCCTGTGTGGGGCCCCGATGCGGAGAGGATGTCACTCCGGGTTGTCTGCCTACGACTGTAGGTAGGCATTCAA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
175 bp | 58 aa | 155 | 329 | + | No |
Annotation : conserved hypothetical proteinDescription :
ORF sequence :
MAVELNEIEEAEVELEQWLVEQAENGIPETVLVGILRDYADDIEQLGYVPRMWGHTDH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1212 bp | 403 aa | 352 | 1563 | + | No |
AG : TnpB
ORF sequence :
MSNERIRTVVAILVEPTAHKERKLRRLQSAYREALEAAVKAGATTMTAVNDIVTPYDLPYQAKDALKRHVPQLLEDGTPDLDAAQPVRFTNRAATFDHSA
ARTHEFCWEIPQPGRGTNFWIPLAINPAQRDWWDQLLAGDATAGQLQLVHQPRHDRWELHVPLTLPTPDTDVDHDSCTPIGFDVGEAILLTGCALRNGRP
VDPLLIDGGRARDLHQTLQTTLQRLQERDAAAWRIDQQAEYFRNALGDEIETATRRAVDYAAGFEQPMIVLEALESLQEDLDVGPHLTRRLHAWAFARLQ
TRFADKAADAGIPVRYVDPAYTSQICHACGHIGTRPAQAEFRCTNDDCWVSVYQADINAAANIASRLDPWGESCPWEPASDDTLRSGRTRDSATGPREQS
RSQ
ARTHEFCWEIPQPGRGTNFWIPLAINPAQRDWWDQLLAGDATAGQLQLVHQPRHDRWELHVPLTLPTPDTDVDHDSCTPIGFDVGEAILLTGCALRNGRP
VDPLLIDGGRARDLHQTLQTTLQRLQERDAAAWRIDQQAEYFRNALGDEIETATRRAVDYAAGFEQPMIVLEALESLQEDLDVGPHLTRRLHAWAFARLQ
TRFADKAADAGIPVRYVDPAYTSQICHACGHIGTRPAQAEFRCTNDDCWVSVYQADINAAANIASRLDPWGESCPWEPASDDTLRSGRTRDSATGPREQS
RSQ
Blast result :
Comments
ISNarch1 is 70% aa similar (TnpB) to ISH12.
References
1] Friedhelm Pfeiffer (2019) Direct submission.
2] Vavourakis,C.D., Ghai,R., Rodriguez-Valera,F., Sorokin,D.Y., Tringe,S.G., Hugenholtz,P. and Muyzer,G. (2015) Direct GenBank submission.
2] Vavourakis,C.D., Ghai,R., Rodriguez-Valera,F., Sorokin,D.Y., Tringe,S.G., Hugenholtz,P. and Muyzer,G. (2015) Direct GenBank submission.