ISNarch5
- Family IS66
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| NZ_CP050695.1 | ND | Natrialbaceae archaeon | Natrialbaceae archaeon XQ-INN 246 strain 2447 |
DNA section
IS Length : 2312 bp
Ends
IR Length : 17/24
IRL : GTAACCGCTCCACGAACCCCATCTGTTGAGCGGATTGTGGTTCCGGGACA
IRR : GTAAGCGATCCGGGAACTGCACCTACTCGGTGGGTTTGAGCGTGTATAGG
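The "17/24" figure can be checked directly from the two listed ends. The sketch below assumes that "17/24" means 17 identical positions within the first 24 bases, and that IRR is written 5'→3' reading inward from the right end (it is the reverse complement of the last bases of the DNA sequence in this record). The function name `ir_identity` is hypothetical and not part of the record.

```python
# Terminal inverted repeats copied from the record.
# Assumption: both are written reading inward from their own end of the element.
IRL = "GTAACCGCTCCACGAACCCCATCTGTTGAGCGGATTGTGGTTCCGGGACA"
IRR = "GTAAGCGATCCGGGAACTGCACCTACTCGGTGGGTTTGAGCGTGTATAGG"


def ir_identity(left: str, right: str, span: int) -> int:
    """Count positions with the same base within the first `span` bases of each end."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))


matches = ir_identity(IRL, IRR, 24)
print(f"IR Length : {matches}/24")
```

Under these assumptions the count agrees with the IR Length field of the record.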
Insertion site
| Left flank | Direct repeat | Right flank | DR Length |
|---|---|---|---|
| CTGGAGCAGC | GGAGCAGC | GGAGCAGCGC | 8 |
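The insertion-site row can be sanity-checked in a few lines: the direct repeat should close the left flank and open the right flank, and its length should match the DR Length column. This is a minimal sketch; the helper name `dr_is_consistent` is hypothetical.

```python
# Values copied from the insertion site table.
left_flank = "CTGGAGCAGC"
direct_repeat = "GGAGCAGC"
right_flank = "GGAGCAGCGC"
dr_length = 8


def dr_is_consistent(left: str, dr: str, right: str, length: int) -> bool:
    """True if the repeat ends the left flank, starts the right flank, and has the stated length."""
    return left.endswith(dr) and right.startswith(dr) and len(dr) == length


consistent = dr_is_consistent(left_flank, direct_repeat, right_flank, dr_length)
print(consistent)
```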
DNA sequence
GTAACCGCTCCACGAACCCCATCTGTTGAGCGGATTGTGGTTCCGGGACACTGAGCCTTCCGGATTGATGGGAGGCTTGGAGCATGTGGTTATCCCCAGT
ACGCGCCCGCTGGCTGGGCTATCTGTATCACGCGCACGCCAGCGGCCTGACGTTGAGCGAGTATGCCGCCCGGCAGGATGTCTCGCTTGCCGAGCTGATG
GACTGGGAGCGCCGGCTGCGTGAGGCCGGGATTGCGGTGCCGGAGCGCCACCGCCCGGCACGGTTCGTCGCCGTGGAGGTGGTGGCATGATCCGGCCCGG
GAGCGATGTCGCTGTTTATCTGTGCCGCGAGCCCGTGGATATGCGTAAGTCGATGGACGGATTGTCTTTGCTCGTCCAGGAGGTCATGGAGTGCGACCCG
TTCACCGCGGCGGTGTTCGTGTTCTGCAACCGGGCGCGGGATAAGGTGAAGATCCTATTCTGGGAGCGCAACGGCTTCGTGGTCTGGTATAAGCGCCTCG
AGCAGGAGCGGTTCAAGTGGCCGGAGTGCGGCGAAGGGGATCGGCTCACGCTCTCGGGCCAGGAGCTCAACTGGCTGCTTGACGGCATCGATATCACCCG
CATGCAGCCGCACAAAGCGCTGCATTTTCAGTCGGTTGGATGAAATTTTTGCTCGCCGCACCGGTGTCGTTTTGGTACAATTGTCGGCATGAAACGGGCC
GATACCAACACGTTGCCATCCAGTTCCGACCTCCAGCGCGAGCTCGACGAGCAGCGCGCTCTGGTCGAACGCCTCCAGGCCCAGCTCGCCGAGAAGGAGG
CCGCGTGGGCGGCGGAGAAGCGCTCGCTGTTCGAGCAGATCCGGCTGCTGCTCGATAACCGCTTCGGCCCCTCCACCGAGAAGTACAGCATCAAGCAGCA
GGACATGTTCTTCGATGAGGCCGAGAGCCTGGTGGAAGAGCCCGCCGAGTCTGATGAGACCGACGAGGCGGATGAGGACAACCAGCCAGTCCGCGGTAAG
CGCCGCCGTCGGGGCGGCCGCGCCCCGCTGCCACCGGAGTTGCCCCGCGTGGACATCGTCCACGACCTCCCCGAGGACGAACAGCAGTGCGCCTGTGGCT
GCGGTGCGCTCACCCGCATCGGCGAAGAAGTCACCGAGCAGCTCGACATCATCCCGGCCCAGATCCAGGTGCTGCGCCATGTGCGCATCAAGTACGCCTG
CCGGGCCTGCGAAGACGGCGTCCAGATCGCCGATCTGCCACCGCAGCCGCTGCCAAAGAGCAACGCGAGCCCCGGGCTGCTCGCTTATATCGCCACCGCC
AAGTACCAGGATGCGCTGCCACTGTACCGCCAGGAGCAGGTGTTCAAACGACTGGGCCTGGAGTTGCCACGGAACACGCTCGCCCGCTGGATGGTAGACA
TGGGCGCGCTGCTCGCCCCACTGGCCGAGCGCATGCGCGCCCATCTGCACAATGCGGAACTCATCCACATGGACGAGACCACCGTGCAGGTGAACACCGA
GCCCGGGCGGGCCGCCTCCAGCACCTCGTACATGTGGGTCCAGCGCGGCGGACCACCCGGTGCCGAGGTGGTGCTGTTCGACTACGATCCCAGCCGCTCG
GGCCAGGTGCCGCGCCGTCTGCTGGACGACTACAACGGCATCCTGCTCTCTGATGGCTACGAGGGCTATGCCCAGGTGGTGCGCGACAATGCGATCACTC
ACGCTGGCTGCTGGGCGCATGCGCGCCGCAAGTTCGTTGAGGCCCAGAAAGCCCAGCCCAAGGGCAAGACCGGCAAGGCCGACCGAGCTCTGGCGTCCAT
CGGCAAACTCTACCGTGTGGAGCGCGAGGCACAGGGTCTGCCCGTTGAGGAGCGTGAACGCCTGCGTGCCACGCACAGCCGGCCGCTGATCGAGGATCTG
CGCCAGTGGCTTGACCAGTCCCTGGAGAAGGTGCCGCCGAAGAGCGCCATCGGCAAGGCCGTGCACTACCTCAACAGCCAATGGCCCCGGCTCATCCGCT
TCCTGGAGGATGGCCGCATCCCGCTGGACAACAACCCTGCGGAGAACGCCATCCGGCCGTTCGTGGTGGGGCGCAAGAACTGGCTGTTCAGCCAGACGCC
GAGGGGTGCCCACGCCAGCGCAGCGATCTACAGCGTCATCGAGACGGCCAAGATCAACGGCCTGGAGCCCTACGCGTACCTGCTCGAGGTGTTAAAGAAC
CTGCCGGCGGCGGCCAGCGATGAGGCCATCGACGGCCTGCTGCCGTGGCATCAGGATGAGAGCCTATACACGCTCAAACCCACCGAGTAGGTGCAGTTCC
CGGATCGCTTAC
Protein section
ORF number : 3
ORF 1
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 207 | 68 | 84 | 290 | + | No |
AG : IS66 TnpA
ORF sequence :
MWLSPVRARWLGYLYHAHASGLTLSEYAARQDVSLAELMDWERRLREAGIAVPERHRPARFVAVEVVA
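The ORF 1 coordinates and lengths are related by simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions on the IS and that the nucleotide length includes the stop codon; the function name `orf_lengths` is hypothetical.

```python
# ORF 1 values copied from the record.
ORF1_BEGIN, ORF1_END = 84, 290
ORF1_PROTEIN = "MWLSPVRARWLGYLYHAHASGLTLSEYAARQDVSLAELMDWERRLREAGIAVPERHRPARFVAVEVVA"


def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, amino acid length) for 1-based inclusive coordinates.

    Assumption: the coordinates span the stop codon, so the protein is one
    residue shorter than the number of codons.
    """
    nt = end - begin + 1
    if nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return nt, nt // 3 - 1


nt_len, aa_len = orf_lengths(ORF1_BEGIN, ORF1_END)
print(nt_len, aa_len, len(ORF1_PROTEIN))
```

Under these assumptions the computed values agree with the 207 bp and 68 aa listed for ORF 1, and with the length of the ORF sequence above.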
Blast result :
ORF 2
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 357 | 118 | 287 | 643 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRPGSDVAVYLCREPVDMRKSMDGLSLLVQEVMECDPFTAAVFVFCNRARDKVKILFWERNGFVVWYKRLEQERFKWPECGEGDRLTLSGQELNWLLDG
IDITRMQPHKALHFQSVG
Blast result :
ORF 3
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 1578 | 525 | 713 | 2290 | + | No |
Chemistry : DDE
ORF sequence :
LPSSSDLQRELDEQRALVERLQAQLAEKEAAWAAEKRSLFEQIRLLLDNRFGPSTEKYSIKQQDMFFDEAESLVEEPAESDETDEADEDNQPVRGKRRRR
GGRAPLPPELPRVDIVHDLPEDEQQCACGCGALTRIGEEVTEQLDIIPAQIQVLRHVRIKYACRACEDGVQIADLPPQPLPKSNASPGLLAYIATAKYQD
ALPLYRQEQVFKRLGLELPRNTLARWMVDMGALLAPLAERMRAHLHNAELIHMDETTVQVNTEPGRAASSTSYMWVQRGGPPGAEVVLFDYDPSRSGQVP
RRLLDDYNGILLSDGYEGYAQVVRDNAITHAGCWAHARRKFVEAQKAQPKGKTGKADRALASIGKLYRVEREAQGLPVEERERLRATHSRPLIEDLRQWL
DQSLEKVPPKSAIGKAVHYLNSQWPRLIRFLEDGRIPLDNNPAENAIRPFVVGRKNWLFSQTPRGAHASAAIYSVIETAKINGLEPYAYLLEVLKNLPAA
ASDEAIDGLLPWHQDESLYTLKPTE
Blast result :
Comments : 78% similar to ISAeh1 transposase
Comments
The ISNarch5 transposase is 97% similar to the ISNarch2 transposase at the amino acid level.
References
[1] Sonbol, S. (2020) Direct submission.
[2] Xue, Q. (2020) Direct GenBank submission.