ISNarch2
- Family IS66
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| NZ_CP050695.1 | ND | Aquisalimonas sp. | Natrialbaceae archaeon XQ-INN 246 strain 2447; Aquisalimonas sp. 2447 |
DNA section
IS Length : 2315 bp
Ends
IR Length : 22/24
IRL : GTAAGCGCTCCACGAACCCCATCTGTTGAGTCGATTCCAGTTCCGCGACA
IRR : GTAAGCGCTCCGCGAACCCCACCTACTCAGCGGGTTTGAGCGCGTATAGG
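The 22/24 figure can be reproduced by comparing the two listed ends position by position. The sketch below assumes that IRL and IRR are both written in the same inward-reading orientation, so they can be compared directly without reverse-complementing, and that the inverted repeat covers the first 24 positions. The helper name `count_matches` is illustrative only.

```python
IRL = "GTAAGCGCTCCACGAACCCCATCTGTTGAGTCGATTCCAGTTCCGCGACA"
IRR = "GTAAGCGCTCCGCGAACCCCACCTACTCAGCGGGTTTGAGCGCGTATAGG"

IR_SPAN = 24  # assumed span of the inverted repeat, taken from "22/24"


def count_matches(left, right, span):
    """Count identical positions over the first `span` bases of both ends."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))


matches = count_matches(IRL, IRR, IR_SPAN)
# 1-based positions at which the two ends differ within the span
mismatch_positions = [i + 1 for i in range(IR_SPAN) if IRL[i] != IRR[i]]

print(f"{matches}/{IR_SPAN} matching positions; mismatches at {mismatch_positions}")
```

Under these assumptions the two ends differ at positions 12 and 22, which agrees with the listed IR length of 22/24.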
Insertion site
| Left flank | Direct repeat | Right flank | DR Length |
|---|---|---|---|
| TGGAAGCCCT | GAAGCCCT | GAAGCCCTTG | 8 |
| CCGTTCGTCG | GTTCGTCG | GTTCGTCGTC | 8 |
| ATCTGATTTT | | GGCTTCGGTG | 0 |
| AGCACCCGGC | CACCCGGC | CACCCGGCGG | 8 |
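In the rows above, each listed left flank ends with the direct repeat and each right flank begins with it. As a rough sketch, the repeat can therefore be recovered as the longest suffix of the left flank that is also a prefix of the right flank. This assumes the flanks are recorded with the repeat included, and it reads the third row as a site with no direct repeat (DR length 0). The function name `direct_repeat` is hypothetical.

```python
def direct_repeat(left_flank, right_flank):
    """Return the longest suffix of left_flank that is also a prefix of right_flank."""
    for k in range(min(len(left_flank), len(right_flank)), 0, -1):
        if left_flank[-k:] == right_flank[:k]:
            return left_flank[-k:]
    return ""  # no shared sequence: no direct repeat


# (left flank, right flank) pairs copied from the insertion-site table
flanks = [
    ("TGGAAGCCCT", "GAAGCCCTTG"),
    ("CCGTTCGTCG", "GTTCGTCGTC"),
    ("ATCTGATTTT", "GGCTTCGGTG"),
    ("AGCACCCGGC", "CACCCGGCGG"),
]

results = [direct_repeat(left, right) for left, right in flanks]
for (left, right), dr in zip(flanks, results):
    print(left, right, dr or "-", len(dr))
```

With only 10 bp of flank on each side, a short chance match could be mistaken for a repeat, so this is a consistency check on the table rather than a detection method.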
DNA sequence
GTAAGCGCTCCACGAACCCCATCTGTTGAGTCGATTCCAGTTCCGCGACACTGAGCCTTCCAGATTGATGGGAGGCTTGGAGCATGTGGTTATCCCCGGT
ACGCGCCCGCTGGCTGGGTCATTTGTATCACGCCCATGCCTTGGGGTTGTCGCTGAGTGAGTATGCGCGGCGCCAGGATGTCTCGCTCGCCGAGTTGATG
GACTGGGAGCGCCGGCTGCATGAGGCCGGAGTTCCGGTTCCGGAGCGTCACCGTCCCGCACGGTTCGTGGCCGTGGAGGTGGTGGCATGATCCGGCCCGG
GACGGATGTGGCGGTGTATCTGTGCCGCGAGCCCGTGGACATGCGCAAGTCGATTGACGGTTTGTCGCTGCTCGTCCAGGAGGTCATGGCGTGCGATCCG
TTCACCGCGGCGGTGTTCGTGTTCTGCAACCGGGCGCGGGATAAGGTGAAGATCCTGTTCTGGGAGCGCAACGGCTTTGTGGTCTGGTACAAACGCCTCG
AGCAGGAGCGGTTCAAGTGGCCGGCTTGCGGTGAGCGGGAGCGGCTCACGCTCTCGGGCCAGGAGCTCAACTGGTTGCTCGACGGCATCGACATCACCCG
TATGCAGCCGCACAAAGCGTTGCATTTTCAGTCGGTTGGATGAAATTTTTGCTCGCCGCACCGGTGGCGTTTTGGTACAATTACCCACATGAAACGGGCC
GATACCAACACGTTGCCATCCAGTTCCGATCTCCAGCGCGAGCTCGATGAGCAGCGCGCTCTGGTCGAACGCCTCCAGGCCCAACTCGCCGAGAAGGAGG
CCGCGTGGGCGGCAGAGAAGCGCTCGCTGTTCGAGCAAATCCGGCTGCTGCTGGACAACCGCTTCGGCCCCTCCACCGAGAAGTACAGCATCAAGCAGCA
GGACCTGTTCTTCGACGAGGCCGAGAGCCTGGTGGAAGAGCCCGCCGAGTCAGGTGAGACTGCCGAGGCGGAAGAGGAAAACCAGCCGGCCCCCAGTGGC
GGCAAGCGTCGCCGGGGTGGGCGCGCCCCACTGCCGCCGGAGCTGCCTCGCGTGGACATCGTCCACGATCTCCCCGAGGACGAACAGCAGTGTGCCTGCG
GCTGCGGTGCGCTCACCCGCATCGGCGAAGAGGTCACCGAGCAGCTCGACATCATCCCGGCCCAGATCCAGGTGCTGCGCCATGTGCGCATCAAGTACGC
CTGCCGGGCCTGCGAGGACGGTGTCCAGATCGCCGATCTGCCGCCGCAGCCGCTGCCGAAGAGCAACGCGAGCCCCGGACTGCTTGCCTATATCGCCACC
GCCAAGTACCAGGACGCGCTGCCGCTGTACCGCCAGGAGCAGGTCTTCAAACGACTGGGCCTGGAGCTGCCACGGAACACGCTCGCCCGCTGGATGGTGG
ACCTGGGTGCGTTGCTCGCGCCACTGGCCGAGCGCATGCGCGCCCATTTGCACAGTGCGGAGCTCATCCACATGGATGAGACCACCGTGCAGGTGAACAC
CGAGCCCGGGCGGGCCGCCTCCAGCACCTCGTACATGTGGGTCCAACGCGGCGGGCCGCCCGGAGCCGAGGTGGTGCTGTTCGACTACGATCCCAGCCGC
TCGGGCCAGGTCCCGCGGCGCCTGCTGGATGACTATGGTGGTATCCTGCTCACCGACGGCTACGAGGGCTATGCCCAGGTCGTGCGCGATAATGCCATCA
CCCATGCCGGGTGCTGGGCGCATGCGCGCCGCAAGTTCAAAGAGGCCCAGAAGGTCCAGCCCAAGGGCAAGACCGGCAAGGCCGACCGGGCGCTGGCGTC
CATCGGCAAGCTCTACCGGGTGGAGCGCGAAGCCCAGGGCCTGCCCGTTGAGAAGCGTGAACGCCTGCGCGCCACGCACAGCCGGCCGCTGATCGAGGAT
CTGCGCCAGTGGCTTGACCAGTCCCTGGAGAAGGTGCCGCCGAAGAGCGCCATCGGCAAGGCCGTGCACTACCTCAACAGCCAATGGCCCCGGCTCATCC
GCTTCCTGGAGGATGGCCGCATCCCGCTGGACAACAACCCCGCGGAGAACGCCATTCGGCCGTTCGTGGTGGGGCGCAAAAACTGGCTGTTCAGTCAGAC
GCCGCGGGGTGCGCACGCCAGTGCCACGATCTACAGCGTCATCGAGACGGCCAAGATCAACGGCCTGGAGCCCTACGCGTACCTGCTCGAGGTGTTAAAG
AACCTGCCGGGCGCGACAACCGGCGAGGCCATCGACCGACTGCTGCCGTGGCATCAGGACGAGAGCCTATACGCGCTCAAACCCGCTGAGTAGGTGGGGT
TCGCGGAGCGCTTAC
Protein section
ORF number : 3
ORF 1
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 207 | 68 | 84 | 290 | + | No |
AG : IS66 TnpA
ORF sequence :
MWLSPVRARWLGHLYHAHALGLSLSEYARRQDVSLAELMDWERRLHEAGVPVPERHRPARFVAVEVVA
Blast result :
ORF 2
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 357 | 118 | 287 | 643 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRPGTDVAVYLCREPVDMRKSIDGLSLLVQEVMACDPFTAAVFVFCNRARDKVKILFWERNGFVVWYKRLEQERFKWPACGERERLTLSGQELNWLLDG
IDITRMQPHKALHFQSVG
Blast result :
ORF 3
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 1605 | 534 | 689 | 2293 | + | No |
Chemistry : DDE
ORF sequence :
MKRADTNTLPSSSDLQRELDEQRALVERLQAQLAEKEAAWAAEKRSLFEQIRLLLDNRFGPSTEKYSIKQQDLFFDEAESLVEEPAESGETAEAEEENQP
APSGGKRRRGGRAPLPPELPRVDIVHDLPEDEQQCACGCGALTRIGEEVTEQLDIIPAQIQVLRHVRIKYACRACEDGVQIADLPPQPLPKSNASPGLLA
YIATAKYQDALPLYRQEQVFKRLGLELPRNTLARWMVDLGALLAPLAERMRAHLHSAELIHMDETTVQVNTEPGRAASSTSYMWVQRGGPPGAEVVLFDY
DPSRSGQVPRRLLDDYGGILLTDGYEGYAQVVRDNAITHAGCWAHARRKFKEAQKVQPKGKTGKADRALASIGKLYRVEREAQGLPVEKRERLRATHSRP
LIEDLRQWLDQSLEKVPPKSAIGKAVHYLNSQWPRLIRFLEDGRIPLDNNPAENAIRPFVVGRKNWLFSQTPRGAHASATIYSVIETAKINGLEPYAYLL
EVLKNLPGATTGEAIDRLLPWHQDESLYALKPAE
Blast result :
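The ORF coordinates and lengths listed above are mutually consistent, which the following sketch checks. It assumes that Begin and End are 1-based inclusive positions on the IS sequence and that End includes the stop codon. The names `orfs` and `orf_lengths` are illustrative.

```python
# name: (begin, end, listed protein length in aa), copied from the ORF tables
orfs = {
    "ORF 1": (84, 290, 68),
    "ORF 2": (287, 643, 118),
    "ORF 3": (689, 2293, 534),
}


def orf_lengths(begin, end):
    """Return (nucleotide length, amino-acid length) for 1-based inclusive coordinates."""
    nt = end - begin + 1
    if nt % 3:
        raise ValueError("ORF length is not a multiple of three")
    return nt, nt // 3 - 1  # one codon is the stop codon


computed = {name: orf_lengths(b, e) for name, (b, e, _) in orfs.items()}

# Overlap between the end of ORF 1 and the start of ORF 2, in nucleotides
overlap = orfs["ORF 1"][1] - orfs["ORF 2"][0] + 1

for name, (nt, aa) in computed.items():
    print(name, nt, "bp", aa, "aa")
print("ORF 1 / ORF 2 overlap:", overlap, "nt")
```

Under these assumptions ORF 1 and ORF 2 overlap by 4 nt (positions 287 to 290), where the ATG of ORF 2 and the TGA ending ORF 1 share the sequence ATGA.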
Comments
The ISNarch2 transposase shares 77% amino acid similarity with that of ISAeh1.
Update of 21 May 2026: the host name has been changed in the GenBank file (same accession number): from Natrialbaceae archaeon XQ-INN 246 strain 2447 to Aquisalimonas sp. 2447.
References
[1] Sarah Sonbol (2020) Direct submission.
[2] Xue, Q. (202) Direct GenBank submission.