ISAcma16
- Family IS4
- Group IS4
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| NC_009925 | ND | Acaryochloris marina | Acaryochloris marina MBIC11017 |
DNA section
IS Length : 1483 bp
Ends
IR Length : 18
IRL : CAATGGCACTGACTTAAGAAGTCAGGAAAAAGCTACAAGTCTTGATAGAT
IRR : CAATGGCACTGACTTAAGCTCAATCAAACAGCTAGCTTAGCTTTGAGAGC
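The relationship between the two listed ends can be checked with a short script. This is a minimal sketch, not part of the ISfinder record: the helper names `reverse_complement` and `shared_prefix_length` are hypothetical, and `RIGHT_END` is assumed to be the last 50 bp of the DNA sequence given in this record. It counts how many leading bases IRL and IRR share, which should agree with the stated IR length, and tests whether IRR is the reverse complement of the right end of the element.

```python
# Terminal sequences copied from this record.
IRL = "CAATGGCACTGACTTAAGAAGTCAGGAAAAAGCTACAAGTCTTGATAGAT"
IRR = "CAATGGCACTGACTTAAGCTCAATCAAACAGCTAGCTTAGCTTTGAGAGC"
# Assumption: last 50 bp of the DNA sequence listed in this record (top strand).
RIGHT_END = "GCTCTCAAAGCTAAGCTAGCTGTTTGATTGAGCTTAAGTCAGTGCCATTG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical leading positions of two sequences."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# Number of perfectly matching terminal bases between IRL and IRR.
print("shared terminal bases:", shared_prefix_length(IRL, IRR))
# Whether IRR is the right end read on the opposite strand.
print("IRR is reverse complement of right end:",
      reverse_complement(RIGHT_END) == IRR)
```

The prefix comparison only counts an uninterrupted run of identical bases; imperfect inverted repeats with internal mismatches would need an alignment-based comparison instead.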
Insertion site
| Left flank | Direct repeat | Right flank | DR Length |
|---|---|---|---|
| TCAGATCAATCACATCCCCC | ACCCCCAGAT | AGCGCACCTGGCTCGCCCTC | 10 |
| AGCAATGAGGTATTGACT | ATCCCTTGGAAT | ATCTTTCTCCTTGACAAC | 12 |
| CTACCATTGGGTCGATGGCT | AGCTATAGTT | TATTCGCTCTTTCTAGCCAG | 10 |
| AAATATTGAAAGGGAGAAAG | CCCCATAGAT | TGCTTGAACAGACGGCTAAA | 10 |
| AGAAGGCGAATCTATCGCT | ATTGCATCGTT | CAATCTCTCAACGCTGAAT | 11 |
| GTCGGTATTTCCAGATACG | TCCCTATGGGT | TGCCATCCATAGGGAATAC | 11 |
| CCCAATGTTCAAGAGAGCT | ATCCCAACAGT | GGTTGTTCAAGAATGAACT | 11 |
| CTCAATTCCCTGCCTGATCA | GTCTATAGAT | GCTTTGAAAATTCAGTAATT | 10 |
| GTTTGCTGAGTCGTTTAAGC | AGTCCTGATT | CGCCCAGAATATCTTCCGGT | 10 |
| ATCACCGCCTTGGGATAAAC | TATTAAAGTT | GGCTTGACCTGAATCGGGCA | 10 |
| GGGAAATCCTTTCCCAAGTT | ATTTATGGAT | TCCTGCAGTTTTTGCCCAAA | 10 |
| AGTTCATTCTTGAACAACC | ACTGTTGGGAT | AGCTCTCTTGAACATTGGG | 11 |
| AGGAAACGCTGCATCTGTC | ATCTAACGGAT | AGGTCTCAGGGGTGCGAGA | 11 |
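The DR Length column can be recomputed from the direct repeat sequences themselves. The following is a rough sketch under the assumption that the repeats are exactly as transcribed in the table above; the variable names are illustrative only. It tallies how often each target duplication length occurs.

```python
from collections import Counter

# Direct repeats copied from the insertion site table, in row order.
direct_repeats = [
    "ACCCCCAGAT", "ATCCCTTGGAAT", "AGCTATAGTT", "CCCCATAGAT",
    "ATTGCATCGTT", "TCCCTATGGGT", "ATCCCAACAGT", "GTCTATAGAT",
    "AGTCCTGATT", "TATTAAAGTT", "ATTTATGGAT", "ACTGTTGGGAT",
    "ATCTAACGGAT",
]

# Length of each repeat, then a tally of how many sites show each length.
dr_lengths = [len(dr) for dr in direct_repeats]
length_counts = Counter(dr_lengths)

for length in sorted(length_counts):
    print(f"{length} bp: {length_counts[length]} site(s)")
```

A tally like this makes it easy to see which duplication length predominates among the listed insertion sites.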
DNA sequence
CAATGGCACTGACTTAAGAAGTCAGGAAAAAGCTACAAGTCTTGATAGATATAGGATCAAGACTTGCACGCCATAAACTTCTGCCTTTCACTATGTCTGA
GTCAGCCGAAATTCTCAAGCAGCAATTTTCCCAAAGCCTGGGTTTACCGTGGACAGATATCCTTCCAGCCTCAAGGTTAGAGGAACTCCTGAAAGAAGAA
GCATTCTCCTATCGCAACCGGATATATAGCCCCATCGTTACGCTGTGGGCCATGCTTTACCAAGTGCTATCAGCCGATAAAAGCCTTCGTAACACAGTCA
AATGCATCACGACTTGGCTCACAGCGGCTGGCATCCAACCGCCCTCATCTGATACCGGAGCCTACAGCAAAGCCAGAAGTCGCTTTCCAGAGTCACTGTT
GCAACGTTTAATTCCCGAAAGCGCTGAGTGCTTAGCGCAACCCCTCTCCCCAGAGCACCTCTGGTGTGGTCGGCCCGTCAAGGTCTACGACGGAACCACC
GTGTTGATGGCTGATAGTGCGGCCAACCAAGCATCATATCCACAACATGGCAATCAAACAGCAGGCTGTGGTTTTCCCATCGCTCGCTTGGTAGTGTTCT
TCTGTTTGGTTACCGGTGCAGTGGCGTCAGCTTGTATTGCCTCCTGGGACACCAGTGAAATTGTCATGAGTCGTTTGCTCTATCAAGACCTTGAGGTCGG
TGATGTGGTCATGGCGGACCAAGCTTATGGCAGCTATGTTGATCTAGCCATCATTCAACAACACAGGGCTGATGGAGTGTTGCGTAAACATCATGCTCGC
AAGACTGATTTCCGCAAAGGCAACAAGCATGGCATTGGTGATCATCAGGTGACATGGCATAAGCCAGCCCAACGGCCTGAGCACATGAGTGAGCAAGATT
TTGCCCTGATTCCTCAAACATTGGTCGTTAGAGAGGTGTGTTTGCGCTTATCCCTTAAGGGCTTTCGCGACCAGCACATTATTGTGGTGACGACGCTGCT
GGATGCTCAACGCTACAGCGCTGGGCAACTGACTCGCTTGTATGGCTGGCGTTGGCCAGTGGCGGAAGTCAATCTGCGCCATCTCAAAACCACCTTAAAA
ATGGAGATGCTCAGTGCCAAAACTCCGGATATGGTGCGCAAGGACATTTGGGTACATTTGTTGGGCTATAACCTACTCAGAAGTCTCATGGAACTTGCGG
CACCGCTAGCAGATAATGCTAGAACTCAACTGTCTGTGCAAGGAGCACGACAACACTTCAATCAGATGCTTGCTTTGTTGGCGACAGCCAACCGTGCGAC
CAGAAAGCGGTTGTTTACTCATCTACTTGAGACCATGGCAGCCGATCTATTACCCTCTCGACCGAATCGGCACGAACCGAGAGTCGTCAAACGCAGACCC
AAATCTTTCCCGCGAATGCGACAACCTCGCTCTGCTCTCAAAGCTAAGCTAGCTGTTTGATTGAGCTTAAGTCAGTGCCATTG
Protein section
ORF number : 1
ORF 1
| Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
|---|---|---|---|---|---|
| 1419 | 472 | 42 | 1460 | + | No |
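The ORF figures in the table above can be cross-checked with simple arithmetic. This sketch assumes that Begin and End are 1-based inclusive coordinates and that the nucleotide length includes the stop codon; neither convention is stated in the record, so treat both as assumptions.

```python
# Values taken from the ORF table above.
orf_begin = 42
orf_end = 1460

# Assumption: coordinates are 1-based and inclusive.
orf_length_bp = orf_end - orf_begin + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = orf_length_bp // 3
protein_length_aa = codons - 1

print("ORF length (bp):", orf_length_bp)
print("in frame:", orf_length_bp % 3 == 0)
print("protein length (aa):", protein_length_aa)
```

If the computed values differ from the table, the coordinate convention is the first thing to re-examine.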
Chemistry : DDE
ORF sequence :
MIDIGSRLARHKLLPFTMSESAEILKQQFSQSLGLPWTDILPASRLEELLKEEAFSYRNRIYSPIVTLWAMLYQVLSADKSLRNTVKCITTWLTAAGIQP
PSSDTGAYSKARSRFPESLLQRLIPESAECLAQPLSPEHLWCGRPVKVYDGTTVLMADSAANQASYPQHGNQTAGCGFPIARLVVFFCLVTGAVASACIA
SWDTSEIVMSRLLYQDLEVGDVVMADQAYGSYVDLAIIQQHRADGVLRKHHARKTDFRKGNKHGIGDHQVTWHKPAQRPEHMSEQDFALIPQTLVVREVC
LRLSLKGFRDQHIIVVTTLLDAQRYSAGQLTRLYGWRWPVAEVNLRHLKTTLKMEMLSAKTPDMVRKDIWVHLLGYNLLRSLMELAAPLADNARTQLSVQ
GARQHFNQMLALLATANRATRKRLFTHLLETMAADLLPSRPNRHEPRVVKRRPKSFPRMRQPRSALKAKLAV
Blast result :
Comments
ISAcma15 is 57% aa similar to ISArch10.
References
1] Swingley,W.D., Chen,M., Cheung,P.C., Conrad,A.L., Dejesa,L.C., Hao,J., Honchak,B.M., Karbach,L.E., Kurdoglu,A., Lahiri,S., Mastrian,S.D., Miyashita,H., Page,L., Ramakrishna,P., Satoh,S., Sattley,W.M., Shimada,Y., Taylor,H.L., Tomo,T., Tsuchiya,T., Wang,Z.T., Raymond,J., Mimuro,M., Blankenship,R.E. and Touchman,J.W. (2008) Proc. Natl. Acad. Sci. U.S.A.
2] ISfinder annotation (2009)