ISAcma26
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009925 | ND | Acaryochloris marina | Acaryochloris marina MBIC11017 plasmid pREB7 Acaryochloris marina MBIC11017 plasmid pREB6 Acaryochloris marina MBIC11017 plasmid pREB3 Acaryochloris marina MBIC11017 plasmid pREB4 Acaryochloris marina MBIC11017 plasmid pREB8 Acaryochloris marina MBIC11017 plasmid pREB5 Acaryochloris marina MBIC11017 plasmid pREB2 Acaryochloris marina MBIC11017 Acaryochloris marina MBIC11017 plasmid pREB1 |
DNA section
IS Length : 2669 bp
Ends
IR Length : 11/15
IRL : TATTAGACGACAATCGAGCTGACGGATTAATGACGATGTCACGTCGGCTA
IRR : TATTGGATGTCAACCCCATAGGCGATGTCGCGACAACCACAAGGCGGATC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACAGACATAC | CGAGGG | CTTTCGACAG | 6 |
AAGCGATGCC | AGCTTA | TGTCCATTTG | 6 |
TATTAAATCC | TTCCAC | AAAACCACTG | 6 |
GATGTAGAGC | ATGAGG | TTCGTCCCTG | 6 |
CGGCAATTCC | TCAGAT | AGTTGAACCA | 6 |
GTCGATATCG | CCCAAT | ACTGCTACCA | 6 |
CGGACCAACC | ATTGCC | ACGGTCTAGA | 6 |
ATTGATGAAT | ACCATC | ATCAGTCGAA | 6 |
TGTTTCATAA | ATCTTC | TGTGTCACAG | 6 |
GTTTCTTGAG | GCTAAATTAT | 0 | |
AACGCTCCTC | ATTCAGAGGA | 0 | |
GTTCAAAGCT | GCCCAAATTT | 0 |
DNA sequence
TATTAGACGACAATCGAGCTGACGGATTAATGACGATGTCACGTCGGCTATAGCTATGAAGAAACCAGGCATCCCCTGAGAAGGATAAAACTAGATCTCA
ATTCCCTAGAAATGATGTGTGAGATCGCGTTTCTAGAATTCAATCTCAAGAAAAAAGACACTTTTTACATCCCTAATCAATAGAGGGCTGTACAAGTGCC
TAACAAAAGAATTGATTCCTATCATACTCGTGTTTATATGAACGGAAGAGACCTCGGCTTAAAACAAGCTGATGCAGCTTACATTGCCGAAATATCTACA
AGAACCGGACAACGAATTGAGGCCGGCACCCATCAACCTAATCGCGGCCGCCTTCAGGACCAACGCACCGTTCCCGACCCCTTAGCTGATGTGTGGGAAG
ACGAGCTAGAACCAATGCTGCGTCGAGACCCACGCCTCAAACCCATGACCCTGTATGAGTACCTGCAGGATAAGTATCCAGGCCAGTATCCCCAAGTCCT
GCGGACCCTACAACGTCGGGTGAGAACGTGGAAAGCCTTACATGGACCAAGCCCTGAAGTGATGTTTGAATTGCGTCATGAACCAGGGGTACAAGGGTTC
TCCGATTTTACAGAACTCAAAGGCATCACGATTACCATTGCCGGCAAACCCTTTGAGCACCTGATTTACCATTACCGTCTGGGATACAGCGGCTGGCGAT
ATGCCCAGATCATCGAAGGAGGCGAAAGCTTTGTCGCCCTCTCAGAAGGATTGCAAAATGCCTTTGCAGCCTGTGGAGGTGTTCCTACACAGCATCGTAC
TGATAGTTTGAGTGCAGCCTATCGCAACATGGGCGGCCGCCGGTCCAAAAACCTCACTCGTCTGTACGACGAACTGTGTGACCACTATCGGCTAGAACCC
ACTCGTAACAACAAAGGTGTAGCCCATGAGAATGGCTCCATTGAATCTCCCCATGGTCATCTGAAGAACCGAATTAAGCAGGCGATCTATCTGCGCGGCA
GTGCAGATTTTACGAGCGTTGCTGAGTATCAAGCCTTAATTGATGCACAGGTTGCCAAGTTGAATCAGCAGTGCCAAACCAAGTATGAGCAAGAGAAAGA
CCATCTACAACCACTGCCCAAATATCGAACCCCTGACTATGAAGTGCTCACGGCTAAAGTCAGCAAACGCAGCACCATCGATGTTCGCTGCATTCTATAC
ACCGTCCCTTCTCGACTGATTGGTCGTCAATTGGAACTGCATCTATACCATGACCGGATTGTCGGCTATCTGGAGCGACACCCGGTGGTGGAATTGCCGA
GGAAGCGCGTCAGTGGCAAAGGCAAACGTCGCGACCGTTGCATCAACTATCGCCATGTTATTGGTTCAATGCGATTGAAGCCTCGTGCTTTTATCTATTG
CACCTGGCAATCAGACCTACTTCCCAATCCTGAATACCGCCAAATCTGGGAACAGCTCAAAGCCCAATTTGACCTGGAGCAGGCTGCCAAGATCATCGTG
GAAGCCCTGTATATTGCTGCGGTTCAAGATAAAGAACAGGCCGTAGCAGTGTACTTACAGCAGCAGCTTCGCTCATCCAGCCTTACCCTCAATCGCCTGA
AAAAACAGTTTGAGCCGCCTCAGATGAAGCAGGTTCCTGAACTCAGCATTGAACAACATTCACTTGAACTTTATGACAAACTCCTCCCCTCCTGCTCAGT
CCCCGCTGAGCCCCTACCAGCACCTGAGCCTCTATTTAAAAAAGCTCAGGCTCTCCCACATGTTGACCCATTGGGAATCTATCGAATCCCAAGCCATGCA
GGAAAACTGGTCTTATGCGGAATTCTTACTAGCCTTGTGCGAAACGGAGGCCCAACGAAGAGAACAAGCTCGTCTAAAACGTGCCCTCACCGAAGCCAGG
CTCCCAAACGCAAAAAGTTTTACCAACTTTGACTTTAGCCATTGTCCCCAGCTCAATCCAGCTCCCTTGATGCAATTAGCCGCAGATCCGGGTTGGTTGG
AGCGCGCCGAGAATTGCCTTATTCTGGGGCCCTCGGGTGTTGGAAAAACACATCTGGCCACTGGGGTGTCCAAAAAGATGCTGGAATTCGGTAAGCGGGT
GAAGTTCTTTGCAGCCAACGCATTGGTCCAGCAACTGCAACAGGCGAAGCTCCAACTGCAGCTGCATCCAATGCTCAAAAAACTGGACCGCTATGATCTG
TTGATCTTGGATGACTTGGGCTATTGCAAAAAGTCGGAAGCGGAAACGTCCGTTCTGTTTGAATTAATTGCGCATCGCTATGAACGAAAAAGTTTGTTGA
TTACAGCGAATCAACCCTTCAGCCAATGGGATGACATTTTCACTGATTCGATGATGGCGGTGGCTGCCATTGATCGCTTAATTCATCATGGCTTGATTAT
CGAAATTCAAGCCGATAGTTATCGGCGCAAATCAGCGACTCAGAGAACGGCTCAAACTCAGTCTCAACCTCAAAAATCCCAGTCCAAATAATCGAGCGAT
CACTGGTTGTCATTTCGATCACGGGTTGTCGTTTTGTTCTGAAGGTATCGTATGAGGTCTGAAAAAGAGCTAACGCAAAGGGAGTGTACGCCAAATAATA
CGGTTCAAATTTGGGGGTTGATCCGCCTTGTGGTTGTCGCGACATCGCCTATGGGGTTGACATCCAATA
ATTCCCTAGAAATGATGTGTGAGATCGCGTTTCTAGAATTCAATCTCAAGAAAAAAGACACTTTTTACATCCCTAATCAATAGAGGGCTGTACAAGTGCC
TAACAAAAGAATTGATTCCTATCATACTCGTGTTTATATGAACGGAAGAGACCTCGGCTTAAAACAAGCTGATGCAGCTTACATTGCCGAAATATCTACA
AGAACCGGACAACGAATTGAGGCCGGCACCCATCAACCTAATCGCGGCCGCCTTCAGGACCAACGCACCGTTCCCGACCCCTTAGCTGATGTGTGGGAAG
ACGAGCTAGAACCAATGCTGCGTCGAGACCCACGCCTCAAACCCATGACCCTGTATGAGTACCTGCAGGATAAGTATCCAGGCCAGTATCCCCAAGTCCT
GCGGACCCTACAACGTCGGGTGAGAACGTGGAAAGCCTTACATGGACCAAGCCCTGAAGTGATGTTTGAATTGCGTCATGAACCAGGGGTACAAGGGTTC
TCCGATTTTACAGAACTCAAAGGCATCACGATTACCATTGCCGGCAAACCCTTTGAGCACCTGATTTACCATTACCGTCTGGGATACAGCGGCTGGCGAT
ATGCCCAGATCATCGAAGGAGGCGAAAGCTTTGTCGCCCTCTCAGAAGGATTGCAAAATGCCTTTGCAGCCTGTGGAGGTGTTCCTACACAGCATCGTAC
TGATAGTTTGAGTGCAGCCTATCGCAACATGGGCGGCCGCCGGTCCAAAAACCTCACTCGTCTGTACGACGAACTGTGTGACCACTATCGGCTAGAACCC
ACTCGTAACAACAAAGGTGTAGCCCATGAGAATGGCTCCATTGAATCTCCCCATGGTCATCTGAAGAACCGAATTAAGCAGGCGATCTATCTGCGCGGCA
GTGCAGATTTTACGAGCGTTGCTGAGTATCAAGCCTTAATTGATGCACAGGTTGCCAAGTTGAATCAGCAGTGCCAAACCAAGTATGAGCAAGAGAAAGA
CCATCTACAACCACTGCCCAAATATCGAACCCCTGACTATGAAGTGCTCACGGCTAAAGTCAGCAAACGCAGCACCATCGATGTTCGCTGCATTCTATAC
ACCGTCCCTTCTCGACTGATTGGTCGTCAATTGGAACTGCATCTATACCATGACCGGATTGTCGGCTATCTGGAGCGACACCCGGTGGTGGAATTGCCGA
GGAAGCGCGTCAGTGGCAAAGGCAAACGTCGCGACCGTTGCATCAACTATCGCCATGTTATTGGTTCAATGCGATTGAAGCCTCGTGCTTTTATCTATTG
CACCTGGCAATCAGACCTACTTCCCAATCCTGAATACCGCCAAATCTGGGAACAGCTCAAAGCCCAATTTGACCTGGAGCAGGCTGCCAAGATCATCGTG
GAAGCCCTGTATATTGCTGCGGTTCAAGATAAAGAACAGGCCGTAGCAGTGTACTTACAGCAGCAGCTTCGCTCATCCAGCCTTACCCTCAATCGCCTGA
AAAAACAGTTTGAGCCGCCTCAGATGAAGCAGGTTCCTGAACTCAGCATTGAACAACATTCACTTGAACTTTATGACAAACTCCTCCCCTCCTGCTCAGT
CCCCGCTGAGCCCCTACCAGCACCTGAGCCTCTATTTAAAAAAGCTCAGGCTCTCCCACATGTTGACCCATTGGGAATCTATCGAATCCCAAGCCATGCA
GGAAAACTGGTCTTATGCGGAATTCTTACTAGCCTTGTGCGAAACGGAGGCCCAACGAAGAGAACAAGCTCGTCTAAAACGTGCCCTCACCGAAGCCAGG
CTCCCAAACGCAAAAAGTTTTACCAACTTTGACTTTAGCCATTGTCCCCAGCTCAATCCAGCTCCCTTGATGCAATTAGCCGCAGATCCGGGTTGGTTGG
AGCGCGCCGAGAATTGCCTTATTCTGGGGCCCTCGGGTGTTGGAAAAACACATCTGGCCACTGGGGTGTCCAAAAAGATGCTGGAATTCGGTAAGCGGGT
GAAGTTCTTTGCAGCCAACGCATTGGTCCAGCAACTGCAACAGGCGAAGCTCCAACTGCAGCTGCATCCAATGCTCAAAAAACTGGACCGCTATGATCTG
TTGATCTTGGATGACTTGGGCTATTGCAAAAAGTCGGAAGCGGAAACGTCCGTTCTGTTTGAATTAATTGCGCATCGCTATGAACGAAAAAGTTTGTTGA
TTACAGCGAATCAACCCTTCAGCCAATGGGATGACATTTTCACTGATTCGATGATGGCGGTGGCTGCCATTGATCGCTTAATTCATCATGGCTTGATTAT
CGAAATTCAAGCCGATAGTTATCGGCGCAAATCAGCGACTCAGAGAACGGCTCAAACTCAGTCTCAACCTCAAAAATCCCAGTCCAAATAATCGAGCGAT
CACTGGTTGTCATTTCGATCACGGGTTGTCGTTTTGTTCTGAAGGTATCGTATGAGGTCTGAAAAAGAGCTAACGCAAAGGGAGTGTACGCCAAATAATA
CGGTTCAAATTTGGGGGTTGATCCGCCTTGTGGTTGTCGCGACATCGCCTATGGGGTTGACATCCAATA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1737 bp | 578 aa | 196 | 1932 | + | No |
Chemistry : DDE
ORF sequence :
VPNKRIDSYHTRVYMNGRDLGLKQADAAYIAEISTRTGQRIEAGTHQPNRGRLQDQRTVPDPLADVWEDELEPMLRRDPRLKPMTLYEYLQDKYPGQYPQ
VLRTLQRRVRTWKALHGPSPEVMFELRHEPGVQGFSDFTELKGITITIAGKPFEHLIYHYRLGYSGWRYAQIIEGGESFVALSEGLQNAFAACGGVPTQH
RTDSLSAAYRNMGGRRSKNLTRLYDELCDHYRLEPTRNNKGVAHENGSIESPHGHLKNRIKQAIYLRGSADFTSVAEYQALIDAQVAKLNQQCQTKYEQE
KDHLQPLPKYRTPDYEVLTAKVSKRSTIDVRCILYTVPSRLIGRQLELHLYHDRIVGYLERHPVVELPRKRVSGKGKRRDRCINYRHVIGSMRLKPRAFI
YCTWQSDLLPNPEYRQIWEQLKAQFDLEQAAKIIVEALYIAAVQDKEQAVAVYLQQQLRSSSLTLNRLKKQFEPPQMKQVPELSIEQHSLELYDKLLPSC
SVPAEPLPAPEPLFKKAQALPHVDPLGIYRIPSHAGKLVLCGILTSLVRNGGPTKRTSSSKTCPHRSQAPKRKKFYQL
VLRTLQRRVRTWKALHGPSPEVMFELRHEPGVQGFSDFTELKGITITIAGKPFEHLIYHYRLGYSGWRYAQIIEGGESFVALSEGLQNAFAACGGVPTQH
RTDSLSAAYRNMGGRRSKNLTRLYDELCDHYRLEPTRNNKGVAHENGSIESPHGHLKNRIKQAIYLRGSADFTSVAEYQALIDAQVAKLNQQCQTKYEQE
KDHLQPLPKYRTPDYEVLTAKVSKRSTIDVRCILYTVPSRLIGRQLELHLYHDRIVGYLERHPVVELPRKRVSGKGKRRDRCINYRHVIGSMRLKPRAFI
YCTWQSDLLPNPEYRQIWEQLKAQFDLEQAAKIIVEALYIAAVQDKEQAVAVYLQQQLRSSSLTLNRLKKQFEPPQMKQVPELSIEQHSLELYDKLLPSC
SVPAEPLPAPEPLFKKAQALPHVDPLGIYRIPSHAGKLVLCGILTSLVRNGGPTKRTSSSKTCPHRSQAPKRKKFYQL
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
819 bp | 272 aa | 1673 | 2491 | + | No |
AG : IS21 helper
ORF sequence :
MTNSSPPAQSPLSPYQHLSLYLKKLRLSHMLTHWESIESQAMQENWSYAEFLLALCETEAQRREQARLKRALTEARLPNAKSFTNFDFSHCPQLNPAPLM
QLAADPGWLERAENCLILGPSGVGKTHLATGVSKKMLEFGKRVKFFAANALVQQLQQAKLQLQLHPMLKKLDRYDLLILDDLGYCKKSEAETSVLFELIA
HRYERKSLLITANQPFSQWDDIFTDSMMAVAAIDRLIHHGLIIEIQADSYRRKSATQRTAQTQSQPQKSQSK
QLAADPGWLERAENCLILGPSGVGKTHLATGVSKKMLEFGKRVKFFAANALVQQLQQAKLQLQLHPMLKKLDRYDLLILDDLGYCKKSEAETSVLFELIA
HRYERKSLLITANQPFSQWDDIFTDSMMAVAAIDRLIHHGLIIEIQADSYRRKSATQRTAQTQSQPQKSQSK
Blast result :
Comments
ISAcma26 is 64% aa similar to ISEc10.
This IS has atypic IRs: it begins with 'TA' instead of 'TG'.
This IS has atypic IRs: it begins with 'TA' instead of 'TG'.
References
1] Swingley,W.D., Chen,M., Cheung,P.C., Conrad,A.L., Dejesa,L.C.,Hao,J., Honchak,B.M., Karbach,L.E., Kurdoglu,A., Lahiri,S.,Mastrian,S.D., Miyashita,H., Page,L., Ramakrishna,P., Satoh,S., Sattley,W.M., Shimada,Y., Taylor,H.L., Tomo,T., Tsuchiya,T., Wang,Z.T., Raymond,J., Mimuro,M., Blankenship,R.E. and Touchman,J.W. (2008) Proc. Natl. Acad. Sci. U.S.A.
2] ISfinder annotation (2009)
2] ISfinder annotation (2009)