ISAzo4
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006513 | ND | Azoarcus sp. | Azoarcus sp. EbN1 |
DNA section
IS Length : 2757 bp
Ends
IR Length : 26/29
IRL : TGCGGATTCCGACCCAACGTGACCGCTGACTCCGAAATAGTGTGACCGGT
IRR : TGCGCATTCCGACGAACGTGACCGCCTGCACCGACGAACGTGACCGGGGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAAGGCCATC | GGCCAGC | CGGCGGGGCT | 7 |
DNA sequence
TGCGGATTCCGACCCAACGTGACCGCTGACTCCGAAATAGTGTGACCGGTGAATCCGCGGTCGTGACCGCGGATTCCGATTTGATCGTGACCGATTTCGG
CGAGTTGTCGGAATGGGCGGTCACGATGTCGGAATCAATGGTCACGATCAAATCGGAACGGGTTTGTGAGGCCAAGCGTGGCAACGTCGTTTGTCAGGCC
GGCTACCCTCGCGCGCTTTGCGCGGAGACCGAGGATGCCGGCGGAGCGGATTGCCATGCACAAGATCAGGGAGCTGTTACGGCTCAAATACGACTGCGCG
CTGTCGCATGAGCGCATTGCCCGCGCGCTGTCGATTTCCAAAGGCGTGGTCGCCAAGTATGTGAAGGCGGCCGAGGAGTGCGGCCGGCCGTGGGCCGAGT
TGTCGGCGGCCGACGAGGCCGAGTTGCGCCGGGTGCTGGGGGTCGCCCGGCGTGGACGTGGCGCGAGCGTCGCGAATGTGCCGCCGGATCTGGCGGCGGT
GCATCAGGGGCTCAAGCGAAAGAACGTCACGCTGGCGCTGCTGTGGGAAGAGTACGTGCAGACGGCCGACGGGCCGAGCTATCAGTACTCGCGCTTTTGC
GACCTGTACCGCGCGTTCGCCCGCACGCTCAAGCGCTCGATGCGCCAGGTGCACCGCGCCGGCGAGAAACTCTTCATCGATTACGCGGGCGACACGGTGC
CGATCGTCGACGCCGACACGGGCGAGATTTCGCGCGCGCAGATCTTCGTCGCGGTGCTCGGCGCCTCGAGCTATACGTTCGCCTGCGCCACGGCGACGCA
GTCGCAGGCCGACTGGCTGGGCTCGCTCGCCCGGGCGCTGGCCTTCATCGGCGGCGTGCCCGAGCTCGTGGTGCCCGACAATACGCGCTCGCTCGTTGGC
CAAGCCGACCGCTACGAGCCGCAACTGCAGCGCACCACCGCCGAGTTCGCCGCGCACTACGGCGTGGCGATCCTGCCCGCGCGCCCCTACAAGCCGCAGG
ACAAGGCCAAGGTCGAAGTCGGCGTACAGATCGTGCAGCGCTGGATTCTGGCGCGGCTGCGGCATCGACGTTTCTTCTCGCTGGGAGAGTTGAACGAGGC
GATCGCCGCGTTACTCGAGCCGCTGAACACGCGTGCCTTCCGGCGCCTGCCAGGCTCGCGCCACGAAGCGTTCGAGACGCTCGATCGGCCGGCGCTGCGC
CCGTTGCCCGCCACCGCGTTCCAGTTCGCCCAGTGGAAACGGGCCAAGCCCAATATCGATTACCACGTCGAGTTCGACGGGCATTACTACAGCGTGCCGT
ATGCGCTCGCCGGCCAACCGGTGGAGTTGCGCATCACCGCGAGCAGCATCGAATGCTTTGCGGCGGGCCGCCGCGTCGCGGTGCATGCGAGAAGCCCCCG
GTACGGAGCGTTCACCACGCTCACCGAGCACATGCCCGCATCCCATCAGGCGCATCGCCAGTGGTCCCCGGGCAAACTCATCGCCTGGGGTGCCACGGTC
GGACCTCACACCGAGCAGGTGGTCAGCCACCAACTCGAACGCATGCCGCACCCCGAGCAGGGCTACCGGGCGTGCCTCGGGCTGATGCGCCTCGGTCGCC
AATACGGCAACGAACGCCTCGAAGCCGCGGCGACCCGCGCCGTCACCCTCGGGGCGATGCGTTACCGCAACGTCGCCTCGATCCTCAAGAGCGGGCTCGA
CCGCGCGCCGCTCCCCGCATCCACTGCGCAGCAAAGCGAGCTCGCGCTACCGGCCGCTCACGAGAACCTGCGCGGGGCGCGCTACTACCACTGATTCCAC
CCACCGGAGAACATCGATGCTGATACAACACAGCCTGCAACAACTGCGCACCCTGCGCCTGGAAGGCATGGCGCGCGCCTTCGAGGAGCAGCTCACCCAG
CCCGCCATCACCGCGCTCAGCTTCGAGGAGCGCTTCGCCCAGCTCATCGATCGCGAGATCCTGCTGCGCGACGGCAAACGCATCGATCGGCTGCTCAAGG
CCGCCCGCATCAAAGCCGCTGCCGCCTGCCTGGAGGACGTCGACTACCGCGCCGGGCGCGGCCTCGAGCGCAGTCAGATCGCCGCGCTCGGAACCGGCCA
GTGGATTCGCCACCACCAGAACTGCCTCATCACGGGCCCCACCGGCAGCGGCAAGACGTGGCTCGCCTGCGCGCTCGCCAACGCGGCGTGCCGGCAGGGC
CTCGCGGCCTACTACGTGCGTCTGCCGCGGCTCTTCGAGGAGCTGCGCATCGCGCATGCCGATGGCAGCTTCAGCCGACGCCTCATGCAGCTCGCGCGCA
TGGATCTCATCGTCATCGACGATTGGGGCCTGGCCGCGCCGTCGGCGCAGGAGCGAAGCGACCTGCTGGAGCTGCTCGACGACCGCGTCGGCACGCGTTC
GACCGTGATCACCAGCCAGCTGCCGATCGAGCACTGGCACACCTACCTGGGCGACCCGACCTTCGCCGATGCGATCCTCGATCGCGTCGTGCACGCCGCG
CACAAGCTCGCCCTCAAGGGCGAGTCGATGAGAAGAAAGGAAAAGGCATGAGGCCGTGCGCGCGAGACGCCTTACCCACAGGCGCCCGCCCGCCAGCGTA
TGAGGCGCGCTGGCGGCCGCCTGTGGATAAGCCTGCGCCGTGCCCCGAATTGACCGTGATCGTGACCGACACGGTTAGAATCTGAACACCACGCTTCGCG
CGCAGACCCCCCGGTCACGTTCGTCGGTGCAGGCGGTCACGTTCGTCGGAATGCGCA
CGAGTTGTCGGAATGGGCGGTCACGATGTCGGAATCAATGGTCACGATCAAATCGGAACGGGTTTGTGAGGCCAAGCGTGGCAACGTCGTTTGTCAGGCC
GGCTACCCTCGCGCGCTTTGCGCGGAGACCGAGGATGCCGGCGGAGCGGATTGCCATGCACAAGATCAGGGAGCTGTTACGGCTCAAATACGACTGCGCG
CTGTCGCATGAGCGCATTGCCCGCGCGCTGTCGATTTCCAAAGGCGTGGTCGCCAAGTATGTGAAGGCGGCCGAGGAGTGCGGCCGGCCGTGGGCCGAGT
TGTCGGCGGCCGACGAGGCCGAGTTGCGCCGGGTGCTGGGGGTCGCCCGGCGTGGACGTGGCGCGAGCGTCGCGAATGTGCCGCCGGATCTGGCGGCGGT
GCATCAGGGGCTCAAGCGAAAGAACGTCACGCTGGCGCTGCTGTGGGAAGAGTACGTGCAGACGGCCGACGGGCCGAGCTATCAGTACTCGCGCTTTTGC
GACCTGTACCGCGCGTTCGCCCGCACGCTCAAGCGCTCGATGCGCCAGGTGCACCGCGCCGGCGAGAAACTCTTCATCGATTACGCGGGCGACACGGTGC
CGATCGTCGACGCCGACACGGGCGAGATTTCGCGCGCGCAGATCTTCGTCGCGGTGCTCGGCGCCTCGAGCTATACGTTCGCCTGCGCCACGGCGACGCA
GTCGCAGGCCGACTGGCTGGGCTCGCTCGCCCGGGCGCTGGCCTTCATCGGCGGCGTGCCCGAGCTCGTGGTGCCCGACAATACGCGCTCGCTCGTTGGC
CAAGCCGACCGCTACGAGCCGCAACTGCAGCGCACCACCGCCGAGTTCGCCGCGCACTACGGCGTGGCGATCCTGCCCGCGCGCCCCTACAAGCCGCAGG
ACAAGGCCAAGGTCGAAGTCGGCGTACAGATCGTGCAGCGCTGGATTCTGGCGCGGCTGCGGCATCGACGTTTCTTCTCGCTGGGAGAGTTGAACGAGGC
GATCGCCGCGTTACTCGAGCCGCTGAACACGCGTGCCTTCCGGCGCCTGCCAGGCTCGCGCCACGAAGCGTTCGAGACGCTCGATCGGCCGGCGCTGCGC
CCGTTGCCCGCCACCGCGTTCCAGTTCGCCCAGTGGAAACGGGCCAAGCCCAATATCGATTACCACGTCGAGTTCGACGGGCATTACTACAGCGTGCCGT
ATGCGCTCGCCGGCCAACCGGTGGAGTTGCGCATCACCGCGAGCAGCATCGAATGCTTTGCGGCGGGCCGCCGCGTCGCGGTGCATGCGAGAAGCCCCCG
GTACGGAGCGTTCACCACGCTCACCGAGCACATGCCCGCATCCCATCAGGCGCATCGCCAGTGGTCCCCGGGCAAACTCATCGCCTGGGGTGCCACGGTC
GGACCTCACACCGAGCAGGTGGTCAGCCACCAACTCGAACGCATGCCGCACCCCGAGCAGGGCTACCGGGCGTGCCTCGGGCTGATGCGCCTCGGTCGCC
AATACGGCAACGAACGCCTCGAAGCCGCGGCGACCCGCGCCGTCACCCTCGGGGCGATGCGTTACCGCAACGTCGCCTCGATCCTCAAGAGCGGGCTCGA
CCGCGCGCCGCTCCCCGCATCCACTGCGCAGCAAAGCGAGCTCGCGCTACCGGCCGCTCACGAGAACCTGCGCGGGGCGCGCTACTACCACTGATTCCAC
CCACCGGAGAACATCGATGCTGATACAACACAGCCTGCAACAACTGCGCACCCTGCGCCTGGAAGGCATGGCGCGCGCCTTCGAGGAGCAGCTCACCCAG
CCCGCCATCACCGCGCTCAGCTTCGAGGAGCGCTTCGCCCAGCTCATCGATCGCGAGATCCTGCTGCGCGACGGCAAACGCATCGATCGGCTGCTCAAGG
CCGCCCGCATCAAAGCCGCTGCCGCCTGCCTGGAGGACGTCGACTACCGCGCCGGGCGCGGCCTCGAGCGCAGTCAGATCGCCGCGCTCGGAACCGGCCA
GTGGATTCGCCACCACCAGAACTGCCTCATCACGGGCCCCACCGGCAGCGGCAAGACGTGGCTCGCCTGCGCGCTCGCCAACGCGGCGTGCCGGCAGGGC
CTCGCGGCCTACTACGTGCGTCTGCCGCGGCTCTTCGAGGAGCTGCGCATCGCGCATGCCGATGGCAGCTTCAGCCGACGCCTCATGCAGCTCGCGCGCA
TGGATCTCATCGTCATCGACGATTGGGGCCTGGCCGCGCCGTCGGCGCAGGAGCGAAGCGACCTGCTGGAGCTGCTCGACGACCGCGTCGGCACGCGTTC
GACCGTGATCACCAGCCAGCTGCCGATCGAGCACTGGCACACCTACCTGGGCGACCCGACCTTCGCCGATGCGATCCTCGATCGCGTCGTGCACGCCGCG
CACAAGCTCGCCCTCAAGGGCGAGTCGATGAGAAGAAAGGAAAAGGCATGAGGCCGTGCGCGCGAGACGCCTTACCCACAGGCGCCCGCCCGCCAGCGTA
TGAGGCGCGCTGGCGGCCGCCTGTGGATAAGCCTGCGCCGTGCCCCGAATTGACCGTGATCGTGACCGACACGGTTAGAATCTGAACACCACGCTTCGCG
CGCAGACCCCCCGGTCACGTTCGTCGGTGCAGGCGGTCACGTTCGTCGGAATGCGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1560 bp | 519 aa | 235 | 1794 | + | No |
Chemistry : DDE
ORF sequence :
MPAERIAMHKIRELLRLKYDCALSHERIARALSISKGVVAKYVKAAEECGRPWAELSAADEAELRRVLGVARRGRGASVANVPPDLAAVHQGLKRKNVTL
ALLWEEYVQTADGPSYQYSRFCDLYRAFARTLKRSMRQVHRAGEKLFIDYAGDTVPIVDADTGEISRAQIFVAVLGASSYTFACATATQSQADWLGSLAR
ALAFIGGVPELVVPDNTRSLVGQADRYEPQLQRTTAEFAAHYGVAILPARPYKPQDKAKVEVGVQIVQRWILARLRHRRFFSLGELNEAIAALLEPLNTR
AFRRLPGSRHEAFETLDRPALRPLPATAFQFAQWKRAKPNIDYHVEFDGHYYSVPYALAGQPVELRITASSIECFAAGRRVAVHARSPRYGAFTTLTEHM
PASHQAHRQWSPGKLIAWGATVGPHTEQVVSHQLERMPHPEQGYRACLGLMRLGRQYGNERLEAAATRAVTLGAMRYRNVASILKSGLDRAPLPASTAQQ
SELALPAAHENLRGARYYH
ALLWEEYVQTADGPSYQYSRFCDLYRAFARTLKRSMRQVHRAGEKLFIDYAGDTVPIVDADTGEISRAQIFVAVLGASSYTFACATATQSQADWLGSLAR
ALAFIGGVPELVVPDNTRSLVGQADRYEPQLQRTTAEFAAHYGVAILPARPYKPQDKAKVEVGVQIVQRWILARLRHRRFFSLGELNEAIAALLEPLNTR
AFRRLPGSRHEAFETLDRPALRPLPATAFQFAQWKRAKPNIDYHVEFDGHYYSVPYALAGQPVELRITASSIECFAAGRRVAVHARSPRYGAFTTLTEHM
PASHQAHRQWSPGKLIAWGATVGPHTEQVVSHQLERMPHPEQGYRACLGLMRLGRQYGNERLEAAATRAVTLGAMRYRNVASILKSGLDRAPLPASTAQQ
SELALPAAHENLRGARYYH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
735 bp | 244 aa | 1817 | 2551 | + | No |
AG : IS21 helper
ORF sequence :
MLIQHSLQQLRTLRLEGMARAFEEQLTQPAITALSFEERFAQLIDREILLRDGKRIDRLLKAARIKAAAACLEDVDYRAGRGLERSQIAALGTGQWIRHH
QNCLITGPTGSGKTWLACALANAACRQGLAAYYVRLPRLFEELRIAHADGSFSRRLMQLARMDLIVIDDWGLAAPSAQERSDLLELLDDRVGTRSTVITS
QLPIEHWHTYLGDPTFADAILDRVVHAAHKLALKGESMRRKEKA
QNCLITGPTGSGKTWLACALANAACRQGLAAYYVRLPRLFEELRIAHADGSFSRRLMQLARMDLIVIDDWGLAAPSAQERSDLLELLDDRVGTRSTVITS
QLPIEHWHTYLGDPTFADAILDRVVHAAHKLALKGESMRRKEKA
Blast result :
Comments
ISAzo4 is 70% (ORF1) and 80% (ORF2) aa simialr to IS408. There are 16 copies in Azoarcus EbN1 genome.
References
1] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36