ISKpn19
- Family ISKra4
- Group ISAzba1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_010886 | ND | Klebsiella pneumoniae | Escherichia coli 078 plasmid pT078 Klebsiella pneumoniae K1HV plasmid pK1HV Klebsiella pneumoniae K18A plasmid pK18An Klebsiella pneumoniae NK245 plasmid pK245 Escherichia coli E66An plasmid pE66An Escherichia coli KT58A plasmid pKT58A |
DNA section
IS Length : 2851 bp
Ends
IR Length : 22/26
IRL : GAGAGCATGAAAAACCTCGGGGCAGCCATCGAATGATGCCATCTACAGGG
IRR : GAGAGCATAAAAAATATTGGGGCAGCGTGAGCTATGCTGCCATCGCTGGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAGACGACAATTTG | GGTCGTATGGAAGCT | 0 | |
TAGCGGAAAGTTCTTTTAG | GGTCGTATGGAAGCTGGAC | 0 | |
TTGCCCCGAGGTCATCAAC | CATATAGGCAGCGCTAAGG | 0 | |
CCTCAGACCCCATATAGGC | GGTCGTATGGAAGCTGGAC | 0 | |
CCGCTGCCGGAGCGCAACG | TAAAATCCTGATGCTGATG | 0 | |
GAAACTGATCTCAGTATCG | GGTCGTATGGAAGCTGGAC | 0 | |
CCTTGCGCAGATGAGATAG | GGTCGTATGGAAGCTGGAC | 0 |
DNA sequence
GAGAGCATGAAAAACCTCGGGGCAGCCATCGAATGATGCCATCTACAGGGGTTAAAATAACCACCATCATTTCATAACAATACCTGCTGACTATGAAGAT
CTACGTCATCAAAATAGCAGTTCATGGAGTCAGTCCGATGGTCTGGCGTCGGCTGAGAATTGCTGCCGACACGTCACTGGCCGCGCTTCACTTCATTTTC
CAGATAGTGCAGGGCTGGGGGGACGACCATCTCCATCAATTTCACATTTATGGCAAAGATTACGGTATCTCCTACGAAGGAGGCATCGGTTTTGTCGATA
ATCCGTTCCGGGTCGTGATTGATGATTTTGCTTTTGATGCTGGCGATCGCTTTACCTATGAATACAATTTCTTTGAGCACTGGCTTCATGACATTCGTGT
TGAGGCCATTTATGAAAACTCTACGCTGAAAGCGCCGTTTTGTATAAGCGGCCATGGTATGCCCGGAGCCACAGCTGCGGATGAGTTCGATAAAACCCTG
GCGTTTCTTGAAGCTATCGTTAATGCGGATGATGAAACAACGGTCGGCGAGATCCGCCCTTTTGCTGACGACCTGGATGCTGTCAGGTTCAACCGCCACA
AAATCAACCGGCAACTGAGCAGGCTCGACCTTGCATCTCCGGTACTGGAGCCTGAAGTTATCTGGCTGGGCCGTCGTCGCTGACTGATGTCTATAAAGGT
ATACGTTTTCAGCATGCCTCTCAGATCCCCGAAAAAATGTCTATAAACCTCCCTAAATCAGTGTTTATGGATTATTCTTTAAATCTCTATGATGTTTAAA
GACCTTCCGGAGGTTTTGCGTGCAATATGCCTACATCAGGGTAAGCTCGGCCGATCAGAATACGGCCCGTCAGGAAGAGGCATTGTCAAAGGCGGGTTTT
CAGCCTGATAAAATTTGTGTTGAGCATGCCAGCGCGAAGGACACAAACCGCCCGGGGTTACAGGAACTCCTGGGGCAATTACGCCCTGGTGATACGCTTC
TGGTTCATTCCATCGATCGCCTTTGCCGGAATATGTCGGATATGTGCGCTGTCACCACCCGGCTTCGTGACCAGGGCGTTACCCTTATTTTTCTGAAAGA
GCAGCTGACGTTCAGCGCCGGCACAAACAATCCGATGCAGGAACTGCAGCTGCACATGATGTCGGCATTCAGCCAGTTTGAACGAGCCTTACTGAAGGAA
AGGCAGGCCGACGGCATCGCCGCGAAAAAAGCACGGGGTGAAAAAACAGGGCGCCCCGGCGCCGATATCAAAAAGATCCAGGAAATTGACGCACTCAGAA
GCAGAGGGGTCAGGCTCAGGATCGCCTGCGACCATGCGGGCCTTGGCGTTTCCACATATTATAAGCTTCGTCACCAGATGAACAACCAGTAGCCGGGAGA
TAGTCATCCACCACAGCAGGAGGCAATGATGCAACTGACGCTCCAGATAGTCATCACCGATGAATCAGGCTCCAGTCGAACGGAGGAACTGATGACAATC
CAGAAATCAGGGGAGACCCGGAACGACATCGGATTATCGGTGTCAGAATCCAAGCTGTTGCTGAATACGGTCCAGCAGTCGGTGGTCCAGCTGCAGGCAG
ACGAATATACTCAACACCATATCCGGTGCCCTCACTGCCTTGCTGCGCGCAGAATCAAAGGCAAACAGAAAATACGGTACCGGACGCTGTTTGGCGTTAT
TCCGGTATCCGGGCTTCGGGTGTACCGGTGCCGTTGTGAAGAGAGTGATACAAAGACAGTCAGTTTGCTCAGCGACTGGGCCGGCGATTATTCTCACCCG
GCACTGAAATATATTGAAACCCGCTGGGCCTCGATGATCTCCTATGAAATGACGACTCGTCTGCTGAAAGATATTCTGCCGGTGGGCCACAGCCTGAATG
CCTCAACGGTGAGGAATCATTTATGCCAGGTGGCGCAGCGTCTTGACGCTGAGGCTGAAGCTCATTCAGGTTTTCTTTCCGGCTGCCCCCGCGACTGGGG
GAACCTGCCCAGGCCAGGAAAACCGCTTGTGGTGGGTATAGATGGTGGTTATGTTCGCGATCGGGATGACAAAAAGCGTAATTTTGAAATTATTGCCGGG
AAGTCTTTTTCCGTTGGTGCGCCTGCTGACACCCGCCGCTTCGGTTTTGTTCAGAAGGATGACTGTCATCCGGAACGCAGGCTCATGACCCACCTTTCAG
CGCAGGGAATGCAGGCCAACCAGCAGATATTTTTTCTGTCCGACGGCGCGGATAATCTCAGGGACCTCCAGTTCGGTATGTACCCTGAGTCAACGCATGT
GCTGGACTGGTTTCATATCACCATGAGGCTGAAGGTGCTCATGCAATATGCCCGGGGACTGCTGGTATCAGATCCAGAGGCGGGGAGTAAAGTTCTGGCA
CTGCTGGAAAGCATTAAACGATATCTCTGGCATGGCAATGTTGTCGCTGCGCTGGAACATATTGATAATTGCGTTATGTATTGCGATGACCCTGAACTCA
GCTATCCTAGCCTGAAATCCCTGCAAAAACATCTGGATGAAATGTATACCTACATCCGGAATAATAAGATGATGATCCCAAACTACGGCGAAATGCGTCG
GTACGGAGAGCCGGTATCAACCGCGTTTGTGGAATCCACGATCAATGAAGTGATCGCCAGACGAATGGCCAAAAAGCAGCAGATGCAGTGGAGCAGAAAA
GGGGCTCATTATTTGTTACAGACCCGGACCGCCGTTTTGAATAACGAACTGCAGGATAAATTTGTCTGCTGGTATCCGGGTTTTCAAAGTGACGGGAAGG
GACCAGCGATGGCAGCATAGCTCACGCTGCCCCAATATTTTTTATGCTCTC
CTACGTCATCAAAATAGCAGTTCATGGAGTCAGTCCGATGGTCTGGCGTCGGCTGAGAATTGCTGCCGACACGTCACTGGCCGCGCTTCACTTCATTTTC
CAGATAGTGCAGGGCTGGGGGGACGACCATCTCCATCAATTTCACATTTATGGCAAAGATTACGGTATCTCCTACGAAGGAGGCATCGGTTTTGTCGATA
ATCCGTTCCGGGTCGTGATTGATGATTTTGCTTTTGATGCTGGCGATCGCTTTACCTATGAATACAATTTCTTTGAGCACTGGCTTCATGACATTCGTGT
TGAGGCCATTTATGAAAACTCTACGCTGAAAGCGCCGTTTTGTATAAGCGGCCATGGTATGCCCGGAGCCACAGCTGCGGATGAGTTCGATAAAACCCTG
GCGTTTCTTGAAGCTATCGTTAATGCGGATGATGAAACAACGGTCGGCGAGATCCGCCCTTTTGCTGACGACCTGGATGCTGTCAGGTTCAACCGCCACA
AAATCAACCGGCAACTGAGCAGGCTCGACCTTGCATCTCCGGTACTGGAGCCTGAAGTTATCTGGCTGGGCCGTCGTCGCTGACTGATGTCTATAAAGGT
ATACGTTTTCAGCATGCCTCTCAGATCCCCGAAAAAATGTCTATAAACCTCCCTAAATCAGTGTTTATGGATTATTCTTTAAATCTCTATGATGTTTAAA
GACCTTCCGGAGGTTTTGCGTGCAATATGCCTACATCAGGGTAAGCTCGGCCGATCAGAATACGGCCCGTCAGGAAGAGGCATTGTCAAAGGCGGGTTTT
CAGCCTGATAAAATTTGTGTTGAGCATGCCAGCGCGAAGGACACAAACCGCCCGGGGTTACAGGAACTCCTGGGGCAATTACGCCCTGGTGATACGCTTC
TGGTTCATTCCATCGATCGCCTTTGCCGGAATATGTCGGATATGTGCGCTGTCACCACCCGGCTTCGTGACCAGGGCGTTACCCTTATTTTTCTGAAAGA
GCAGCTGACGTTCAGCGCCGGCACAAACAATCCGATGCAGGAACTGCAGCTGCACATGATGTCGGCATTCAGCCAGTTTGAACGAGCCTTACTGAAGGAA
AGGCAGGCCGACGGCATCGCCGCGAAAAAAGCACGGGGTGAAAAAACAGGGCGCCCCGGCGCCGATATCAAAAAGATCCAGGAAATTGACGCACTCAGAA
GCAGAGGGGTCAGGCTCAGGATCGCCTGCGACCATGCGGGCCTTGGCGTTTCCACATATTATAAGCTTCGTCACCAGATGAACAACCAGTAGCCGGGAGA
TAGTCATCCACCACAGCAGGAGGCAATGATGCAACTGACGCTCCAGATAGTCATCACCGATGAATCAGGCTCCAGTCGAACGGAGGAACTGATGACAATC
CAGAAATCAGGGGAGACCCGGAACGACATCGGATTATCGGTGTCAGAATCCAAGCTGTTGCTGAATACGGTCCAGCAGTCGGTGGTCCAGCTGCAGGCAG
ACGAATATACTCAACACCATATCCGGTGCCCTCACTGCCTTGCTGCGCGCAGAATCAAAGGCAAACAGAAAATACGGTACCGGACGCTGTTTGGCGTTAT
TCCGGTATCCGGGCTTCGGGTGTACCGGTGCCGTTGTGAAGAGAGTGATACAAAGACAGTCAGTTTGCTCAGCGACTGGGCCGGCGATTATTCTCACCCG
GCACTGAAATATATTGAAACCCGCTGGGCCTCGATGATCTCCTATGAAATGACGACTCGTCTGCTGAAAGATATTCTGCCGGTGGGCCACAGCCTGAATG
CCTCAACGGTGAGGAATCATTTATGCCAGGTGGCGCAGCGTCTTGACGCTGAGGCTGAAGCTCATTCAGGTTTTCTTTCCGGCTGCCCCCGCGACTGGGG
GAACCTGCCCAGGCCAGGAAAACCGCTTGTGGTGGGTATAGATGGTGGTTATGTTCGCGATCGGGATGACAAAAAGCGTAATTTTGAAATTATTGCCGGG
AAGTCTTTTTCCGTTGGTGCGCCTGCTGACACCCGCCGCTTCGGTTTTGTTCAGAAGGATGACTGTCATCCGGAACGCAGGCTCATGACCCACCTTTCAG
CGCAGGGAATGCAGGCCAACCAGCAGATATTTTTTCTGTCCGACGGCGCGGATAATCTCAGGGACCTCCAGTTCGGTATGTACCCTGAGTCAACGCATGT
GCTGGACTGGTTTCATATCACCATGAGGCTGAAGGTGCTCATGCAATATGCCCGGGGACTGCTGGTATCAGATCCAGAGGCGGGGAGTAAAGTTCTGGCA
CTGCTGGAAAGCATTAAACGATATCTCTGGCATGGCAATGTTGTCGCTGCGCTGGAACATATTGATAATTGCGTTATGTATTGCGATGACCCTGAACTCA
GCTATCCTAGCCTGAAATCCCTGCAAAAACATCTGGATGAAATGTATACCTACATCCGGAATAATAAGATGATGATCCCAAACTACGGCGAAATGCGTCG
GTACGGAGAGCCGGTATCAACCGCGTTTGTGGAATCCACGATCAATGAAGTGATCGCCAGACGAATGGCCAAAAAGCAGCAGATGCAGTGGAGCAGAAAA
GGGGCTCATTATTTGTTACAGACCCGGACCGCCGTTTTGAATAACGAACTGCAGGATAAATTTGTCTGCTGGTATCCGGGTTTTCAAAGTGACGGGAAGG
GACCAGCGATGGCAGCATAGCTCACGCTGCCCCAATATTTTTTATGCTCTC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
591 bp | 196 aa | 93 | 683 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MKIYVIKIAVHGVSPMVWRRLRIAADTSLAALHFIFQIVQGWGDDHLHQFHIYGKDYGISYEGGIGFVDNPFRVVIDDFAFDAGDRFTYEYNFFEHWLHD
IRVEAIYENSTLKAPFCISGHGMPGATAADEFDKTLAFLEAIVNADDETTVGEIRPFADDLDAVRFNRHKINRQLSRLDLASPVLEPEVIWLGRRR
IRVEAIYENSTLKAPFCISGHGMPGATAADEFDKTLAFLEAIVNADDETTVGEIRPFADDLDAVRFNRHKINRQLSRLDLASPVLEPEVIWLGRRR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
573 bp | 190 aa | 820 | 1392 | + | No |
Annotation : Serine recombinaseDescription :
ORF sequence :
MQYAYIRVSSADQNTARQEEALSKAGFQPDKICVEHASAKDTNRPGLQELLGQLRPGDTLLVHSIDRLCRNMSDMCAVTTRLRDQGVTLIFLKEQLTFSA
GTNNPMQELQLHMMSAFSQFERALLKERQADGIAAKKARGEKTGRPGADIKKIQEIDALRSRGVRLRIACDHAGLGVSTYYKLRHQMNNQ
GTNNPMQELQLHMMSAFSQFERALLKERQADGIAAKKARGEKTGRPGADIKKIQEIDALRSRGVRLRIACDHAGLGVSTYYKLRHQMNNQ
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1398 bp | 463 aa | 1423 | 2820 | + | No |
Chemistry : DDE
ORF sequence :
MQLTLQIVITDESGSSRTEELMTIQKSGETRNDIGLSVSESKLLLNTVQQSVVQLQADEYTQHHIRCPHCLAARRIKGKQKIRYRTLFGVIPVSGLRVYR
CRCEESDTKTVSLLSDWAGDYSHPALKYIETRWASMISYEMTTRLLKDILPVGHSLNASTVRNHLCQVAQRLDAEAEAHSGFLSGCPRDWGNLPRPGKPL
VVGIDGGYVRDRDDKKRNFEIIAGKSFSVGAPADTRRFGFVQKDDCHPERRLMTHLSAQGMQANQQIFFLSDGADNLRDLQFGMYPESTHVLDWFHITMR
LKVLMQYARGLLVSDPEAGSKVLALLESIKRYLWHGNVVAALEHIDNCVMYCDDPELSYPSLKSLQKHLDEMYTYIRNNKMMIPNYGEMRRYGEPVSTAF
VESTINEVIARRMAKKQQMQWSRKGAHYLLQTRTAVLNNELQDKFVCWYPGFQSDGKGPAMAA
CRCEESDTKTVSLLSDWAGDYSHPALKYIETRWASMISYEMTTRLLKDILPVGHSLNASTVRNHLCQVAQRLDAEAEAHSGFLSGCPRDWGNLPRPGKPL
VVGIDGGYVRDRDDKKRNFEIIAGKSFSVGAPADTRRFGFVQKDDCHPERRLMTHLSAQGMQANQQIFFLSDGADNLRDLQFGMYPESTHVLDWFHITMR
LKVLMQYARGLLVSDPEAGSKVLALLESIKRYLWHGNVVAALEHIDNCVMYCDDPELSYPSLKSLQKHLDEMYTYIRNNKMMIPNYGEMRRYGEPVSTAF
VESTINEVIARRMAKKQQMQWSRKGAHYLLQTRTAVLNNELQDKFVCWYPGFQSDGKGPAMAA
Blast result :
Comments
The two first ORFs are passenger genes annotated as hypothetical protein (ORFA) and putative serine recombinase protein (ORFB).
There are probably a recombination event between the two insertion sites of the two copies in Escherichia coli E66An plasmid pE66An (generating a DR of 9 bp : CATATAGGC).
There are probably a recombination event between the two insertion sites of the two copies in Escherichia coli E66An plasmid pE66An (generating a DR of 9 bp : CATATAGGC).
References
1] Guérillot R, Glaser P, (2013) Diversity of bacterial prokaryotic mutator like transposase, association with conjugation conjugative elements and insertion specificity (on preparation).
2] Le,T.M.V., Nguyen,T.K.N., Campbell,J.I., Schultsz,C., Cerdeno-Tarraga,A.M., Thomson,N.R., Farrar,J.J. and Baker,S. (2012) Direct submission GenBank.
2] Le,T.M.V., Nguyen,T.K.N., Campbell,J.I., Schultsz,C., Cerdeno-Tarraga,A.M., Thomson,N.R., Farrar,J.J. and Baker,S. (2012) Direct submission GenBank.