ISKpn33
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP002474 | ND | Klebsiella pneumoniae | Klebsiella pneumoniae U-0608239 plasmid pUUH239.2 |
DNA section
IS Length : 2526 bp
Ends
IR Length : 16/21
IRL : GTAAGCGTCAAGCCACTGCCGCCTGTGCTTCTGGTCCCGGGATCGCTAGC
IRR : GTAAGCGTTAAGTGAGCGCCGTATTGACGGTTATTTATCGGTAAGAACCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTTTCATGTGTA | TGGAAGTC | CGCTTTATCACA | 8 |
DNA sequence
GTAAGCGTCAAGCCACTGCCGCCTGTGCTTCTGGTCCCGGGATCGCTAGCTTAGAGCTCCGTCTAATTTAGAAGGAGCTCTGTTTATGGAAAATGCTGCC
AACTGGCGAACTGAATCGCGTACCGTCTATTCCAATGACTTCAAACTTCGGATGGTCGAACTGGCTTCACGACCAGATGCCAACGTCGCGCAACTGGCGC
GGGAACATGGCGTTGATAATAATCTCATTTTTAAGTGGCTACGCCTCTGGCAGAGAGAGGGGCGAATCTCTCGTCGAATGCCTGCAACTATCGTGGGGCC
GGTGGTACCACAACCTCTTCCGGTTTCCCCGACTCTGTTGTCCGTCGACGTAATCAACGACCCGCAGCCCATTGCAGAGAATGACACTCTGTGTACGTTC
TCCTCCACTCACTCCAGCGCGACTTCCTGTCATGTTGAGTTCCGCCACGGCAAAATGACGCTGGAAAACCCGTCGTCAGAGTTGCTGGCCGTGCTGATCC
GCGAACTGACCGGGAGAACACAATGATATCCCTCCCGTCAGGCACCCGCATCTGGCTGGTCGCTGGTGCCACGGATATGCGTAAATCCTTCAACGGGCTG
GGCGAACAGATACAACACGTTCTGGATGAGGCTCCCTTCTCCGGCCACCTGTTCATCTTCCGGGGACGCCGTGGCGATACCGTGAAAATACTCTGGGCTG
ATGCTGACGGTCTGTGCCTGTTTATCAAACGTCTGGAAGAAGGCCAGTTCGTCTGGCCAACAGTCCGTGATGGCAAAATCGCGATCACCCGCTCACAACT
TGCCATGCTTCTCGATAAGCTGGACTGGCGTCAGCCAAAAACAGCACGCCTTAACTCACTGACAATGTTGTAAAAAGAGCATGATCGCATTATAAATGGG
GTCATGAGTCAGGACTATCCCGCCCGTATCGCTGCGCTGGAAGACGCGCTTCGCCAGAAAGACAGTCAGCTCAGTCTCATTGCAGAGACTGAGTCGTTCC
TGCGTTCGGCGCTGTCCCGCGCTGAAGAGAAAATAGAGAACGAAGAGCGCGAAATAGAGCATCTGCGGGCATAGATAGAAAAACTGCGGCGGATGTTGTT
CGGTACCCGTTCTGAAAAGCTGCGCCGGCAGGTTGAAGAAGCAGAGGCCCTGCTGAAGTTGCAGGAGCAGCAAAGCGATCGTTACAACGGTCGGGATAAC
GATCAACAGGTACCCCGCCAGCTGCGCCAGTCCCGTCACCGTCGTCCTCTCCCGGAACATCTTCCCCGCGAGATCCATCGCCTGGAGCCTGTTGAACGCT
GCTGCCCGGACTGCGGCAGTGATATGAAATACCTCAGTGAAGTCAGCGCGGAACAGCTGGAACTGGTCTCCAGCGCCCTGAAAGTGATCCGCACGGTCAG
GGTGAAAAAGGCCTGTACCCGATGCGACTGCATCGTTGAAGCTCCTGCGCCATCACGCCCCATCGACCGGGGTATCGCCGGGCCGGGTCTGCTGGCCCGC
GTGTTAACGGCCAAATACTGTGAGCACACCCCGTTATACCGGCAGACAGAAATCCTTGCGCGTCAGGGCGTGGAGCTGAGCCGGGCACTGCTCTCTAACT
GGGTGGATGCCTGTTGCCGGTTAATGGCACCGCTGGATGATGCCCTTTATCACTACGTGATGGACTGCCGCAAACTGCATACGGATGATACCCCGGTGCC
AGTTCTGGCACCGGGCAGAAAGAAGACGAAAACCGGGCGCATCTGGACGTATGTTCGCGATGACCGGAATGTCGGCTCGTCAGACCCGCCCGCAGCATGG
TTCGCCTTCTCGCCGGACCGGCAGGGGAAGCATCCTCAGCAACATCTTCGCTATTATCATGGTGTGCTGCAGGCGGATGCCTTCGCAGGCTACGACCGGT
TGTTCAGCGCAGAACGTGACGGCGGCCCGTTAACCGAAGCGGCCTGCTGGGCCCACGCGCGCCGAAAAATCCACGACGTCTATATAAGCACTCATACAGC
AACAGCAGAGGAAGCCCTGAAACGTATCGGCGAGCTGTACGCAATAGAAGAAGCAATACGGGGCCTCCCTGCAACTGAGCGGCTGGCAGCCAGGCAGTCC
CGAAGTAAGCCGCTGCTGATATCCCTGCATGACTGGTTGGTGGAGAAAAGCGCCACTCTGTCGAAAAAATCCCGTCTGGGCGAAGCGTTCGCTTATGCCC
TGAATCAGTGGGATGCGCTGTGCTACTACTGTGATGATGGCCTGGCAGAGCCTGATAACAACGCAGCCGAACGAGCTCTTCGTGCCGTCTGTCTTGGGAA
AAAGAATTTTATCTTCTTCGGCAGCGACCACGGCGGTGAGCGCGGAGCCCTGCTGTATGGTCTTATCGGAACATGCAGGCTCAATGGTATCGATCCGGAA
GCCTACCTTTGCCATATCCTGAGCCTACTGCCGGAATGGCCCAGCAACAAAGTGGCGGAACTGCTGCCATGGAACGTGGTTCTTACCGATAAATAACCGT
CAATACGGCGCTCACTTAACGCTTAC
AACTGGCGAACTGAATCGCGTACCGTCTATTCCAATGACTTCAAACTTCGGATGGTCGAACTGGCTTCACGACCAGATGCCAACGTCGCGCAACTGGCGC
GGGAACATGGCGTTGATAATAATCTCATTTTTAAGTGGCTACGCCTCTGGCAGAGAGAGGGGCGAATCTCTCGTCGAATGCCTGCAACTATCGTGGGGCC
GGTGGTACCACAACCTCTTCCGGTTTCCCCGACTCTGTTGTCCGTCGACGTAATCAACGACCCGCAGCCCATTGCAGAGAATGACACTCTGTGTACGTTC
TCCTCCACTCACTCCAGCGCGACTTCCTGTCATGTTGAGTTCCGCCACGGCAAAATGACGCTGGAAAACCCGTCGTCAGAGTTGCTGGCCGTGCTGATCC
GCGAACTGACCGGGAGAACACAATGATATCCCTCCCGTCAGGCACCCGCATCTGGCTGGTCGCTGGTGCCACGGATATGCGTAAATCCTTCAACGGGCTG
GGCGAACAGATACAACACGTTCTGGATGAGGCTCCCTTCTCCGGCCACCTGTTCATCTTCCGGGGACGCCGTGGCGATACCGTGAAAATACTCTGGGCTG
ATGCTGACGGTCTGTGCCTGTTTATCAAACGTCTGGAAGAAGGCCAGTTCGTCTGGCCAACAGTCCGTGATGGCAAAATCGCGATCACCCGCTCACAACT
TGCCATGCTTCTCGATAAGCTGGACTGGCGTCAGCCAAAAACAGCACGCCTTAACTCACTGACAATGTTGTAAAAAGAGCATGATCGCATTATAAATGGG
GTCATGAGTCAGGACTATCCCGCCCGTATCGCTGCGCTGGAAGACGCGCTTCGCCAGAAAGACAGTCAGCTCAGTCTCATTGCAGAGACTGAGTCGTTCC
TGCGTTCGGCGCTGTCCCGCGCTGAAGAGAAAATAGAGAACGAAGAGCGCGAAATAGAGCATCTGCGGGCATAGATAGAAAAACTGCGGCGGATGTTGTT
CGGTACCCGTTCTGAAAAGCTGCGCCGGCAGGTTGAAGAAGCAGAGGCCCTGCTGAAGTTGCAGGAGCAGCAAAGCGATCGTTACAACGGTCGGGATAAC
GATCAACAGGTACCCCGCCAGCTGCGCCAGTCCCGTCACCGTCGTCCTCTCCCGGAACATCTTCCCCGCGAGATCCATCGCCTGGAGCCTGTTGAACGCT
GCTGCCCGGACTGCGGCAGTGATATGAAATACCTCAGTGAAGTCAGCGCGGAACAGCTGGAACTGGTCTCCAGCGCCCTGAAAGTGATCCGCACGGTCAG
GGTGAAAAAGGCCTGTACCCGATGCGACTGCATCGTTGAAGCTCCTGCGCCATCACGCCCCATCGACCGGGGTATCGCCGGGCCGGGTCTGCTGGCCCGC
GTGTTAACGGCCAAATACTGTGAGCACACCCCGTTATACCGGCAGACAGAAATCCTTGCGCGTCAGGGCGTGGAGCTGAGCCGGGCACTGCTCTCTAACT
GGGTGGATGCCTGTTGCCGGTTAATGGCACCGCTGGATGATGCCCTTTATCACTACGTGATGGACTGCCGCAAACTGCATACGGATGATACCCCGGTGCC
AGTTCTGGCACCGGGCAGAAAGAAGACGAAAACCGGGCGCATCTGGACGTATGTTCGCGATGACCGGAATGTCGGCTCGTCAGACCCGCCCGCAGCATGG
TTCGCCTTCTCGCCGGACCGGCAGGGGAAGCATCCTCAGCAACATCTTCGCTATTATCATGGTGTGCTGCAGGCGGATGCCTTCGCAGGCTACGACCGGT
TGTTCAGCGCAGAACGTGACGGCGGCCCGTTAACCGAAGCGGCCTGCTGGGCCCACGCGCGCCGAAAAATCCACGACGTCTATATAAGCACTCATACAGC
AACAGCAGAGGAAGCCCTGAAACGTATCGGCGAGCTGTACGCAATAGAAGAAGCAATACGGGGCCTCCCTGCAACTGAGCGGCTGGCAGCCAGGCAGTCC
CGAAGTAAGCCGCTGCTGATATCCCTGCATGACTGGTTGGTGGAGAAAAGCGCCACTCTGTCGAAAAAATCCCGTCTGGGCGAAGCGTTCGCTTATGCCC
TGAATCAGTGGGATGCGCTGTGCTACTACTGTGATGATGGCCTGGCAGAGCCTGATAACAACGCAGCCGAACGAGCTCTTCGTGCCGTCTGTCTTGGGAA
AAAGAATTTTATCTTCTTCGGCAGCGACCACGGCGGTGAGCGCGGAGCCCTGCTGTATGGTCTTATCGGAACATGCAGGCTCAATGGTATCGATCCGGAA
GCCTACCTTTGCCATATCCTGAGCCTACTGCCGGAATGGCCCAGCAACAAAGTGGCGGAACTGCTGCCATGGAACGTGGTTCTTACCGATAAATAACCGT
CAATACGGCGCTCACTTAACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
441 bp | 146 aa | 86 | 526 | + | No |
AG : IS66 TnpA
ORF sequence :
MENAANWRTESRTVYSNDFKLRMVELASRPDANVAQLAREHGVDNNLIFKWLRLWQREGRISRRMPATIVGPVVPQPLPVSPTLLSVDVINDPQPIAEND
TLCTFSSTHSSATSCHVEFRHGKMTLENPSSELLAVLIRELTGRTQ
TLCTFSSTHSSATSCHVEFRHGKMTLENPSSELLAVLIRELTGRTQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
351 bp | 116 aa | 523 | 873 | + | No |
AG : IS66 TnpB
ORF sequence :
MISLPSGTRIWLVAGATDMRKSFNGLGEQIQHVLDEAPFSGHLFIFRGRRGDTVKILWADADGLCLFIKRLEEGQFVWPTVRDGKIAITRSQLAMLLDKL
DWRQPKTARLNSLTML
DWRQPKTARLNSLTML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1593 bp | 530 aa | 904 | 2496 | + | No |
Chemistry : DDE
ORF sequence :
MSQDYPARIAALEDALRQKDSQLSLIAETESFLRSALSRAEEKIENEEREIEHLRA*IEKLRRMLFGTRSEKLRRQVEEAEALLKLQEQQSDRYNGRDND
QQVPRQLRQSRHRRPLPEHLPREIHRLEPVERCCPDCGSDMKYLSEVSAEQLELVSSALKVIRTVRVKKACTRCDCIVEAPAPSRPIDRGIAGPGLLARV
LTAKYCEHTPLYRQTEILARQGVELSRALLSNWVDACCRLMAPLDDALYHYVMDCRKLHTDDTPVPVLAPGRKKTKTGRIWTYVRDDRNVGSSDPPAAWF
AFSPDRQGKHPQQHLRYYHGVLQADAFAGYDRLFSAERDGGPLTEAACWAHARRKIHDVYISTHTATAEEALKRIGELYAIEEAIRGLPATERLAARQSR
SKPLLISLHDWLVEKSATLSKKSRLGEAFAYALNQWDALCYYCDDGLAEPDNNAAERALRAVCLGKKNFIFFGSDHGGERGALLYGLIGTCRLNGIDPEA
YLCHILSLLPEWPSNKVAELLPWNVVLTDK
QQVPRQLRQSRHRRPLPEHLPREIHRLEPVERCCPDCGSDMKYLSEVSAEQLELVSSALKVIRTVRVKKACTRCDCIVEAPAPSRPIDRGIAGPGLLARV
LTAKYCEHTPLYRQTEILARQGVELSRALLSNWVDACCRLMAPLDDALYHYVMDCRKLHTDDTPVPVLAPGRKKTKTGRIWTYVRDDRNVGSSDPPAAWF
AFSPDRQGKHPQQHLRYYHGVLQADAFAGYDRLFSAERDGGPLTEAACWAHARRKIHDVYISTHTATAEEALKRIGELYAIEEAIRGLPATERLAARQSR
SKPLLISLHDWLVEKSATLSKKSRLGEAFAYALNQWDALCYYCDDGLAEPDNNAAERALRAVCLGKKNFIFFGSDHGGERGALLYGLIGTCRLNGIDPEA
YLCHILSLLPEWPSNKVAELLPWNVVLTDK
Blast result :Comments : Internal stop codon due to SNP compared with transposase in ISKox3. 100% identical identical sequence in 11 other sequences in GenBank, suggesting that it is real - possible fusion protein?
Comments
ISKpn33 is 95% aa similar to ISEcl6.
References
1] Sally Partridge (2018) Direct submission.
2] Sandegren,L., Linkevicius,M., Lytsy,B., Melhus,A. and Andersson,D.I. (2012) J. Antimicrob. Chemother. 67 (1), 74-83.
2] Sandegren,L., Linkevicius,M., Lytsy,B., Melhus,A. and Andersson,D.I. (2012) J. Antimicrob. Chemother. 67 (1), 74-83.