ISKrh1
- Family IS481
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_010617 | ND | Kocuria rhizophila | Kocuria rhizophila DC2201 |
DNA section
IS Length : 2854 bp
Ends
IR Length : 25/28
IRL : TGTACTAACCGGGGACGTTGATCGAAGACTTACGCGAAGAGTTGCCGAGC
IRR : TGTACTGACCGGAGACGTTGATCGAGGATCGGCCGGTGACGCGAGATACT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCAGGCGGCGTCCC | CCTAGA | CGTTCCCGTCGGCA | 6 |
TTGCTGGGGGTTCG | GCTAGC | TGTAGGAGGGCGT | 6 |
DNA sequence
TGTACTAACCGGGGACGTTGATCGAAGACTTACGCGAAGAGTTGCCGAGCGGCGCGGCGGGCGAGATCGTCTGTTTGAAGATCGGGGGTAGCGCCTTCGG
CTGCAAGGGATGCAAGCCCGTGAACCAGGGACCAGGCCACGTGCGCTGCTTCCGAGGTTGTCAGGTGTGCCCTGCGTTCTGCGGGTATTGATTCGACGCC
TGCCATAAGCGCCTCCATCGAGGATGCCCGGGCTGCACTCAACTCACCGTCATCGGCGTGGAGCAGCTCCGGCCTGTGCATCACGGCGTAGTGGCCAGGG
TGCTCACGTGCGAACCTGACGTACGCGACTGCTGCCTCGTCGAAGCCACCAGCCGTGGCTGCGGCCACATCGGCGGCAAGCAGGCGCAGACCTTGCGCGG
CTAGCGCCGTGAGCAGGCCCCGTCGATCGCCGAAATGATGCGCGGGTGCGCCATGCGAGACCCCGGCCCGACGGGCAAGCTCGCGCAGCGACAGTGAACT
GGGTCCAACCTCGTCGATCATCTGCGCCGACTCGTCGAGGAGTGCTCGTCTTAGGTCGCCGTGGTGATATCCGTCGCTCATGACCACACCCTATGAGTTC
GATCTAGGCATCGACCAGATCAAAGGCTAGCCTGATCTAGTCATTGTCTAGATAGGGAGTTCCCATGGCACCTTTGCTCGTCCTACTCGTCTTCACAACT
CTTGCCCGAGCGGTGGGGGCGCTGGGCGTCGGCTACGTCGCCTCCTGGCCCGCCGCCACGGCTGTGGGGCTGGCTGCCATGTTCATCGTGACCGGCGCCA
GCCACTTCTTCCCTGCACGGCGGGCAGGGCTCATCGCCATCGTCCCACCGGCTCTCAAGCATCCCGCCGCGCTGGTCGCACTCACCGGCGTGCTCGAGCT
GCTGGGTGCCGTCGCCCTTCTCGTTCCTCCCGAGGCGGGACAGCTGCGCGTGGCAGCAGCGCTCAGTCTGGCCGTGCTGCTGTTCGTGATGTTCCCCGCG
AATGTCTACGCGTCACAAGCCCGTAGGTCTGAGCACTCACCCAACACGCCCCTGCCGCAGCGAACGGCGATGCAGTGCGTCTTCATCGCGGCGACACTCT
TCGTGGCTCTGGCTTCATGACACGAGGACATCACGCCAACGCCGCTCTCACGCCCCGACACCGACTCAAGGTCGCCCGTCTCGTCGTCGACGACGGCTGG
CCGATCAGTGAAGTCGCCGCGCGGTTCCAAGTGTCATGGCCGACCGTGAAGAGATGGGTCGACCGCTACCTGGTCGGCGAGTCCATGCAGGACCGGTCGT
CGCGCCCGAGGACTTCGCCGAACAAGACCCCGAAGTCGGTGACGAAGCGCTGCGTGAGCCTTCGAATGCGACTGCGGGAAGGGCCCGTTCAGCTCGCTTC
CCGACTCGACATCGCACCGTCCACCGTGCATCGCATCCTCACCACGGCGCGCCTGAACCGGTTGTCCTACGTGGACCGCGCCACCGGTGAGCCCGTCCGC
CGGTACGAGCACCCTCACCCCGGGTCGCTGGTTCACGTGGACGTGAAGAAGGTCGGGAACATCCCTGACGGTGGCGGTTGGCGGTACGTCGGCCGCCGCC
AAGGCGAGAAGAACCGCGCCGCCACGCCAGGCAAGCCGCGCAACCAGTACGGCGGCCCGAAACTCGGGTACGCGTTCGTCCACACCGTCATCGACGACCA
CTCCCGCGTCGCATACACCGAGGTCCACGACGACGAGACCGCCGTCACTGCCGTCGCCGTCCTTCGCCGGGCGGTGCAGTGGTTCGCCGGCCATGGCGTC
ACCATCGAGCGGGTGCTGTCGGACAACGGTGGGGCGTACCGCTCGCACCTGTGGCGCGACACCTGCCATGGCCTGTCGATCACGCCGAAACGGACCTGCC
CGTACCGCCCGCAGACCAATGGGAAGGTGGAGCGCTTCCACCGGACGATGGCCGACGGGTGGGCCTACGCCCGCTGCTACACCAGCGAGCAAGAACGCCG
CGACGCGCTCGAGCCGTGGCTGGGGCACTACAACGAAGTGCGACCGCATACCGCGTGCGGTAACCAGCCGCCACTCACCAAACTCAGAGCGGCTGGCTAC
TCTTCGGTGGCTTCGACGCCCACGCCGGGCGGCTCCTCGGTGAAGAACTCGGGGTCCGAGTAGGGAAGGATCGTCCAGTGGAGATCCTCCGCCGAGTCGA
TCCGGCTAGGAACGGAGGCGTCTTGTTGGCGGGTGTGGTCCTGGATCAGATGCATCTGAACTACCTTGGCCGCTCGACGAATCTGCTCCACGATCCCCCC
GGCGGGTATGTCCAGCGGCGAGGAGATCCGGTCGGCTGCACGGTCGACATAGAAGCCAGCCTGCTTGTCGAGGTTCCGCTGCCGCGCCAACCCATGGAAC
GTCTCCAAGTCCGGCAACTGGTAGTACTCGATCCGACGGTCTGGGTCCCAGAAGCCACCCAGACCTGACGCAAACTGCTCCGCCACCTGCAGCTTTTCTG
CATGTGGTGCGCGGGTGGTGCGTAGGCCGTCCGGAACCCGAACCAGGCCAGGTTCGTGAGGCTGCACCCCGTAGAGGCTCAAAGGTGCACTCCACTCCCA
CTCGGCGGCCTCGTAAAGCCAGCGTGCCTTCGCCACTTCCTCCATCGCGAGCACCAGGAGCGCCTGAGCCCGGCCGGCACTGCCCCGGTCCGCCAGGGCG
TGGGCATCCTCCACAAGCGCCACCGCGTTCGCCATCAGGGACTTCCACCATCGGCGCGCGAAGTCTGCGCTGACCTCTTCAACCTGCGGCTCGCCCATGC
AGCGAGTATCTCGCGTCACCGGCCGATCCTCGATCAACGTCTCCGGTCAGTACA
CTGCAAGGGATGCAAGCCCGTGAACCAGGGACCAGGCCACGTGCGCTGCTTCCGAGGTTGTCAGGTGTGCCCTGCGTTCTGCGGGTATTGATTCGACGCC
TGCCATAAGCGCCTCCATCGAGGATGCCCGGGCTGCACTCAACTCACCGTCATCGGCGTGGAGCAGCTCCGGCCTGTGCATCACGGCGTAGTGGCCAGGG
TGCTCACGTGCGAACCTGACGTACGCGACTGCTGCCTCGTCGAAGCCACCAGCCGTGGCTGCGGCCACATCGGCGGCAAGCAGGCGCAGACCTTGCGCGG
CTAGCGCCGTGAGCAGGCCCCGTCGATCGCCGAAATGATGCGCGGGTGCGCCATGCGAGACCCCGGCCCGACGGGCAAGCTCGCGCAGCGACAGTGAACT
GGGTCCAACCTCGTCGATCATCTGCGCCGACTCGTCGAGGAGTGCTCGTCTTAGGTCGCCGTGGTGATATCCGTCGCTCATGACCACACCCTATGAGTTC
GATCTAGGCATCGACCAGATCAAAGGCTAGCCTGATCTAGTCATTGTCTAGATAGGGAGTTCCCATGGCACCTTTGCTCGTCCTACTCGTCTTCACAACT
CTTGCCCGAGCGGTGGGGGCGCTGGGCGTCGGCTACGTCGCCTCCTGGCCCGCCGCCACGGCTGTGGGGCTGGCTGCCATGTTCATCGTGACCGGCGCCA
GCCACTTCTTCCCTGCACGGCGGGCAGGGCTCATCGCCATCGTCCCACCGGCTCTCAAGCATCCCGCCGCGCTGGTCGCACTCACCGGCGTGCTCGAGCT
GCTGGGTGCCGTCGCCCTTCTCGTTCCTCCCGAGGCGGGACAGCTGCGCGTGGCAGCAGCGCTCAGTCTGGCCGTGCTGCTGTTCGTGATGTTCCCCGCG
AATGTCTACGCGTCACAAGCCCGTAGGTCTGAGCACTCACCCAACACGCCCCTGCCGCAGCGAACGGCGATGCAGTGCGTCTTCATCGCGGCGACACTCT
TCGTGGCTCTGGCTTCATGACACGAGGACATCACGCCAACGCCGCTCTCACGCCCCGACACCGACTCAAGGTCGCCCGTCTCGTCGTCGACGACGGCTGG
CCGATCAGTGAAGTCGCCGCGCGGTTCCAAGTGTCATGGCCGACCGTGAAGAGATGGGTCGACCGCTACCTGGTCGGCGAGTCCATGCAGGACCGGTCGT
CGCGCCCGAGGACTTCGCCGAACAAGACCCCGAAGTCGGTGACGAAGCGCTGCGTGAGCCTTCGAATGCGACTGCGGGAAGGGCCCGTTCAGCTCGCTTC
CCGACTCGACATCGCACCGTCCACCGTGCATCGCATCCTCACCACGGCGCGCCTGAACCGGTTGTCCTACGTGGACCGCGCCACCGGTGAGCCCGTCCGC
CGGTACGAGCACCCTCACCCCGGGTCGCTGGTTCACGTGGACGTGAAGAAGGTCGGGAACATCCCTGACGGTGGCGGTTGGCGGTACGTCGGCCGCCGCC
AAGGCGAGAAGAACCGCGCCGCCACGCCAGGCAAGCCGCGCAACCAGTACGGCGGCCCGAAACTCGGGTACGCGTTCGTCCACACCGTCATCGACGACCA
CTCCCGCGTCGCATACACCGAGGTCCACGACGACGAGACCGCCGTCACTGCCGTCGCCGTCCTTCGCCGGGCGGTGCAGTGGTTCGCCGGCCATGGCGTC
ACCATCGAGCGGGTGCTGTCGGACAACGGTGGGGCGTACCGCTCGCACCTGTGGCGCGACACCTGCCATGGCCTGTCGATCACGCCGAAACGGACCTGCC
CGTACCGCCCGCAGACCAATGGGAAGGTGGAGCGCTTCCACCGGACGATGGCCGACGGGTGGGCCTACGCCCGCTGCTACACCAGCGAGCAAGAACGCCG
CGACGCGCTCGAGCCGTGGCTGGGGCACTACAACGAAGTGCGACCGCATACCGCGTGCGGTAACCAGCCGCCACTCACCAAACTCAGAGCGGCTGGCTAC
TCTTCGGTGGCTTCGACGCCCACGCCGGGCGGCTCCTCGGTGAAGAACTCGGGGTCCGAGTAGGGAAGGATCGTCCAGTGGAGATCCTCCGCCGAGTCGA
TCCGGCTAGGAACGGAGGCGTCTTGTTGGCGGGTGTGGTCCTGGATCAGATGCATCTGAACTACCTTGGCCGCTCGACGAATCTGCTCCACGATCCCCCC
GGCGGGTATGTCCAGCGGCGAGGAGATCCGGTCGGCTGCACGGTCGACATAGAAGCCAGCCTGCTTGTCGAGGTTCCGCTGCCGCGCCAACCCATGGAAC
GTCTCCAAGTCCGGCAACTGGTAGTACTCGATCCGACGGTCTGGGTCCCAGAAGCCACCCAGACCTGACGCAAACTGCTCCGCCACCTGCAGCTTTTCTG
CATGTGGTGCGCGGGTGGTGCGTAGGCCGTCCGGAACCCGAACCAGGCCAGGTTCGTGAGGCTGCACCCCGTAGAGGCTCAAAGGTGCACTCCACTCCCA
CTCGGCGGCCTCGTAAAGCCAGCGTGCCTTCGCCACTTCCTCCATCGCGAGCACCAGGAGCGCCTGAGCCCGGCCGGCACTGCCCCGGTCCGCCAGGGCG
TGGGCATCCTCCACAAGCGCCACCGCGTTCGCCATCAGGGACTTCCACCATCGGCGCGCGAAGTCTGCGCTGACCTCTTCAACCTGCGGCTCGCCCATGC
AGCGAGTATCTCGCGTCACCGGCCGATCCTCGATCAACGTCTCCGGTCAGTACA
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
552 bp | 183 aa | 581 | 30 | - | No |
Annotation : Transcriptional regulator, TetR familyDescription : Transcriptional Regulator factor
ORF sequence :
MSDGYHHGDLRRALLDESAQMIDEVGPSSLSLRELARRAGVSHGAPAHHFGDRRGLLTALAAQGLRLLAADVAAATAGGFDEAAVAYVRFAREHPGHYAV
MHRPELLHADDGELSAARASSMEALMAGVESIPAERRAHLTTSEAAHVAWSLVHGLASLAAEGATPDLQTDDLARRAARQLFA
MHRPELLHADDGELSAARASSMEALMAGVESIPAERRAHLTTSEAAHVAWSLVHGLASLAAEGATPDLQTDDLARRAARQLFA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
456 bp | 151 aa | 665 | 1120 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MAPLLVLLVFTTLARAVGALGVGYVASWPAATAVGLAAMFIVTGASHFFPARRAGLIAIVPPALKHPAALVALTGVLELLGAVALLVPPEAGQLRVAAAL
SLAVLLFVMFPANVYASQARRSEHSPNTPLPQRTAMQCVFIAATLFVALAS
SLAVLLFVMFPANVYASQARRSEHSPNTPLPQRTAMQCVFIAATLFVALAS
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1047 bp | 348 aa | 1117 | 2163 | + | No |
Chemistry : DDE
ORF sequence :
MTRGHHANAALTPRHRLKVARLVVDDGWPISEVAARFQVSWPTVKRWVDRYLVGESMQDRSSRPRTSPNKTPKSVTKRCVSLRMRLREGPVQLASRLDIA
PSTVHRILTTARLNRLSYVDRATGEPVRRYEHPHPGSLVHVDVKKVGNIPDGGGWRYVGRRQGEKNRAATPGKPRNQYGGPKLGYAFVHTVIDDHSRVAY
TEVHDDETAVTAVAVLRRAVQWFAGHGVTIERVLSDNGGAYRSHLWRDTCHGLSITPKRTCPYRPQTNGKVERFHRTMADGWAYARCYTSEQERRDALEP
WLGHYNEVRPHTACGNQPPLTKLRAAGYSSVASTPTPGGSSVKNSGSE
PSTVHRILTTARLNRLSYVDRATGEPVRRYEHPHPGSLVHVDVKKVGNIPDGGGWRYVGRRQGEKNRAATPGKPRNQYGGPKLGYAFVHTVIDDHSRVAY
TEVHDDETAVTAVAVLRRAVQWFAGHGVTIERVLSDNGGAYRSHLWRDTCHGLSITPKRTCPYRPQTNGKVERFHRTMADGWAYARCYTSEQERRDALEP
WLGHYNEVRPHTACGNQPPLTKLRAAGYSSVASTPTPGGSSVKNSGSE
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
741 bp | 246 aa | 2837 | 2097 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MIEDRPVTRDTRCMGEPQVEEVSADFARRWWKSLMANAVALVEDAHALADRGSAGRAQALLVLAMEEVAKARWLYEAAEWEWSAPLSLYGVQPHEPGLVR
VPDGLRTTRAPHAEKLQVAEQFASGLGGFWDPDRRIEYYQLPDLETFHGLARQRNLDKQAGFYVDRAADRISSPLDIPAGGIVEQIRRAAKVVQMHLIQD
HTRQQDASVPSRIDSAEDLHWTILPYSDPEFFTEEPPGVGVEATEE
VPDGLRTTRAPHAEKLQVAEQFASGLGGFWDPDRRIEYYQLPDLETFHGLARQRNLDKQAGFYVDRAADRISSPLDIPAGGIVEQIRRAAKVVQMHLIQD
HTRQQDASVPSRIDSAEDLHWTILPYSDPEFFTEEPPGVGVEATEE
Blast result :
Comments
The third ORF is the transposase, it is 73% aa similar to IS5564.The first ORF is annotated as TetR family transcriptional regulator, the second and the fourth as hypothetical protein.
References
1] Takarada,H., Sekine,M., Kosugi,H., Matsuo,Y., Fujisawa,T., Omata,S., Kishi,E., Shimizu,A., Tsukatani,N., Tanikawa,S., Fujita,N. and Harayama,S.(2008) J. Bacteriol. 190 (12), 4139-4146
2] ISfinder annotation (2008)
2] ISfinder annotation (2008)