ISRel24

  • Family IS66
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number : NC_007762
Transposition : ND
Origin : Rhizobium etli
Host : Rhizobium etli CFN 42 plasmid p42a
DNA section
IS Length : 2837 bp

Ends


IR Length : 19/21

IRL : GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAG
IRR : GTAACCATCCGGCCAAGGCCGCGGGGCTTATGATTACGGCTCGTCTGCCA
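
The two end sequences above can be checked against each other and against the element itself. The sketch below assumes that IRR is written as the reverse complement of the element's right end and that "19/21" counts identical positions over the first 21 bases of IRL and IRR; the function names are illustrative and not part of the record.

```python
# Terminal inverted repeats exactly as listed in the record.
IRL = "GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAG"
IRR = "GTAACCATCCGGCCAAGGCCGCGGGGCTTATGATTACGGCTCGTCTGCCA"

# Last 37 bp of the element, copied from the final line of the DNA sequence.
RIGHT_END = "GTAATCATAAGCCCCGCGGCCTTGGCCGGATGGTTAC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str, window: int) -> int:
    """Count identical positions over the first `window` bases of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# Assumption: IRR is reported in reverse-complement orientation.
print(reverse_complement(RIGHT_END) == IRR[:len(RIGHT_END)])

# Assumption: the IR Length field counts matches over a 21-base window.
print(f"{count_matches(IRL, IRR, 21)}/21")
```

Under these assumptions the first check prints True and the second prints 19/21, in agreement with the IR Length field.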

Insertion site


Left flank / Right flank : AGGCCTCATCGAACTATCTC
Direct repeat :
DR Length : 0

DNA sequence

GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAGTCCATACCTTATGTCATGCGCTACAATCATTTCCCCCATTGAGGTTCTGT
CCGTCGACGATCTTGGTCGCCGTCGAGATTGGTCGGATGAAGAGAAGGTGCGGATTGTCGAAGAAAGTCTGCACGGATACCGGCAGGGTTCGGCGACAGC
ACGGCGTTATGGATTGTCGCGGTCATTGTTGACGACCTGGCGGCGGGAGTGCCGAAGCGGGCTTCTGAGCGTTTCCGCATCAACGAGCTTCGTGCCGCTT
TCGATTTTGCCGCCGCCTGCAGCATCTTCCGAGATGATGGCTCCACTCCAGGCGGACGGCGATAAGTTGATCGAGATCGGCCTGCCGAACGGCCGACGAC
TGATGATCCCAGCCTCGCTTGATCCGACCATTCTTGCCCGCCTGTTGCCCGTCGTGGATGGGTCATGATCGCGTTTCCCGCTGGTGTGAAGGTCTGGATC
GCGGGTGGCGTGACGGACATGCGTTGCGGCATGAACAGCCTGGCGCTGAAGGTCCAGCAAGGCCTTGGCCGTGGGACCCTGATGGCGGTGAGGTCTTCTG
CTTCCGGGGTCGCAAGGGTGACCTGATCAAGGTCCTCTGGCATGACGGCGTCGGCATGTCGCTTTACCTGAAGCGGCTGGAAGCTGGAAAGTTCATCTGG
CCGGTCAGCCAGAATGGCTCCGCCGTGCCTGTATCGTCGGCGCAGCTCGGCTATCTCCTGGAAGGGATCGACTGGCGCAACCCGCGCTGGACGCAGCGGC
CTTCGAAGGCAGGCTGGCCGCCTGCATTCCTCTGTTTTTGTTGGGCTTTCGGCATGCGGCATGGTAGCTTTCGGCCATGGATGATGCTGCTTCGGAGATA
GCCAGACTGCGCGCCGCGCTTGCGGCATCGGAAGCGCGTGCCGCCTCTGCCGAGGCCGACCTCGCACAGGTGCGCGCGGTCGTGACGACGTCTGAGGCGA
TGATCCGGCATCTCAAGCTCGAGATCGCCAAGATACGTCGCGAGCAGTACGGCCAGAGCTCGGAGCGCCGCGCCCGGCTGATCGAGCAAATGAAATTGCA
GCTCCAAGAACTTGAAGCCGACGCCACCGAAGACGAGATCGCTGCGGAACGCGTGGCGACGAGAATCACAAATGTCTCCGCTTTCGAACGCCGCCGGCCG
GCCCGCAAGCCGTTTCCCGAGCACCTGCCGCGCGAGCGCCTGGTCATCGATGCCCCGTCGACCTGCACCTGCTGCGGCTCGCCCCGCATCGTGAAGATGG
GCGAAGACATCACCGAGACGCTGGAGATCATTCCGCGCCAGTGGAAGGTGATCCAGACGGTTCGCGAGAAGTTCACCTGTAGGGACTGCGAGAAGATCAG
CCAGCCACCGGCCCCTTTCCATGCGACACCGCGGGGATGGGCAGGACCGCACCTGCTGGCGACGATCCTGTTCGAGAAGTTCGGCCAACATCAGCCATTG
AACCGCCAGGCTGAGCGCTACGCCAGGGAAGGCGTCGCTCTCAGTCTCTCCACACTGGCCGATCAGGTCGGAGCCTGCACGACGGCCCTGCAGCCGATCC
ATGACCTGATCCGTGCCCATGTTCTGGCCGCCGAGCGGCTGCATGGTGACGACACCACCGTGCCGCTTCTGGCCAGGGGAGCAACAAAGCAGGCGAGGCT
CTGGACTTACGTCCGCGATGACCGCCCTTTCGCGGGCGGCGCGCCTCCCGCCGCACTCTTCCACTTCTCTCCCGATCGCGAGAAGACCCACCCCAACACG
CATCTCGCCGGATGGCACGGCACCCTGCAAGCCGATGCCTATGGCGGCTACAACGACCTCTATCGTGTCGACCGCCGCCCCGCGCTGGTGATCAGCGCAC
TTTGCTGGAGCCACGCGCGGCGCAAATTCTTCGAACTCGCTGACATCGCCGGCAACGTGCGCAAGGGCAAACCTGCCCACGAGATATCGCCCGTCGCGCT
TGAGGCCGTTGCCCGCATCGACGCGCTCTTCGACATCGAGCGCGGCATAAACGGAATGCCTGCCGAGGATAGGCTCGCAGCGAGGCTGCAACATGCTCGC
CCGCTCGTCGAAGAACTGCACGATTGGCTCATGGCCCAGCGCGGGCAAATGTCGAAGCACAACCCCGTCGCCAAGGCGATCAACTACATGTTCGAGAAGG
AGGGTCGCTGGGAAGCCTTCGCCCGGTTCCTCGACGACGGCAGACTGTGTCTGACGAACAATGCCGCCGAACGAGCCCTGCGCGGCGTTGCTCTCGGAAG
AACGGCATGGCTATTCGCCGGTTCCCAGCGCGGAGGCGAGCGTGCTGCCTTCATGTATTCCCTAATCGTCACGGCAAAGATGAACGATATCGATCCGCAG
GCCTGGTTGGCGGACGTGCTCGCCCGCATGCCTGGCATTCCCGTATCACGGCTGCCGGAACTGTTGCCGTGGAACTTGCCCGCCGGAAGCGCCCGGCAGG
TGGCGGCCTGATGGCGCGCCCGACGCATGTCTACACCATCGAATATGTCGCCACGCTGATCGGTGAGAACCTCGAGTTGCTTGAAGAAATCGCCAGCAAT
TCCGACAACATCGACTACGGCGAGATGATCCATGTCTACGACGGCACCGACGAGGGCATCACAACCTTTACCGACCGCGGCATCGAGAGCCTGCAGGAGT
TCCTTGCCGACGTGCGCTCCTGGGAAGGCGGCGTTCGCCAATTTCTTCTCGACGAGCAACGTGATCCGGAAAAGATCGAACGCATCATGGCAGACGAGCC
GTAATCATAAGCCCCGCGGCCTTGGCCGGATGGTTAC
Protein section
ORF number : 3

 

ORF 1
Length : 408 bp (135 aa)
Begin : 61
End : 468
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MSCATIISPIEVLSVDDLGRRRDWSDEEKVRIVEESLHGYRQGSATARRYGLSRSLLTTWRRECRSGLLSVSASTSFVPLSILPPPAASSEMMAPLQADG
DKLIEIGLPNGRRLMIPASLDPTILARLLPVVDGS
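
The ORF 1 coordinates can be checked against the DNA sequence with a short translation sketch. It uses only the first 100 bp of the element (the first line of the DNA sequence), reads the ORF as starting at position 61 on the + strand, and assumes the standard codon assignments; `translate` is an illustrative helper, not part of the record.

```python
from itertools import product

# Standard codon assignments (assumed here), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 100 bp of ISRel24, copied from the first line of the DNA sequence.
FIRST_100_BP = (
    "GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAG"
    "TCCATACCTTATGTCATGCGCTACAATCATTTCCCCCATTGAGGTTCTGT"
)


def translate(dna: str, begin: int, end: int) -> str:
    """Translate 1-based inclusive coordinates on the + strand.

    Any incomplete trailing codon is dropped.
    """
    region = dna[begin - 1:end]
    usable = len(region) - len(region) % 3
    return "".join(CODON_TABLE[region[i:i + 3]] for i in range(0, usable, 3))


# Only the start of ORF 1 fits in this 100 bp excerpt.
print(translate(FIRST_100_BP, 61, 100))
```

The printed peptide, MSCATIISPIEVL, matches the first 13 residues of the ORF 1 sequence. Translating the whole ORF would require passing the complete 2837 bp sequence with the full coordinates.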

 

ORF 2
Length : 118 bp (145 aa)
Begin : 458
End : 575
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MIAFPAGVKVWIAGGVTDMRCGMNSLALKVQQGLGRGDPDGGEVFCFRGRKGDLIKVLWHDGVGMSLYLKRLEAGKFIWPVSQNGSAVPVSSAQLGYLLE
GIDWRNPRWTQRPSKAGWPPAFLCFCWAFGMRHGSFRPWMMLLRR

 

ORF 3
Length : 1635 bp (544 aa)
Begin : 877
End : 2511
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MDDAASEIARLRAALAASEARAASAEADLAQVRAVVTTSEAMIRHLKLEIAKIRREQYGQSSERRARLIEQMKLQLQELEADATEDEIAAERVATRITNV
SAFERRRPARKPFPEHLPRERLVIDAPSTCTCCGSPRIVKMGEDITETLEIIPRQWKVIQTVREKFTCRDCEKISQPPAPFHATPRGWAGPHLLATILFE
KFGQHQPLNRQAERYAREGVALSLSTLADQVGACTTALQPIHDLIRAHVLAAERLHGDDTTVPLLARGATKQARLWTYVRDDRPFAGGAPPAALFHFSPD
REKTHPNTHLAGWHGTLQADAYGGYNDLYRVDRRPALVISALCWSHARRKFFELADIAGNVRKGKPAHEISPVALEAVARIDALFDIERGINGMPAEDRL
AARLQHARPLVEELHDWLMAQRGQMSKHNPVAKAINYMFEKEGRWEAFARFLDDGRLCLTNNAAERALRGVALGRTAWLFAGSQRGGERAAFMYSLIVTA
KMNDIDPQAWLADVLARMPGIPVSRLPELLPWNLPAGSARQVAA

 

Comments
At the amino acid level, ISRel24 shares 57% similarity with ISRsp1 (ORFA), 72% with IS66 (ORFB) and 86% with IS66-1 (ORFC, the transposase).
The second ORF was reconstructed in silico: there is a non-programmed frameshift at position 575, which may be a sequencing error.
The IS was reconstructed in silico by deleting ISRel19 (an IS66 family member) and one of the direct repeats generated by its insertion inside ISRel24.
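
The frameshift noted for the second ORF also shows up in the ORF coordinates. The sketch below assumes the three ORF rows read as length in bp, length in aa, begin and end, and that an intact ORF spans (aa + 1) codons including the stop codon; the `Orf` tuple and `check` helper are illustrative names only.

```python
from typing import NamedTuple, Tuple


class Orf(NamedTuple):
    name: str
    length_bp: int
    length_aa: int
    begin: int
    end: int


# Values as read from the ORF 1 to ORF 3 entries of the record.
ORFS = [
    Orf("ORF 1", 408, 135, 61, 468),
    Orf("ORF 2", 118, 145, 458, 575),
    Orf("ORF 3", 1635, 544, 877, 2511),
]


def check(orf: Orf) -> Tuple[bool, bool]:
    """Return (span_ok, coding_ok) for one ORF.

    span_ok:   end - begin + 1 equals the listed length in bp.
    coding_ok: the listed bp length equals (aa + 1) codons, stop included.
    """
    span_ok = orf.end - orf.begin + 1 == orf.length_bp
    coding_ok = (orf.length_aa + 1) * 3 == orf.length_bp
    return span_ok, coding_ok


for orf in ORFS:
    span_ok, coding_ok = check(orf)
    print(f"{orf.name}: span_ok={span_ok} coding_ok={coding_ok}")
```

With these values ORF 1 and ORF 3 pass both checks, while ORF 2 passes only the span check: 118 bp cannot encode 145 aa. One reading, offered as an interpretation rather than a statement from the record, is that the listed ORF 2 coordinates stop at the frameshift position (575) while the protein sequence continues past it in the reconstructed frame.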
References
[1] González V, Santamaría RI, Bustos P, Hernández-González I, Medrano-Soto A, Moreno-Hagelsieb G, Janga SC, Ramírez MA, Jiménez-Jacinto V, Collado-Vides J, Dávila G. (2006) PNAS 103(10):3834-9.