ISNov4
- Family IS1202
- Group ISAba32
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
FWXL01000010 | ND | Novosphingobium sp. | Novosphingobium sp. B1 |
DNA section
IS Length : 1740 bp
Ends
IR Length : 31/43
IRL : TGTAGGCGCTAAGTAGAAATGACACTGCTCCGCTAGATAGAAAAGGCACA
IRR : TGTTGCTGCCACTTAGAGAAGACACATTCGTCGCCAGATAGAACTGACAC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTTCTTATCGATCTTG | CGCGTT | TGCGCTTGCAGATCATTTAC | 6 |
DNA sequence
TGTAGGCGCTAAGTAGAAATGACACTGCTCCGCTAGATAGAAAAGGCACACAGTGCCAGCGGGTCGCCGGTGTCATGGCCATCCTCGGCAGCGAGGATGT
GCGAACGATGACGGTGCTGACGATGAGTACTGCCGAGGTGAGCCGGTTCGACACCCTGATGCGGCTCGACCGCGGTGAGATCCGGGTTGCAGACGCGATG
GCGCTACTGAGCCTTGAGCGTCGACAGATCTACCGGTTGCTGGAGCGAGTTCGGCAGGACGGCGCAGCCGGGCTGATCTCGCGCAAGCGTGGCCGTCCGA
GCAACCGGCGTTACGGCGATGCCTTTCGTGACCAGGTCGTCAGCCTCGTGCGCGAGCAGTATAACGGGTTCGGGCCCACGCTGGCGCGCGAGTATCTGGC
CGAGCGGCACGGCATCCGGGTGTCGTGCGAGACGCTGCGGCAGATGATGATGGCAGCGGGGCTGTGGAAGGACCGCGCAGCGCGTCGCCCTCGCCCGCAC
CAGCCGCGTTACCGGCGGGACTGCCGCGGCGAGCTGATCCAGGTCGACGGGTCCAAACACTGGTGGTTCGAGGACCGGGGCCCGCAATGCACGCTGCTGG
TCTATATCGACGATGCGACCAGCGAATTGATGCACCTGGAGATGGTAGAGAGCGAGAGCACCTTCGCCTACATGCGGGCCACGCGGACGTATATCGAGCG
CCACGGAAAGCCGGTCGCCTTGTACTCCGACAAGCACAGCGCGTTTCGCAACAACACCGCTTCGGCGAACGGTGACGGCATGACCCATCTCGGGCGGGCG
CTGCATGCGCTCAACATCGAGATCATCTGCGCCAACTCGCCGCAGGCCAAGGGGCGTGTCGAGCGGGCGAACGGCACGTTGCAGGACCGGCTGGTCAAGG
CGATGCGACTGGAGGGCATCTCCACCATCGAGGAGGCCAATGCCTTTCTCCCCGCCTACATGGCCCGGCACAACCGGCAGTTCGCCAGGACGCCGTTCGA
CCCGCGCGATCTCCACCGCCCGCTGGCGCCGCACGAGAACATCGAGGCAGAGATGGTCTGGCGCGAGCAGCGCACGGTGACCGCGGCGCTGACGCTGCAC
TACAACAAGGCGCTGTTCATCCTGGAGCCGAACCGGGTCAGCCAGGCACTGGCGCGCAAGCGGGTAGATGTCTGCGAGTTCCCTGACGGGCGGATCGAGA
TCCGACACGACGGGCAGGCCTTGCCCTATCGCGTGTTCGACAAGATGCAGCGGGTGAACCAGGCAGCGGTGGTCGACAACAAGCATCTCGACGCGGCGCT
GGCGATGGCGCGGCTCCTGCAGGAAACGGCACCACACCACCGCAAGCGGAACAACAACGAGCCGGCGCGGACCGCCCAGACTGGCGGTATGTTCGCCGCC
TCGGCTGCTCCGACTGCATCGAAGGTGGATCGCCGCACGCTCTGCACGCCCAAGCTGAAGCGTGGACCGCGCCTTCCCAACACGGAACTCGTCGCGCGTG
GACTGGCCGAATACGCGGGCCGAGATCGGAACACGCCGCCTCAGGAACCTGCCTCACTGCCTGCTGTTCTAGCGTGACCGGGCCCGCTGCTGGCGCAGCG
GGTCCGTAGCCCTCTCAGAACTCACCGCATCAGCGGGGTCAGCGCTGCGCTTGTGGATGGCTCCACGCTGCCCCCGCTGCTACCAACGGTGTGTCAGTTC
TATCTGGCGACGAATGTGTCTTCTCTAAGTGGCAGCAACA
GCGAACGATGACGGTGCTGACGATGAGTACTGCCGAGGTGAGCCGGTTCGACACCCTGATGCGGCTCGACCGCGGTGAGATCCGGGTTGCAGACGCGATG
GCGCTACTGAGCCTTGAGCGTCGACAGATCTACCGGTTGCTGGAGCGAGTTCGGCAGGACGGCGCAGCCGGGCTGATCTCGCGCAAGCGTGGCCGTCCGA
GCAACCGGCGTTACGGCGATGCCTTTCGTGACCAGGTCGTCAGCCTCGTGCGCGAGCAGTATAACGGGTTCGGGCCCACGCTGGCGCGCGAGTATCTGGC
CGAGCGGCACGGCATCCGGGTGTCGTGCGAGACGCTGCGGCAGATGATGATGGCAGCGGGGCTGTGGAAGGACCGCGCAGCGCGTCGCCCTCGCCCGCAC
CAGCCGCGTTACCGGCGGGACTGCCGCGGCGAGCTGATCCAGGTCGACGGGTCCAAACACTGGTGGTTCGAGGACCGGGGCCCGCAATGCACGCTGCTGG
TCTATATCGACGATGCGACCAGCGAATTGATGCACCTGGAGATGGTAGAGAGCGAGAGCACCTTCGCCTACATGCGGGCCACGCGGACGTATATCGAGCG
CCACGGAAAGCCGGTCGCCTTGTACTCCGACAAGCACAGCGCGTTTCGCAACAACACCGCTTCGGCGAACGGTGACGGCATGACCCATCTCGGGCGGGCG
CTGCATGCGCTCAACATCGAGATCATCTGCGCCAACTCGCCGCAGGCCAAGGGGCGTGTCGAGCGGGCGAACGGCACGTTGCAGGACCGGCTGGTCAAGG
CGATGCGACTGGAGGGCATCTCCACCATCGAGGAGGCCAATGCCTTTCTCCCCGCCTACATGGCCCGGCACAACCGGCAGTTCGCCAGGACGCCGTTCGA
CCCGCGCGATCTCCACCGCCCGCTGGCGCCGCACGAGAACATCGAGGCAGAGATGGTCTGGCGCGAGCAGCGCACGGTGACCGCGGCGCTGACGCTGCAC
TACAACAAGGCGCTGTTCATCCTGGAGCCGAACCGGGTCAGCCAGGCACTGGCGCGCAAGCGGGTAGATGTCTGCGAGTTCCCTGACGGGCGGATCGAGA
TCCGACACGACGGGCAGGCCTTGCCCTATCGCGTGTTCGACAAGATGCAGCGGGTGAACCAGGCAGCGGTGGTCGACAACAAGCATCTCGACGCGGCGCT
GGCGATGGCGCGGCTCCTGCAGGAAACGGCACCACACCACCGCAAGCGGAACAACAACGAGCCGGCGCGGACCGCCCAGACTGGCGGTATGTTCGCCGCC
TCGGCTGCTCCGACTGCATCGAAGGTGGATCGCCGCACGCTCTGCACGCCCAAGCTGAAGCGTGGACCGCGCCTTCCCAACACGGAACTCGTCGCGCGTG
GACTGGCCGAATACGCGGGCCGAGATCGGAACACGCCGCCTCAGGAACCTGCCTCACTGCCTGCTGTTCTAGCGTGACCGGGCCCGCTGCTGGCGCAGCG
GGTCCGTAGCCCTCTCAGAACTCACCGCATCAGCGGGGTCAGCGCTGCGCTTGTGGATGGCTCCACGCTGCCCCCGCTGCTACCAACGGTGTGTCAGTTC
TATCTGGCGACGAATGTGTCTTCTCTAAGTGGCAGCAACA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1470 bp | 489 aa | 108 | 1577 | + | No |
Chemistry : DDE
ORF sequence :
MTVLTMSTAEVSRFDTLMRLDRGEIRVADAMALLSLERRQIYRLLERVRQDGAAGLISRKRGRPSNRRYGDAFRDQVVSLVREQYNGFGPTLAREYLAER
HGIRVSCETLRQMMMAAGLWKDRAARRPRPHQPRYRRDCRGELIQVDGSKHWWFEDRGPQCTLLVYIDDATSELMHLEMVESESTFAYMRATRTYIERHG
KPVALYSDKHSAFRNNTASANGDGMTHLGRALHALNIEIICANSPQAKGRVERANGTLQDRLVKAMRLEGISTIEEANAFLPAYMARHNRQFARTPFDPR
DLHRPLAPHENIEAEMVWREQRTVTAALTLHYNKALFILEPNRVSQALARKRVDVCEFPDGRIEIRHDGQALPYRVFDKMQRVNQAAVVDNKHLDAALAM
ARLLQETAPHHRKRNNNEPARTAQTGGMFAASAAPTASKVDRRTLCTPKLKRGPRLPNTELVARGLAEYAGRDRNTPPQEPASLPAVLA
HGIRVSCETLRQMMMAAGLWKDRAARRPRPHQPRYRRDCRGELIQVDGSKHWWFEDRGPQCTLLVYIDDATSELMHLEMVESESTFAYMRATRTYIERHG
KPVALYSDKHSAFRNNTASANGDGMTHLGRALHALNIEIICANSPQAKGRVERANGTLQDRLVKAMRLEGISTIEEANAFLPAYMARHNRQFARTPFDPR
DLHRPLAPHENIEAEMVWREQRTVTAALTLHYNKALFILEPNRVSQALARKRVDVCEFPDGRIEIRHDGQALPYRVFDKMQRVNQAAVVDNKHLDAALAM
ARLLQETAPHHRKRNNNEPARTAQTGGMFAASAAPTASKVDRRTLCTPKLKRGPRLPNTELVARGLAEYAGRDRNTPPQEPASLPAVLA
Blast result :
Comments
ISNov4 is 82% aa similar to ISShsp5.
References
1] Patricia Siguier (2020) Direct submission.
2] DOE-JOINT GENOME INSTITUTE (2017) Direct GenBank submission.
2] DOE-JOINT GENOME INSTITUTE (2017) Direct GenBank submission.