A motif in a DNA sequence represents a short, recurring pattern that often has biological implications. They often play role as specific binding sites for transcription factors and other proteins.
While investigating motifs and their sites generated using MEME and FIMO, I came across a trivial issue. MEME is a tool for motif discovery while FIMO outputs the sites where a particular motif is found.
Examplei(this is a modified form of the original FIMO output):
1 2 3
Motifs are generally described using position weight matrix where each base A,T,G,C takes a value for 0 to 1, depending on how frequently that base occurs at that position.
If you ignore the strand information, the two sites are not in sync, which should be apparent from their start and stop coordinates.
the case of ‘+’ strand is pretty straightforward, but to decipher the start and stop cooridnates for ‘-‘ strand, required me to draw this diagram:
1 2 3 4 5 6 7
1 2 3
- [What are DNA sequence motifs?] (http://www.nature.com/nbt/journal/v24/n4/full/nbt0406-423.html)
- MEME Suite: Tools for motif discovery and searching