Figure 1 | Plant Methods

Figure 1

From: Using an ensemble of statistical metrics to quantify large sets of plant transcription factor binding sites

Marina workflow. a) A group is an umbrella term for a set of promoter sequences. Marina requires at least two groups so that the TFBSs within each group can be contrasted and statistically quantified, with TFBSs modeled as either DNA motifs or position weight matrices (PWMs). Marina can also run when both data structures are provided, hence the name "combined". b) Each group is modeled as a uni-directional graph, which provides a means of trimming low-abundance promoter sequences and TFBSs. c) A diverse collection of statistical metrics is used to model and quantify TFBS abundance. TFBSs are ranked by magnitude of abundance, and a p-value from the hypergeometric distribution assesses the significance of TFBS over-representation.
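
The over-representation test in panel c can be illustrated with a minimal sketch of an upper-tail hypergeometric p-value. The caption does not say how Marina parameterizes the test, so the setup here is an assumption: promoters are the sampling units, and the question is whether a TFBS occurs in more promoters of one group than expected by chance. The function name `hypergeom_upper_tail` and all counts are hypothetical and are not taken from Marina.

```python
from math import comb


def hypergeom_upper_tail(k, total, hits, group_size):
    """Return P(X >= k) for a hypergeometric random variable X.

    Assumed setup (hypothetical, not Marina's documented parameterization):
      total      -- promoters across all groups
      hits       -- promoters, across all groups, that contain the TFBS
      group_size -- promoters in the group being tested
      k          -- promoters in that group that contain the TFBS
    """
    upper = min(hits, group_size)  # largest possible overlap
    denom = comb(total, group_size)  # ways to choose the group's promoters
    numer = sum(
        comb(hits, i) * comb(total - hits, group_size - i)
        for i in range(k, upper + 1)
    )
    return numer / denom


# Illustrative counts only: 100 promoters in total, 20 of which contain
# the TFBS; the tested group has 30 promoters, 12 of them with the TFBS.
p_value = hypergeom_upper_tail(k=12, total=100, hits=20, group_size=30)
print(f"upper-tail p-value: {p_value:.3g}")
```

A small p-value under these assumptions suggests that the TFBS appears in the tested group more often than random sampling of promoters would produce. The sketch uses only the standard library; `scipy.stats.hypergeom.sf(k - 1, total, hits, group_size)` computes the same tail probability.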
