CRIT/workflow
From GersteinInfo
(→Output) |
|||
Line 63: | Line 63: | ||
===Output=== | ===Output=== | ||
- | Cross Patterns have a natural X relationship Y representation making a network representation | + | Cross Patterns have a natural X relationship Y representation making a network representation an ideal way to visualize results. |
Cross patterns can easily be formatted in .sif for loading into various network browsers including | Cross patterns can easily be formatted in .sif for loading into various network browsers including | ||
[http://tyna.gersteinlab.org/tyna/ tYNA] or [http://www.cytoscape.org/ Cytoscape]. | [http://tyna.gersteinlab.org/tyna/ tYNA] or [http://www.cytoscape.org/ Cytoscape]. |
Revision as of 21:17, 30 January 2011
Contents |
Transcription Factor Example
Motivation and Problem Set Up
Cis regulatory elements as a means of regulating gene expression have been extensively studied. However, beyond such motifs, are there inherent properties of the targets themselves that make them more or less likely to be regulated by a given class of transcription factors? As an example, do essential transcription factors preferentially regulate essential targets? Are there genome composition features such as GC or codon bias that influence which targets are regulated by which TFs?
Input Data
Here, we use three different datasets as shown.
These objects are named as follows in the R dataset:
(1) T: Transcription factors and their associated properties
(2) C: Connector Matrix matching transcription factors to their associated targets
(3) G: Gene targets and their associated properties
T and G are both post processed from:
Y. Xia, E. A. Franzosa, and M. B. Gerstein. Integrated assessment of genomic correlates of protein evolutionary rate. PLoS Comput Biol, 5(6):e1000413–e1000413, 2009.
C is post processed from:
C. T. Harbison, et al. Transcriptional regulatory code of a eukaryotic genome. Nature, 431(7004):99–104, 2004.
As in Harbison et al, p<.001 was used to indicate a TF-gene target. We binarized the matrix such that any TF-gene pair with a pval<.001 had a 1 and anything greater than this had a 0.
Example Code
#Load Data load(file="TFExample.RData") #Load CRIT functions source(file="CRIT.R") #Generate label for feature of interest - set x for column variable tLabel<-initializer(T[,x], type="median") #Determine set of targets sensitive to this feature DC<-discriminator(C, tLabel, multCorrect=TRUE) #Generate new label based on sensitivity identified in previous step gLabel<-labelSlicer(DC, .05) #Identify features that seem to discriminate between sens/insens targets DG<-discriminator(t(G), gLabel, multCorrect=TRUE)
Output
Cross Patterns have a natural X relationship Y representation making a network representation an ideal way to visualize results.
Cross patterns can easily be formatted in .sif for loading into various network browsers including