From GersteinInfo

Revision as of 17:50, 10 September 2010 by Justin.jee (Talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


The standard output for the aggregation script is a three column file corresponding to nucleotide position, bin number, and average signal (or a two column file corresponding to bin number and average signal if bins are not uniform length). Options are available to add extra columns corresponding to standard deviation or quartiles in signal. Click here for a powerpoint introduction to the Aggregation tool. Several example plots made using the GSA (see below) can be found in the GSA source documentation bundle on the ACT website. We also have extensive examples of SNP density aggregation plots around various structural sites including TSS's, CNV's, and pseudogenes; and aggregation plots from modENCODE data showing histone binding around TSS's in various developmental stages of worm. These will be made public once the data is officially public.

Binding from two types of chip experiments (Baf155) Pol II binding to TSS Baf155, with STD dev error bars PolII, from whole genome ChIP-seq

  • Web ACT

For aggregation, Web ACT generates a page with figures like the one shown here: example aggregation output

There is also the the Genomic Signal Aggregator (GSA, Zlab) which produces higher-resolution plots and works on a wide range of file types


Example of correlation between a large number of transcription factors from ChIP-chip experiment

The main output of the correlation script is a text file containing the matrix of correlation coefficients between all signal track inputs. This can be viewed as a heatmap, or can be used as the basis of a phylogenetic tree based on a specified number of bootstraps.

  • Web ACT

For correlation, Web ACT generates a page with figures like the one shown here: example correlation output


Most of the examples for the saturation program have involved data from the modENCODE project which are not publicly available yet. The figures from these calculations will be available once the data is officially public. A powerpoint describing the saturation tool can be found here. The program creates pdf outputs which look like this:

Personal tools