|
|
Line 1: |
Line 1: |
| | | |
| + | = Additional Resources Related to PeakSeq = |
| | | |
- | '''PeakSeq: Systematic Scoring of ChIP-Seq Experiments Relative to Controls'''
| + | Yale Chip-seq Data Pipeline |
- | | + | Software download |
- | Rozowsky J, Euskirchen G, Auerbach R, Zhang Z, Gibson T, Bjornson R, Carriero N, Snyder M, Gerstein M
| + | Illumina pipeline (wrapper for Illumina pipeline) |
- | | + | Chip-seq scoring pipeline (runs PeakSeq) |
- | Chromatin Immunoprecipitation followed by tag sequencing (ChIP-Seq) using high-throughput next-generation instrumentation is replacing ChIP-chip for genome-wide mapping of sites of transcription factor binding and chromatin modification, especially for mammalian genomes. Here we develop a methodology for identifying punctate binding sites in ChIP-Seq experiments based on their characteristics. In particular, we produce two deeply sequenced datasets for human RNA polymerase II and STAT1 with matching input DNA controls. In these sets, we observe that signal peaks, corresponding to sites of potential binding, are strongly correlated with peaks in input DNA that likely reveal features of open chromatin. Based on these observations we develop a two-pass approach for scoring ChIP-Seq data relative to controls. The first pass identifies putative binding sites and compensates for variation in the mappability of sequences across the genome. The second pass filters out sites that are not significantly enriched compared to the normalized input DNA and computes a precise enrichment and significance. Using our scoring approach we investigate the design of an optimal ChIP-Seq experiment. We examine the number of identified binding sites as a function of sequencing depth (i.e. saturation) and the value of multiple replicas. In particular, we find that little additional biological information is gained from more than two replicas.
| + | |
- | | + | |
- | | + | |
- | Mappability Map
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Mappability_Map/Code/ Mappability Map Code]
| + | |
- | | + | |
- | Maps:
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Mappability_Map/C.elegans/ Map for C. elegans]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Mappability_Map/D.melanogaster/ Map for D. melanogaster]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Mappability_Map/H.sapiens/ Map for H. sapiens"]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Mappability_Map/M.musculus/ Map for M. musculus]
| + | |
- | | + | |
- | | + | |
- | ChIP-Seq Scoring
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Scoring_ChIPSeq/Code/Preprocessing/ Preprocessing Code]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Scoring_ChIPSeq/Code/C/ C Code for ChIP-Seq Scoring]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Scoring_ChIPSeq/Code/Perl/ Perl Code for ChIP-Seq Scoring]
| + | |
- | | + | |
- | | + | |
- | Scored results:
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Scoring_ChIPSeq/Results/PolII Pol II Results]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Scoring_ChIPSeq/Results/STAT1 STAT1 Results]
| + | |
- | | + | |
- | | + | |
- | Raw ChIP-Seq Sequence Data:
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Sequence_Data/HeLa-S3 Pol II Sequence Data]
| + | |
- | | + | |
- | [http://archive.gersteinlab.org/proj/PeakSeq/Sequence_Data/Stimulated_HeLa-S3 STAT1 Sequence Data]
| + | |
| | | |
| + | Also, one might want to consider ChipSeqSim : http://papers.gersteinlab.org/papers/chip-seq-simu/ |
| | | |
| Contact Joel Rozowsky at joel DOT rozowsky AT yale DOT edu. | | Contact Joel Rozowsky at joel DOT rozowsky AT yale DOT edu. |
Yale Chip-seq Data Pipeline
Software download
Illumina pipeline (wrapper for Illumina pipeline)
Chip-seq scoring pipeline (runs PeakSeq)
Contact Joel Rozowsky at joel DOT rozowsky AT yale DOT edu.