VAT/dataSets

From GersteinInfo

Revision as of 18:17, 9 March 2011 by Lukas.habegger (Talk | contribs)

Data sets

1000 Genomes Project

1000 Genomes Pilot Project: Low coverage samples

- Data files
    - Source: pilot_data, release: 2010_07, FTP: ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/
    - Indels
        - CEU.low_coverage.2010_07.indel.genotypes.vcf.gz
        - JPTCHB.low_coverage.2010_07.indel.genotypes.vcf.gz
        - YRI.low_coverage.2010_07.indel.genotypes.vcf.gz
    - SNPs
        - CEU.low_coverage.2010_07.genotypes.vcf.gz
        - CHBJPT.low_coverage.2010_07.genotypes.vcf.gz
        - YRI.low_coverage.2010_07.genotypes.vcf.gz
- Annotation file: GENCODE (version 3b, hg18) using CDS elements where gene_type = protein_coding and transcript_type = protein_coding
- Results: VAT
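
The annotation filter described above (CDS elements with gene_type = protein_coding and transcript_type = protein_coding) can be sketched in a few lines of Python. This is only an illustration, not the script used to build the annotation file: it assumes the GENCODE annotation is a tab-delimited GTF whose ninth column holds `key "value";` attribute pairs, and the function names `parse_attributes`, `is_protein_coding_cds` and `filter_cds` are hypothetical.

```python
def parse_attributes(field):
    """Parse a GTF attribute column (key "value"; pairs) into a dict.

    If a key occurs more than once, the last value wins.
    """
    attrs = {}
    for part in field.strip().split(";"):
        part = part.strip()
        if not part:
            continue
        key, _, value = part.partition(" ")
        attrs[key] = value.strip().strip('"')
    return attrs


def is_protein_coding_cds(line):
    """Return True for a CDS record whose gene and transcript are protein coding."""
    if line.startswith("#"):
        return False  # comment or header line
    cols = line.rstrip("\n").split("\t")
    if len(cols) < 9 or cols[2] != "CDS":
        return False  # malformed record, or a feature other than CDS
    attrs = parse_attributes(cols[8])
    return (attrs.get("gene_type") == "protein_coding"
            and attrs.get("transcript_type") == "protein_coding")


def filter_cds(lines):
    """Keep only the GTF lines that pass the protein-coding CDS filter."""
    return [line for line in lines if is_protein_coding_cds(line)]
```

Passing the lines of a GTF file to `filter_cds` returns the retained records unchanged, so the result can be written back out as a smaller GTF file.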


1000 Genomes Project, Phase I, chr22, SNP calls

- Data files
    - Source: release 20100804, FTP: ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20100804/
    - SNPs: ALL.2of4intersection.20100804.genotypes.vcf.gz
- Annotation file: GENCODE (version 3c, hg19) using CDS elements where gene_type = protein_coding and transcript_type = protein_coding
- Results: VAT
- Detailed workflow
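
The heading names chr22, while the listed SNP file carries the `ALL` prefix. As a rough sketch, and assuming (this page does not say so) that the chr22 calls are selected from that file by the CHROM column, the subset could be extracted as follows. The function names `iter_chromosome` and `extract_from_gzip` are hypothetical, and both `22` and `chr22` are accepted because the naming used in the file is not stated here.

```python
import gzip


def iter_chromosome(lines, chrom="22"):
    """Yield VCF header lines plus records whose CHROM column matches chrom."""
    wanted = {chrom, "chr" + chrom}
    for line in lines:
        # Header lines start with '#'; CHROM is the first tab-delimited column.
        if line.startswith("#") or line.split("\t", 1)[0] in wanted:
            yield line


def extract_from_gzip(path, chrom="22"):
    """Stream a gzip-compressed VCF and yield only the selected chromosome."""
    with gzip.open(path, "rt") as handle:
        yield from iter_chromosome(handle, chrom)
```

For example, `extract_from_gzip("ALL.2of4intersection.20100804.genotypes.vcf.gz")` streams the file line by line, so the full genotype file never has to be held in memory.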