<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://info.gersteinlab.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Lucasl</id>
	<title>GersteinInfo - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://info.gersteinlab.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Lucasl"/>
	<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/Special:Contributions/Lucasl"/>
	<updated>2026-05-07T14:41:18Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.42.6</generator>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Resources&amp;diff=4243</id>
		<title>Resources</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Resources&amp;diff=4243"/>
		<updated>2017-12-04T21:35:58Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Functional Annotation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The tools &amp;amp; resources listed below have been &#039;&#039;&#039;published and are actively maintained&#039;&#039;&#039; by the Gerstein lab. You may view a list of the associated literature [http://papers.gersteinlab.org/subject/coretools/index.html &#039;&#039;&#039;here&#039;&#039;&#039;]. &lt;br /&gt;
&lt;br /&gt;
In addition to the tools below, the lab has also published a number of [http://info.gersteinlab.org/More_tools &#039;&#039;&#039;tools that are not currently being actively maintained&#039;&#039;&#039;]. &lt;br /&gt;
&lt;br /&gt;
You may also access tools that have [http://info.gersteinlab.org/Even_more_tools &#039;&#039;&#039;not yet been published&#039;&#039;&#039;]&lt;br /&gt;
&lt;br /&gt;
Source code for all software is available on our [http://github.gersteinlab.org/ Github page].&lt;br /&gt;
&lt;br /&gt;
=Portals=&lt;br /&gt;
&lt;br /&gt;
=== MolMovDB  ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:Morph-icon.jpg‎]] &amp;lt;br&amp;gt; [http://molmovdb.mbb.yale.edu/molmovdb/ MolMovDB] ||&lt;br /&gt;
Servers and a suite of accessory tools for the analysis of conformational changes in protein and nucleic acid structures.  &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Networks===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 10px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:Network.jpg|center|x75px]] &amp;lt;br&amp;gt; [http://networks.gersteinlab.org/ Networks] ||&lt;br /&gt;
The Gerstein lab has been a pioneer in applying network analysis to generate knowledge form large-scale experiments. To this end, we have developed a portal for our network research.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Pseudogene.org===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:pseudogene.png‎]] &amp;lt;br&amp;gt; [http://www.pseudogene.org/ Pseudogene.org] ||Pseudogene.org is a collection of resources related to our efforts to survey eukaryotic genomes for pseudogene sequences, &amp;quot;pseudo-fold&amp;quot; usage, amino-acid composition, and single-nucleotide polymorphisms (SNPs) to help elucidate the relationships between pseudogene families across several organisms.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Structural Variants (SV)===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:SVpage logo.png‎|center|x85px]] &amp;lt;br&amp;gt; [http://sv.gersteinlab.org/ Structural Variants]||&lt;br /&gt;
Software that may be used to investigate Structural Variations (SVs) and Copy Number Variations (CNVs).&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Data Sets=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://sv.gersteinlab.org/breakdb/ &#039;&#039;&#039;BreakDB&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2009||This database, which is part of the PEMer package, contains information about structural variants and associated breakpoints.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Evolution=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://coevolution.gersteinlab.org/coevolution/ &#039;&#039;&#039;Coevolution analysis of protein residues&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2008||An integrated online system that enables comparative analyses of residue coevolution with a comprehensive set of commonly used scoring functions, including statistical coupling analysis (SCA), explicit likelihood of subset variation (ELSC), mutual information and correlation-based methods.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Genome Technology=&lt;br /&gt;
&lt;br /&gt;
===Gene Regulation===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[https://github.com/gersteinlab/Loregic &#039;&#039;&#039;Loregic&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2015||Loregic is a computational method integrating gene expression and regulatory network data, to characterize the logical cooperativity of regulatory factors. Loregic uses all 16 possible two-input-one-output logic gates (e.g. AND or XOR) to describe triplets of two factors regulating a common target, and finds the gate that best matches each triplet’s observed gene expression pattern across many conditions. Using human ENCODE ChIP-Seq and TCGA RNA-Seq data, we are able to demonstrate how Loregic characterizes complex circuits involving both proximally and distally regulating transcription factors (TFs) and also miRNAs.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Allele-Specific Effects===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://alleledb.gersteinlab.org &#039;&#039;&#039;AlleleDB&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2016||AlleleDB is an online resource for storing and visualizing allele-specific binding (ASB) and gene expression (ASE). Using variants from the 1000-Genomes Project and RNA-seq and ChIP-seq data from related projects, this resource serves as a repository for the catalog of ASB and ASE variants, associated genomic elements and personal genomes used in the study. AlleleDB also interfaces with the UCSC browser for visualization of results.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://alleleseq.gersteinlab.org/home.html &#039;&#039;&#039;AlleleSeq&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011||AlleleSeq is a computational pipeline that is used to study allele-specific expression (ASE) and allele specific binding (ASB). The pipeline first constructs a diploid personal genome sequence, then maps RNA-seq and ChIP-seq functional genomic data onto this personal genome. Consequently, locations in which there are differences in number of mapped reads between maternally- and paternally-derived sequences can be identified, thereby providing evidence for allele-specific events.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===ChIP-Seq ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://music.gersteinlab.org &#039;&#039;&#039;MUSIC&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/MUSIC/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2014||&lt;br /&gt;
MUSIC is an algorithm for identification of enriched regions at multiple scales in the read depth signals from ChIP-Seq experiments.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://www.gersteinlab.org/proj/PeakSeq/ &#039;&#039;&#039;PeakSeq&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/PeakSeq/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2009|| A tool for calling peaks corresponding to transcription factor binding sites from ChIP-Seq data scored against a matched control such as input DNA. PeakSeq employs a two-pass strategy in which putative binding sites are first identified in order to compensate for genomic variation in the &#039;mappability&#039; of sequences, before a second pass filters out sites not significantly enriched compared to the normalized control, computing precise enrichments and significances.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Functional Annotation ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://funseq.gersteinlab.org/ &#039;&#039;&#039;FunSeq&#039;&#039;&#039;]&amp;lt;br&amp;gt;   - &amp;amp; -   &amp;lt;br&amp;gt;[http://funseq2.gersteinlab.org/ &#039;&#039;&#039;FunSeq2&#039;&#039;&#039;]  [http://github.gersteinlab.org/FunSeq2/  Github] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2013&amp;lt;br&amp;gt;&amp;amp;&amp;lt;br&amp;gt;2014||These tools can be used to automatically score and annotate the disease-causing potential of SNVs, particularly those which are non-coding. FunSeq can detect recurrent annotation elements in non-coding regions when running with multiple personal genomes. FunSeq2 is an extension of FunSeq that provides a means of prioritizing somatic variants from cancer whole genome sequencing.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://larva.gersteinlab.org/ &#039;&#039;&#039;LARVA&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/LARVA Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2015||LARVA is a computational framework designed to facilitate the study of noncoding variants. It addresses issues that have made it difficult to derive an accurate model of the background mutation rates of noncoding elements in cancer genomes. LARVA integrates a comprehensive set of noncoding functional elements, modeling their mutation count with a beta-binomial distribution to handle overdispersion. Moreover, LARVA uses regional genomic features (such as replication timing) to better estimate local mutation rates and mutational enrichments.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://vat.gersteinlab.org/ &#039;&#039;&#039;VAT&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/vat/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2012|| A computational framework to functionally annotate variants in personal genomes using a cloud-computing environment.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://aloft.gersteinlab.org/ &#039;&#039;&#039;ALoFT&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/aloft Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2017|| A method to annotate and predict the disease-causing potential of loss-of-function variants.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://moat.gersteinlab.org/ &#039;&#039;&#039;MOAT&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/MOAT Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2017|| MOAT (Mutations Overburdening Annotations Tool) is a computational system for identifying significant mutation burdens in genomic elements with an empirical, nonparametric method. Taking a set of variant calls and a set of annotations, MOAT calculates which annotations have observed variant counts that are substantially elevated with respect to a distribution of expected variant counts determined by permutation of the input data.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Microarrays &amp;amp; Proteomics===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://motips.gersteinlab.org/ &#039;&#039;&#039;MOTIPS&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2010||MOTIPS employs an efficient search algorithm to scan a target proteome for potential domain targets and to increase the accuracy of each hit by integrating a variety of pre-computed features, such as conservation, surface propensity, and disorder.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://proteomics.gersteinlab.org &#039;&#039;&#039;PARE&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2007||Protein Abundance and mRNA Expression (PARE) is a tool for comparing protein abundance and mRNA expression data. In addition to globally comparing the quantities of protein and mRNA, PARE allows users to select subsets of proteins for focused study (based on functional categories and complexes). Furthermore, it highlights correlation outliers, which may warrant further investigation.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===RNA-Seq===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://act.gersteinlab.org/ &#039;&#039;&#039;ACT&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011||The aggregation and correlation toolbox (ACT) is an efficient, multifaceted toolbox for analyzing continuous signal and discrete region tracks from high-throughput genomic experiments, such as RNA-seq or ChIP-chip signal profiles from the ENCODE and modENCODE projects, or lists of single nucleotide polymorphisms from the 1000 genomes project.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://archive.gersteinlab.org/proj/rnaseq/fusionseq/ &#039;&#039;&#039;FusionSeq&#039;&#039;&#039;]&amp;lt;br&amp;gt;[http://github.gersteinlab.org/FusionSeq/ Github repo]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2010||FusionSeq may be used to identify fusion transcripts from paired-end RNA-sequencing. FusionSeq includes filters to remove spurious candidate fusions with artifacts, such as misalignment or random pairing of transcript fragments, and it ranks candidates according to several statistics. It also includes a module to identify exact sequences at breakpoint junctions.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;text-align: center;&amp;quot;|[http://archive.gersteinlab.org/proj/rnaseq/IQSeq/ &#039;&#039;&#039;IQseq&#039;&#039;&#039;]&amp;lt;br&amp;gt;[http://github.gersteinlab.org/IQSeq/ Github repo]||style=&amp;quot;text-align:center;&amp;quot;|2012||A tool for isoform quantification with RNA-seq data. Given isoform annotation and alignment of RNA-seq reads, it will use an EM algorithm to infer the most probable expression level for each isoform of a gene.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;text-align: center;&amp;quot;|[http://archive.gersteinlab.org/proj/rnaseq/rseqtools/ &#039;&#039;&#039;RSEQtools&#039;&#039;&#039;]&amp;lt;br&amp;gt;[http://github.gersteinlab.org/RSEQtools/ Github repo]||style=&amp;quot;text-align:center;&amp;quot;|2011||A suite of tools that use Mapped Read Format (MRF) for the analysis of RNA-Seq experiments. MRF is a compact data format that enables anonymization of confidential sequence information while maintaining the ability to conduct subsequent functional genomics studies. RSEQtools provides a suite of modules that convert to/from MRF data and perform common tasks such as calculating gene expression values, generating signal tracks of mapped reads, and segmenting that signal into actively transcribed regions.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=== Structural Variation ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://sv.gersteinlab.org/age/ &#039;&#039;&#039;AGE&#039;&#039;&#039;]&amp;lt;br&amp;gt; [http://papers.gersteinlab.org/papers/age/index.html (citation)]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011|| AGE is used for defining breakpoints of genomic structural variants at single-nucleotide resolution, using optimal alignments with gap excision.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://sv.gersteinlab.org/cnvnator/ &#039;&#039;&#039;CNVnator&#039;&#039;&#039;] &amp;lt;br&amp;gt;[http://papers.gersteinlab.org/papers/CNVnator/index.html (citation)]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011|| CNVnator may be used to discover, genotype, and characterize typical and atypical CNVs from familial and population genome sequencing.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Networks=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://archive.gersteinlab.org/proj/DynaSIN/index.html &#039;&#039;&#039;DynaSIN&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011||The Dynamic Structure Interaction Network (DynaSIN) is a resource for studying protein-protein interaction networks in the context of conformational changes.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[https://github.com/gersteinlab/OrthoClust &#039;&#039;&#039;OrthoClust&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2014||A computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://pubnet.gersteinlab.org/ &#039;&#039;&#039;PubNet&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2005||A web-based tool that extracts several types of relationships returned by PubMed queries and maps them on to networks, allowing for graphical visualization, textual navigation, and topological analysis. &lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://networks.gersteinlab.org/genome/interactions/networks/core.html &#039;&#039;&#039;TopNet&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2004||TopNet is an automated web tool designed compare the topologies of sub-networks, looking for global differences associated with different types of proteins. This automated web tool designed to address this question, calculating and comparing topological characteristics for different sub-networks derived from any given protein network.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://tyna.gersteinlab.org/tyna/ &#039;&#039;&#039;tYNA&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2006||(TopNet-like Yale Network Analyzer). A Web system for managing, comparing and mining multiple networks, both directed and undirected. tYNA efficiently implements methods that have proven useful in network analysis, including identifying defective cliques, finding small network motifs (such as feed-forward loops), calculating global statistics (such as the clustering coefficient and eccentricity), and identifying hubs and bottlenecks etc.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Structure and Macromolecular Motions=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|  [http://3vee.molmovdb.org/ &#039;&#039;&#039;3V&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2010||&lt;br /&gt;
The 3V web server extracts and comprehensively analyzes the internal volumes of input RNA and protein structures. It identifies internal volumes by taking the difference between two rolling-probe solvent-excluded surfaces.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://helix.gersteinlab.org/ &#039;&#039;&#039;HIT&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2006||The Helix Interaction Tool (HIT) is a comprehensive package for analyzing helix-helix packing in proteins. This enables the user to obtain quantitative measures of the helix interaction surface area and helix crossing angle, as well as several methods for visualizing the helical interaction.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://geometry.molmovdb.org &#039;&#039;&#039;Macromolecular Geometry and Packing Tools&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|1994-2009||A number of programs for calculating properties of protein and nucleic acid structures have been collected into a single distribution. Included is a library functions for analyzing structures, a convenient interactive command-line interpreter, and software for the calculation of geometrical quantities associated with macromolecular structures and their motions.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://www2.molmovdb.org/wiki/info/index.php/Morph_Server &#039;&#039;&#039;Morph Server&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2000||A web server for generating and viewing models of protein conformational change using interpolation with energy minimization. The user may opt to use either single- or multi-chain proteins as input.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://stress.molmovdb.org/ &#039;&#039;&#039;STRESS&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/STRESS Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2016||STRucturally-identified ESSential residues (STRESS) is a web tool that enables users to submit PDB-formatted protein structures to predict both surface- and interior-allosteric residues. The software behind this tool employs 3D structures to build models of protein conformational change in order to perform allosteric site predictions.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://intensification.gersteinlab.org// &#039;&#039;&#039;Intensification&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/Intensification Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2016||Intensification is a database that contains the results for 12 repeat protein domains, from the amplification of population-genetic signal by constructing a motif-based multiple sequence alignment (motif-MSA). We make use of the modular structure of repeat motifs to amplify signals of selection from population genetics and traditional inter-species conservation.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=more=&lt;br /&gt;
[http://info.gersteinlab.org/More_tools more tools &amp;amp; resources]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Resources&amp;diff=4242</id>
		<title>Resources</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Resources&amp;diff=4242"/>
		<updated>2017-12-04T21:25:49Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Functional Annotation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The tools &amp;amp; resources listed below have been &#039;&#039;&#039;published and are actively maintained&#039;&#039;&#039; by the Gerstein lab. You may view a list of the associated literature [http://papers.gersteinlab.org/subject/coretools/index.html &#039;&#039;&#039;here&#039;&#039;&#039;]. &lt;br /&gt;
&lt;br /&gt;
In addition to the tools below, the lab has also published a number of [http://info.gersteinlab.org/More_tools &#039;&#039;&#039;tools that are not currently being actively maintained&#039;&#039;&#039;]. &lt;br /&gt;
&lt;br /&gt;
You may also access tools that have [http://info.gersteinlab.org/Even_more_tools &#039;&#039;&#039;not yet been published&#039;&#039;&#039;]&lt;br /&gt;
&lt;br /&gt;
Source code for all software is available on our [http://github.gersteinlab.org/ Github page].&lt;br /&gt;
&lt;br /&gt;
=Portals=&lt;br /&gt;
&lt;br /&gt;
=== MolMovDB  ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:Morph-icon.jpg‎]] &amp;lt;br&amp;gt; [http://molmovdb.mbb.yale.edu/molmovdb/ MolMovDB] ||&lt;br /&gt;
Servers and a suite of accessory tools for the analysis of conformational changes in protein and nucleic acid structures.  &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Networks===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 10px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:Network.jpg|center|x75px]] &amp;lt;br&amp;gt; [http://networks.gersteinlab.org/ Networks] ||&lt;br /&gt;
The Gerstein lab has been a pioneer in applying network analysis to generate knowledge form large-scale experiments. To this end, we have developed a portal for our network research.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Pseudogene.org===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:pseudogene.png‎]] &amp;lt;br&amp;gt; [http://www.pseudogene.org/ Pseudogene.org] ||Pseudogene.org is a collection of resources related to our efforts to survey eukaryotic genomes for pseudogene sequences, &amp;quot;pseudo-fold&amp;quot; usage, amino-acid composition, and single-nucleotide polymorphisms (SNPs) to help elucidate the relationships between pseudogene families across several organisms.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Structural Variants (SV)===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;| [[File:SVpage logo.png‎|center|x85px]] &amp;lt;br&amp;gt; [http://sv.gersteinlab.org/ Structural Variants]||&lt;br /&gt;
Software that may be used to investigate Structural Variations (SVs) and Copy Number Variations (CNVs).&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Data Sets=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://sv.gersteinlab.org/breakdb/ &#039;&#039;&#039;BreakDB&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2009||This database, which is part of the PEMer package, contains information about structural variants and associated breakpoints.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Evolution=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://coevolution.gersteinlab.org/coevolution/ &#039;&#039;&#039;Coevolution analysis of protein residues&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2008||An integrated online system that enables comparative analyses of residue coevolution with a comprehensive set of commonly used scoring functions, including statistical coupling analysis (SCA), explicit likelihood of subset variation (ELSC), mutual information and correlation-based methods.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Genome Technology=&lt;br /&gt;
&lt;br /&gt;
===Gene Regulation===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[https://github.com/gersteinlab/Loregic &#039;&#039;&#039;Loregic&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2015||Loregic is a computational method integrating gene expression and regulatory network data, to characterize the logical cooperativity of regulatory factors. Loregic uses all 16 possible two-input-one-output logic gates (e.g. AND or XOR) to describe triplets of two factors regulating a common target, and finds the gate that best matches each triplet’s observed gene expression pattern across many conditions. Using human ENCODE ChIP-Seq and TCGA RNA-Seq data, we are able to demonstrate how Loregic characterizes complex circuits involving both proximally and distally regulating transcription factors (TFs) and also miRNAs.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Allele-Specific Effects===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://alleledb.gersteinlab.org &#039;&#039;&#039;AlleleDB&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2016||AlleleDB is an online resource for storing and visualizing allele-specific binding (ASB) and gene expression (ASE). Using variants from the 1000-Genomes Project and RNA-seq and ChIP-seq data from related projects, this resource serves as a repository for the catalog of ASB and ASE variants, associated genomic elements and personal genomes used in the study. AlleleDB also interfaces with the UCSC browser for visualization of results.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://alleleseq.gersteinlab.org/home.html &#039;&#039;&#039;AlleleSeq&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011||AlleleSeq is a computational pipeline that is used to study allele-specific expression (ASE) and allele specific binding (ASB). The pipeline first constructs a diploid personal genome sequence, then maps RNA-seq and ChIP-seq functional genomic data onto this personal genome. Consequently, locations in which there are differences in number of mapped reads between maternally- and paternally-derived sequences can be identified, thereby providing evidence for allele-specific events.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===ChIP-Seq ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://music.gersteinlab.org &#039;&#039;&#039;MUSIC&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/MUSIC/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2014||&lt;br /&gt;
MUSIC is an algorithm for identification of enriched regions at multiple scales in the read depth signals from ChIP-Seq experiments.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://www.gersteinlab.org/proj/PeakSeq/ &#039;&#039;&#039;PeakSeq&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/PeakSeq/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2009|| A tool for calling peaks corresponding to transcription factor binding sites from ChIP-Seq data scored against a matched control such as input DNA. PeakSeq employs a two-pass strategy in which putative binding sites are first identified in order to compensate for genomic variation in the &#039;mappability&#039; of sequences, before a second pass filters out sites not significantly enriched compared to the normalized control, computing precise enrichments and significances.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Functional Annotation ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://funseq.gersteinlab.org/ &#039;&#039;&#039;FunSeq&#039;&#039;&#039;]&amp;lt;br&amp;gt;   - &amp;amp; -   &amp;lt;br&amp;gt;[http://funseq2.gersteinlab.org/ &#039;&#039;&#039;FunSeq2&#039;&#039;&#039;]  [http://github.gersteinlab.org/FunSeq2/  Github] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2013&amp;lt;br&amp;gt;&amp;amp;&amp;lt;br&amp;gt;2014||These tools can be used to automatically score and annotate the disease-causing potential of SNVs, particularly those which are non-coding. FunSeq can detect recurrent annotation elements in non-coding regions when running with multiple personal genomes. FunSeq2 is an extension of FunSeq that provides a means of prioritizing somatic variants from cancer whole genome sequencing.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://larva.gersteinlab.org/ &#039;&#039;&#039;LARVA&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/LARVA Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2015||LARVA is a computational framework designed to facilitate the study of noncoding variants. It addresses issues that have made it difficult to derive an accurate model of the background mutation rates of noncoding elements in cancer genomes. LARVA integrates a comprehensive set of noncoding functional elements, modeling their mutation count with a beta-binomial distribution to handle overdispersion. Moreover, LARVA uses regional genomic features (such as replication timing) to better estimate local mutation rates and mutational enrichments.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://vat.gersteinlab.org/ &#039;&#039;&#039;VAT&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/vat/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2012|| A computational framework to functionally annotate variants in personal genomes using a cloud-computing environment.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://aloft.gersteinlab.org/ &#039;&#039;&#039;ALoFT&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/aloft/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2017|| A method to annotate and predict the disease-causing potential of loss-of-function variants.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://moat.gersteinlab.org/ &#039;&#039;&#039;MOAT&#039;&#039;&#039;] &amp;lt;br&amp;gt; [http://github.gersteinlab.org/MOAT/ Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2017|| MOAT (Mutations Overburdening Annotations Tool) is a computational system for identifying significant mutation burdens in genomic elements with an empirical, nonparametric method. Taking a set of variant calls and a set of annotations, MOAT calculates which annotations have observed variant counts that are substantially elevated with respect to a distribution of expected variant counts determined by permutation of the input data.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Microarrays &amp;amp; Proteomics===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://motips.gersteinlab.org/ &#039;&#039;&#039;MOTIPS&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2010||MOTIPS employs an efficient search algorithm to scan a target proteome for potential domain targets and to increase the accuracy of each hit by integrating a variety of pre-computed features, such as conservation, surface propensity, and disorder.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://proteomics.gersteinlab.org &#039;&#039;&#039;PARE&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2007||Protein Abundance and mRNA Expression (PARE) is a tool for comparing protein abundance and mRNA expression data. In addition to globally comparing the quantities of protein and mRNA, PARE allows users to select subsets of proteins for focused study (based on functional categories and complexes). Furthermore, it highlights correlation outliers, which may warrant further investigation.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===RNA-Seq===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://act.gersteinlab.org/ &#039;&#039;&#039;ACT&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011||The aggregation and correlation toolbox (ACT) is an efficient, multifaceted toolbox for analyzing continuous signal and discrete region tracks from high-throughput genomic experiments, such as RNA-seq or ChIP-chip signal profiles from the ENCODE and modENCODE projects, or lists of single nucleotide polymorphisms from the 1000 genomes project.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://archive.gersteinlab.org/proj/rnaseq/fusionseq/ &#039;&#039;&#039;FusionSeq&#039;&#039;&#039;]&amp;lt;br&amp;gt;[http://github.gersteinlab.org/FusionSeq/ Github repo]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2010||FusionSeq may be used to identify fusion transcripts from paired-end RNA-sequencing. FusionSeq includes filters to remove spurious candidate fusions with artifacts, such as misalignment or random pairing of transcript fragments, and it ranks candidates according to several statistics. It also includes a module to identify exact sequences at breakpoint junctions.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;text-align: center;&amp;quot;|[http://archive.gersteinlab.org/proj/rnaseq/IQSeq/ &#039;&#039;&#039;IQseq&#039;&#039;&#039;]&amp;lt;br&amp;gt;[http://github.gersteinlab.org/IQSeq/ Github repo]||style=&amp;quot;text-align:center;&amp;quot;|2012||A tool for isoform quantification with RNA-seq data. Given isoform annotation and alignment of RNA-seq reads, it will use an EM algorithm to infer the most probable expression level for each isoform of a gene.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;text-align: center;&amp;quot;|[http://archive.gersteinlab.org/proj/rnaseq/rseqtools/ &#039;&#039;&#039;RSEQtools&#039;&#039;&#039;]&amp;lt;br&amp;gt;[http://github.gersteinlab.org/RSEQtools/ Github repo]||style=&amp;quot;text-align:center;&amp;quot;|2011||A suite of tools that use Mapped Read Format (MRF) for the analysis of RNA-Seq experiments. MRF is a compact data format that enables anonymization of confidential sequence information while maintaining the ability to conduct subsequent functional genomics studies. RSEQtools provides a suite of modules that convert to/from MRF data and perform common tasks such as calculating gene expression values, generating signal tracks of mapped reads, and segmenting that signal into actively transcribed regions.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=== Structural Variation ===&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://sv.gersteinlab.org/age/ &#039;&#039;&#039;AGE&#039;&#039;&#039;]&amp;lt;br&amp;gt; [http://papers.gersteinlab.org/papers/age/index.html (citation)]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011|| AGE is used for defining breakpoints of genomic structural variants at single-nucleotide resolution, using optimal alignments with gap excision.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://sv.gersteinlab.org/cnvnator/ &#039;&#039;&#039;CNVnator&#039;&#039;&#039;] &amp;lt;br&amp;gt;[http://papers.gersteinlab.org/papers/CNVnator/index.html (citation)]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011|| CNVnator may be used to discover, genotype, and characterize typical and atypical CNVs from familial and population genome sequencing.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Networks=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://archive.gersteinlab.org/proj/DynaSIN/index.html &#039;&#039;&#039;DynaSIN&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2011||The Dynamic Structure Interaction Network (DynaSIN) is a resource for studying protein-protein interaction networks in the context of conformational changes.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[https://github.com/gersteinlab/OrthoClust &#039;&#039;&#039;OrthoClust&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2014||A computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://pubnet.gersteinlab.org/ &#039;&#039;&#039;PubNet&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2005||A web-based tool that extracts several types of relationships returned by PubMed queries and maps them on to networks, allowing for graphical visualization, textual navigation, and topological analysis. &lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://networks.gersteinlab.org/genome/interactions/networks/core.html &#039;&#039;&#039;TopNet&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2004||TopNet is an automated web tool designed compare the topologies of sub-networks, looking for global differences associated with different types of proteins. This automated web tool designed to address this question, calculating and comparing topological characteristics for different sub-networks derived from any given protein network.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://tyna.gersteinlab.org/tyna/ &#039;&#039;&#039;tYNA&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2006||(TopNet-like Yale Network Analyzer). A Web system for managing, comparing and mining multiple networks, both directed and undirected. tYNA efficiently implements methods that have proven useful in network analysis, including identifying defective cliques, finding small network motifs (such as feed-forward loops), calculating global statistics (such as the clustering coefficient and eccentricity), and identifying hubs and bottlenecks etc.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Structure and Macromolecular Motions=&lt;br /&gt;
:{|class=&amp;quot;wikitable sortable&amp;quot; border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;10&amp;quot;&lt;br /&gt;
|- bgcolor=&amp;quot;lightsteelblue&amp;quot;&lt;br /&gt;
!Name!!Release Date!!class=&amp;quot;unsortable&amp;quot;|Description&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|  [http://3vee.molmovdb.org/ &#039;&#039;&#039;3V&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2010||&lt;br /&gt;
The 3V web server extracts and comprehensively analyzes the internal volumes of input RNA and protein structures. It identifies internal volumes by taking the difference between two rolling-probe solvent-excluded surfaces.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://helix.gersteinlab.org/ &#039;&#039;&#039;HIT&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2006||The Helix Interaction Tool (HIT) is a comprehensive package for analyzing helix-helix packing in proteins. This enables the user to obtain quantitative measures of the helix interaction surface area and helix crossing angle, as well as several methods for visualizing the helical interaction.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://geometry.molmovdb.org &#039;&#039;&#039;Macromolecular Geometry and Packing Tools&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|1994-2009||A number of programs for calculating properties of protein and nucleic acid structures have been collected into a single distribution. Included is a library functions for analyzing structures, a convenient interactive command-line interpreter, and software for the calculation of geometrical quantities associated with macromolecular structures and their motions.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://www2.molmovdb.org/wiki/info/index.php/Morph_Server &#039;&#039;&#039;Morph Server&#039;&#039;&#039;]||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2000||A web server for generating and viewing models of protein conformational change using interpolation with energy minimization. The user may opt to use either single- or multi-chain proteins as input.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://stress.molmovdb.org/ &#039;&#039;&#039;STRESS&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/STRESS Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2016||STRucturally-identified ESSential residues (STRESS) is a web tool that enables users to submit PDB-formatted protein structures to predict both surface- and interior-allosteric residues. The software behind this tool employs 3D structures to build models of protein conformational change in order to perform allosteric site predictions.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|style=&amp;quot;width:15%; text-align:center;&amp;quot;|[http://intensification.gersteinlab.org// &#039;&#039;&#039;Intensification&#039;&#039;&#039;] &amp;lt;br&amp;gt; [https://github.com/gersteinlab/Intensification Github repo] ||style=&amp;quot;width:7%; text-align:center;&amp;quot;|2016||Intensification is a database that contains the results for 12 repeat protein domains, from the amplification of population-genetic signal by constructing a motif-based multiple sequence alignment (motif-MSA). We make use of the modular structure of repeat motifs to amplify signals of selection from population genetics and traditional inter-species conservation.&lt;br /&gt;
|-style=&amp;quot;height: 100px;&amp;quot;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=more=&lt;br /&gt;
[http://info.gersteinlab.org/More_tools more tools &amp;amp; resources]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=4152</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=4152"/>
		<updated>2017-06-26T14:49:31Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Important Public Items */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Important Public Items ==&lt;br /&gt;
*[[Selected Press Accounts Highlighting Gerstein Lab Work| &#039;&#039;&#039;Press Accounts&#039;&#039;&#039;]] highlighting the lab&lt;br /&gt;
*&#039;&#039;&#039;[[FAQ]]&#039;&#039;&#039; on tools and data&lt;br /&gt;
*Lab &#039;&#039;&#039;[[Calendar]]&#039;&#039;&#039;&lt;br /&gt;
*Info. on [[Homepage image gallery | &#039;&#039;&#039;Homepage Images&#039;&#039;&#039;]] (including [http://www.gersteinlab.org/media/images gallery of all images] &amp;amp; [http://www.gersteinlab.org/index.html.1jun11 old version] of homepage )&lt;br /&gt;
*[https://sites.gersteinlab.org/permissions Permissions] on using material generated by the lab&lt;br /&gt;
*Public posts of links and useful info to the lab are at [http://linkstream.gersteinlab.org LinkStream.gersteinlab.org] (more info. about the [[Linkstream]])&lt;br /&gt;
*A (password-protected) archive of the lab&#039;s minutes thread is available at [http://glminutes.wordpress.com GLMinutes.wordpress.com] (more info about the [[Wordpress Minutes Stream]])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
In general, this wiki hosts general, public information about the [http://gersteinlab.org &#039;&#039;&#039;Gerstein Lab&#039;&#039;&#039;], beyond that on the homepage. Lab members should consult the [http://wiki.gersteinlab.org/labinfo/ private lab wiki] for lab-specific information (login as user &amp;quot;public&amp;quot;). There&#039;s also a wiki for [http://wiki.gersteinlab.org/meetings lab meetings] (login as user &amp;quot;public&amp;quot;). To edit this wiki, contact the [[User:Infoadmin | Infoadmin]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;HR&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Other Stuff for the Public ==&lt;br /&gt;
*Info on [[Bioinformatics Lab Information System (BLIS)]] (utilizing Google Docs API)&lt;br /&gt;
*[[Summaries]] of lab research&lt;br /&gt;
*Misc. Public [[Documents]] about the lab&lt;br /&gt;
*[[Status of Lab Web Resources]]&lt;br /&gt;
*Vectorized Yale logo, download [http://archive.gersteinlab.org/docs/2015/05.19/Yale_logo.eps here]&lt;br /&gt;
&lt;br /&gt;
== More Info for New People ==&lt;br /&gt;
* [http://wiki.gersteinlab.org/labinfo/New_People_Entering_Lab Information] for &amp;lt;B&amp;gt;new people&amp;lt;/B&amp;gt; starting in the lab from the private wiki&lt;br /&gt;
* [http://wiki.gersteinlab.org/labinfo/Staff People] to look for to get oriented&lt;br /&gt;
* About [http://wiki.gersteinlab.org/labinfo/Group_Meeting_and_JC_Procedure Group Meetings and JClubs]&lt;br /&gt;
&lt;br /&gt;
== Info for Lab Members and Collaborators ==&lt;br /&gt;
* Working on drafts, papers, grants, presentations &amp;amp;c&lt;br /&gt;
** Lab [[Resources Document]] (NIH form)&lt;br /&gt;
**[[Lab Dropbox]]&lt;br /&gt;
** Conventions for editing Google Docs - http://goo.gl/kSrS9&lt;br /&gt;
** Pointers on [[Streamlining Draft Flow]] (Ideas related to the process of drafting and submitting papers)&lt;br /&gt;
** [[Pointers on Powerpoints]] and [[Pointers on Grant Sections]]&lt;br /&gt;
** [[CRAMTOOLS and common directory]]&lt;br /&gt;
&lt;br /&gt;
* Computing&lt;br /&gt;
** [http://wiki.gersteinlab.org/labinfo/Lab_Computing_Resources Lab Computing Resources] page. &lt;br /&gt;
** [http://hpc.yale.edu/ls-account-request/ Yale High Performance Computing Center]&lt;br /&gt;
&lt;br /&gt;
* Calls, meetings, &amp;amp;c&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Making_conference_calls_or_lab_related_calls Making phone calls] (SKYPE information included) page.&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Subscribing_NIH_email_list Subscribing to a NIH email list] page.&lt;br /&gt;
** Information on [[Meeting Invites]]&lt;br /&gt;
**  [https://portal.infocus.com/support/Product%20Downloads/InFocus_IN3114_3116_ReferenceGuide_EN.pdf Manual] for new projector ([https://portal.infocus.com/support/Product%20Pages/downloads.aspx?mic=IN3116&amp;amp;on=downloads Other info])&lt;br /&gt;
&lt;br /&gt;
* Some Useful University Policies&lt;br /&gt;
** Snippets from [[MB&amp;amp;B Policy for Graduate Students on Vacation and Travel]] and [[Policy on Postdoc Appointments]]&lt;br /&gt;
** Policies on visitors: [http://provost.yale.edu/minors-in-labs Policy on Minors in Labs] ([http://archive.gersteinlab.org/docs/2010/06.02/Policy-on-Minors-in-Labs.pdf  old]), [http://provost.yale.edu/policy-access-university-labs-and-research-facilities Policy on Access to the Lab]&lt;br /&gt;
** Travel per diem information: [http://www.yale.edu/ppdev/Guides/bluepages.pdf Yale Blue Pages]&lt;br /&gt;
&lt;br /&gt;
* Misc&lt;br /&gt;
**[[Getting Mark&#039;s signature for documents]]&lt;br /&gt;
**[[Temperature in Bass 432a]], real time measurements (see how cold it gets)&lt;br /&gt;
**Prominent [[public wiki pages linked from elsewhere]]&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Using_copier Using copier] page.&lt;br /&gt;
**[http://www.facebook.com/group.php?gid=54856837624 Gerstein Lab Facebook Page]&lt;br /&gt;
**[http://bioinfo.mbb.yale.edu/pipermail/web/ Public WEB Mailing List] (no longer in use)&lt;br /&gt;
** [[Recommendation Letters]]&lt;br /&gt;
**[https://docs.google.com/spreadsheet/pub?key=0ApmN6KUB6ILdcDh0Z1luZ3VnTVc3QV9YSmtQaXBpZXc&amp;amp;single=true&amp;amp;gid=2&amp;amp;output=html Magazine Subscriptions]&lt;br /&gt;
** Google Groups: [http://group.gersteinlab.org homepage], for [http://docs.google.com/a/gersteinlab.org DOCS]&lt;br /&gt;
** Useful links: [http://www.yale.edu/its/accounts/netid.html Yale NetID System], [http://www.yale.edu/its/network/vpn_faq.html Yale VPN FAQ], [http://www.yale.edu/its/network/wireless/faq.html Yale Wireless FAQ], [http://www.yale.edu/its/telecom/dialing.html Yale Dialing Instructions], [https://config.mail.yale.edu Configuring Yale email], [http://www.yale.edu/ris/main.html Poster Printing]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=4151</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=4151"/>
		<updated>2017-06-25T22:01:14Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Important Public Items */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Important Public Items ==&lt;br /&gt;
*[[Selected Press Accounts Highlighting Gerstein Lab Work| &#039;&#039;&#039;Press Accounts&#039;&#039;&#039;]] highlighting the lab&lt;br /&gt;
*&#039;&#039;&#039;[[FAQ]]&#039;&#039;&#039; on tools and data&lt;br /&gt;
*Lab &#039;&#039;&#039;[[Calendar]]&#039;&#039;&#039;&lt;br /&gt;
*Info. on [[Homepage image gallery | &#039;&#039;&#039;Homepage Images&#039;&#039;&#039;]] (including [http://www.gersteinlab.org/media/images gallery of all images] &amp;amp; [http://www.gersteinlab.org/index.html.1jun11 old version] of homepage )&lt;br /&gt;
*[https://sites.gersteinlab.org/permissions Permissions] on using material generated by the lab&lt;br /&gt;
*Public posts of links and useful info to the lab are at [http://linkstream.gersteinlab.org LinkStream.gersteinlab.org] (more info. about the [[Linkstream]])&lt;br /&gt;
*A (password-protected) archive of the lab&#039;s minutes thread is available at [http://glminutes.wordpress.org GLMinutes.wordpress.org] (more info about the [[Wordpress Minutes Stream]])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
In general, this wiki hosts general, public information about the [http://gersteinlab.org &#039;&#039;&#039;Gerstein Lab&#039;&#039;&#039;], beyond that on the homepage. Lab members should consult the [http://wiki.gersteinlab.org/labinfo/ private lab wiki] for lab-specific information (login as user &amp;quot;public&amp;quot;). There&#039;s also a wiki for [http://wiki.gersteinlab.org/meetings lab meetings] (login as user &amp;quot;public&amp;quot;). To edit this wiki, contact the [[User:Infoadmin | Infoadmin]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;HR&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Other Stuff for the Public ==&lt;br /&gt;
*Info on [[Bioinformatics Lab Information System (BLIS)]] (utilizing Google Docs API)&lt;br /&gt;
*[[Summaries]] of lab research&lt;br /&gt;
*Misc. Public [[Documents]] about the lab&lt;br /&gt;
*[[Status of Lab Web Resources]]&lt;br /&gt;
*Vectorized Yale logo, download [http://archive.gersteinlab.org/docs/2015/05.19/Yale_logo.eps here]&lt;br /&gt;
&lt;br /&gt;
== More Info for New People ==&lt;br /&gt;
* [http://wiki.gersteinlab.org/labinfo/New_People_Entering_Lab Information] for &amp;lt;B&amp;gt;new people&amp;lt;/B&amp;gt; starting in the lab from the private wiki&lt;br /&gt;
* [http://wiki.gersteinlab.org/labinfo/Staff People] to look for to get oriented&lt;br /&gt;
* About [http://wiki.gersteinlab.org/labinfo/Group_Meeting_and_JC_Procedure Group Meetings and JClubs]&lt;br /&gt;
&lt;br /&gt;
== Info for Lab Members and Collaborators ==&lt;br /&gt;
* Working on drafts, papers, grants, presentations &amp;amp;c&lt;br /&gt;
** Lab [[Resources Document]] (NIH form)&lt;br /&gt;
**[[Lab Dropbox]]&lt;br /&gt;
** Conventions for editing Google Docs - http://goo.gl/kSrS9&lt;br /&gt;
** Pointers on [[Streamlining Draft Flow]] (Ideas related to the process of drafting and submitting papers)&lt;br /&gt;
** [[Pointers on Powerpoints]] and [[Pointers on Grant Sections]]&lt;br /&gt;
** [[CRAMTOOLS and common directory]]&lt;br /&gt;
&lt;br /&gt;
* Computing&lt;br /&gt;
** [http://wiki.gersteinlab.org/labinfo/Lab_Computing_Resources Lab Computing Resources] page. &lt;br /&gt;
** [http://hpc.yale.edu/ls-account-request/ Yale High Performance Computing Center]&lt;br /&gt;
&lt;br /&gt;
* Calls, meetings, &amp;amp;c&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Making_conference_calls_or_lab_related_calls Making phone calls] (SKYPE information included) page.&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Subscribing_NIH_email_list Subscribing to a NIH email list] page.&lt;br /&gt;
** Information on [[Meeting Invites]]&lt;br /&gt;
**  [https://portal.infocus.com/support/Product%20Downloads/InFocus_IN3114_3116_ReferenceGuide_EN.pdf Manual] for new projector ([https://portal.infocus.com/support/Product%20Pages/downloads.aspx?mic=IN3116&amp;amp;on=downloads Other info])&lt;br /&gt;
&lt;br /&gt;
* Some Useful University Policies&lt;br /&gt;
** Snippets from [[MB&amp;amp;B Policy for Graduate Students on Vacation and Travel]] and [[Policy on Postdoc Appointments]]&lt;br /&gt;
** Policies on visitors: [http://provost.yale.edu/minors-in-labs Policy on Minors in Labs] ([http://archive.gersteinlab.org/docs/2010/06.02/Policy-on-Minors-in-Labs.pdf  old]), [http://provost.yale.edu/policy-access-university-labs-and-research-facilities Policy on Access to the Lab]&lt;br /&gt;
** Travel per diem information: [http://www.yale.edu/ppdev/Guides/bluepages.pdf Yale Blue Pages]&lt;br /&gt;
&lt;br /&gt;
* Misc&lt;br /&gt;
**[[Getting Mark&#039;s signature for documents]]&lt;br /&gt;
**[[Temperature in Bass 432a]], real time measurements (see how cold it gets)&lt;br /&gt;
**Prominent [[public wiki pages linked from elsewhere]]&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Using_copier Using copier] page.&lt;br /&gt;
**[http://www.facebook.com/group.php?gid=54856837624 Gerstein Lab Facebook Page]&lt;br /&gt;
**[http://bioinfo.mbb.yale.edu/pipermail/web/ Public WEB Mailing List] (no longer in use)&lt;br /&gt;
** [[Recommendation Letters]]&lt;br /&gt;
**[https://docs.google.com/spreadsheet/pub?key=0ApmN6KUB6ILdcDh0Z1luZ3VnTVc3QV9YSmtQaXBpZXc&amp;amp;single=true&amp;amp;gid=2&amp;amp;output=html Magazine Subscriptions]&lt;br /&gt;
** Google Groups: [http://group.gersteinlab.org homepage], for [http://docs.google.com/a/gersteinlab.org DOCS]&lt;br /&gt;
** Useful links: [http://www.yale.edu/its/accounts/netid.html Yale NetID System], [http://www.yale.edu/its/network/vpn_faq.html Yale VPN FAQ], [http://www.yale.edu/its/network/wireless/faq.html Yale Wireless FAQ], [http://www.yale.edu/its/telecom/dialing.html Yale Dialing Instructions], [https://config.mail.yale.edu Configuring Yale email], [http://www.yale.edu/ris/main.html Poster Printing]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Wordpress_Minutes_Stream&amp;diff=4150</id>
		<title>Wordpress Minutes Stream</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Wordpress_Minutes_Stream&amp;diff=4150"/>
		<updated>2017-06-25T22:00:31Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: Created page with &amp;#039;The Wordpress Minutes Stream is a new experiment to organize the lab&amp;#039;s minutes with the help of Wordpress. The idea is to use tags to cluster the items on the minutes thread for …&amp;#039;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The Wordpress Minutes Stream is a new experiment to organize the lab&#039;s minutes with the help of Wordpress. The idea is to use tags to cluster the items on the minutes thread for easier future retrieval.&lt;br /&gt;
&lt;br /&gt;
The Minutes Stream can be accessed on Wordpress.com at:&lt;br /&gt;
&lt;br /&gt;
[http://glminutes.wordpress.com/ glminutes.wordpress.com]&lt;br /&gt;
&lt;br /&gt;
There is also a fully mirrored version here, which also includes tag cloud pages for quickly retrieving minutes with a certain tag:&lt;br /&gt;
&lt;br /&gt;
[http://glminutes.gersteinlab.org/ glminutes.gersteinlab.org]&lt;br /&gt;
&lt;br /&gt;
Any emails sent to [mailto:minutes@gersteinlab.org minutes@gersteinlab.org] will appear on both sites.&lt;br /&gt;
&lt;br /&gt;
Both pages are marked private, and require login to access. For both sites, the standard user account for non-admin access is &amp;quot;glreader&amp;quot; (password shared to the &amp;quot;all&amp;quot; list)&lt;br /&gt;
&lt;br /&gt;
Currently, the tags we&#039;re using are:&lt;br /&gt;
&lt;br /&gt;
*summary&lt;br /&gt;
*scheduling&lt;br /&gt;
*todo&lt;br /&gt;
&lt;br /&gt;
In the future, we may use tags for specific meetings and projects.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=4149</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=4149"/>
		<updated>2017-06-25T21:47:46Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Important Public Items */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Important Public Items ==&lt;br /&gt;
*[[Selected Press Accounts Highlighting Gerstein Lab Work| &#039;&#039;&#039;Press Accounts&#039;&#039;&#039;]] highlighting the lab&lt;br /&gt;
*&#039;&#039;&#039;[[FAQ]]&#039;&#039;&#039; on tools and data&lt;br /&gt;
*Lab &#039;&#039;&#039;[[Calendar]]&#039;&#039;&#039;&lt;br /&gt;
*Info. on [[Homepage image gallery | &#039;&#039;&#039;Homepage Images&#039;&#039;&#039;]] (including [http://www.gersteinlab.org/media/images gallery of all images] &amp;amp; [http://www.gersteinlab.org/index.html.1jun11 old version] of homepage )&lt;br /&gt;
*[https://sites.gersteinlab.org/permissions Permissions] on using material generated by the lab&lt;br /&gt;
*Public posts of links and useful info to the lab are at [http://linkstream.gersteinlab.org LinkStream.gersteinlab.org] (more info. about the [[Linkstream]])&lt;br /&gt;
*An archive of the lab&#039;s minutes thread is available at [http://glminutes.wordpress.org GLMinutes.wordpress.org] (password protected)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
In general, this wiki hosts general, public information about the [http://gersteinlab.org &#039;&#039;&#039;Gerstein Lab&#039;&#039;&#039;], beyond that on the homepage. Lab members should consult the [http://wiki.gersteinlab.org/labinfo/ private lab wiki] for lab-specific information (login as user &amp;quot;public&amp;quot;). There&#039;s also a wiki for [http://wiki.gersteinlab.org/meetings lab meetings] (login as user &amp;quot;public&amp;quot;). To edit this wiki, contact the [[User:Infoadmin | Infoadmin]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;HR&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Other Stuff for the Public ==&lt;br /&gt;
*Info on [[Bioinformatics Lab Information System (BLIS)]] (utilizing Google Docs API)&lt;br /&gt;
*[[Summaries]] of lab research&lt;br /&gt;
*Misc. Public [[Documents]] about the lab&lt;br /&gt;
*[[Status of Lab Web Resources]]&lt;br /&gt;
*Vectorized Yale logo, download [http://archive.gersteinlab.org/docs/2015/05.19/Yale_logo.eps here]&lt;br /&gt;
&lt;br /&gt;
== More Info for New People ==&lt;br /&gt;
* [http://wiki.gersteinlab.org/labinfo/New_People_Entering_Lab Information] for &amp;lt;B&amp;gt;new people&amp;lt;/B&amp;gt; starting in the lab from the private wiki&lt;br /&gt;
* [http://wiki.gersteinlab.org/labinfo/Staff People] to look for to get oriented&lt;br /&gt;
* About [http://wiki.gersteinlab.org/labinfo/Group_Meeting_and_JC_Procedure Group Meetings and JClubs]&lt;br /&gt;
&lt;br /&gt;
== Info for Lab Members and Collaborators ==&lt;br /&gt;
* Working on drafts, papers, grants, presentations &amp;amp;c&lt;br /&gt;
** Lab [[Resources Document]] (NIH form)&lt;br /&gt;
**[[Lab Dropbox]]&lt;br /&gt;
** Conventions for editing Google Docs - http://goo.gl/kSrS9&lt;br /&gt;
** Pointers on [[Streamlining Draft Flow]] (Ideas related to the process of drafting and submitting papers)&lt;br /&gt;
** [[Pointers on Powerpoints]] and [[Pointers on Grant Sections]]&lt;br /&gt;
** [[CRAMTOOLS and common directory]]&lt;br /&gt;
&lt;br /&gt;
* Computing&lt;br /&gt;
** [http://wiki.gersteinlab.org/labinfo/Lab_Computing_Resources Lab Computing Resources] page. &lt;br /&gt;
** [http://hpc.yale.edu/ls-account-request/ Yale High Performance Computing Center]&lt;br /&gt;
&lt;br /&gt;
* Calls, meetings, &amp;amp;c&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Making_conference_calls_or_lab_related_calls Making phone calls] (SKYPE information included) page.&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Subscribing_NIH_email_list Subscribing to a NIH email list] page.&lt;br /&gt;
** Information on [[Meeting Invites]]&lt;br /&gt;
**  [https://portal.infocus.com/support/Product%20Downloads/InFocus_IN3114_3116_ReferenceGuide_EN.pdf Manual] for new projector ([https://portal.infocus.com/support/Product%20Pages/downloads.aspx?mic=IN3116&amp;amp;on=downloads Other info])&lt;br /&gt;
&lt;br /&gt;
* Some Useful University Policies&lt;br /&gt;
** Snippets from [[MB&amp;amp;B Policy for Graduate Students on Vacation and Travel]] and [[Policy on Postdoc Appointments]]&lt;br /&gt;
** Policies on visitors: [http://provost.yale.edu/minors-in-labs Policy on Minors in Labs] ([http://archive.gersteinlab.org/docs/2010/06.02/Policy-on-Minors-in-Labs.pdf  old]), [http://provost.yale.edu/policy-access-university-labs-and-research-facilities Policy on Access to the Lab]&lt;br /&gt;
** Travel per diem information: [http://www.yale.edu/ppdev/Guides/bluepages.pdf Yale Blue Pages]&lt;br /&gt;
&lt;br /&gt;
* Misc&lt;br /&gt;
**[[Getting Mark&#039;s signature for documents]]&lt;br /&gt;
**[[Temperature in Bass 432a]], real time measurements (see how cold it gets)&lt;br /&gt;
**Prominent [[public wiki pages linked from elsewhere]]&lt;br /&gt;
**[http://wiki.gersteinlab.org/labinfo/Using_copier Using copier] page.&lt;br /&gt;
**[http://www.facebook.com/group.php?gid=54856837624 Gerstein Lab Facebook Page]&lt;br /&gt;
**[http://bioinfo.mbb.yale.edu/pipermail/web/ Public WEB Mailing List] (no longer in use)&lt;br /&gt;
** [[Recommendation Letters]]&lt;br /&gt;
**[https://docs.google.com/spreadsheet/pub?key=0ApmN6KUB6ILdcDh0Z1luZ3VnTVc3QV9YSmtQaXBpZXc&amp;amp;single=true&amp;amp;gid=2&amp;amp;output=html Magazine Subscriptions]&lt;br /&gt;
** Google Groups: [http://group.gersteinlab.org homepage], for [http://docs.google.com/a/gersteinlab.org DOCS]&lt;br /&gt;
** Useful links: [http://www.yale.edu/its/accounts/netid.html Yale NetID System], [http://www.yale.edu/its/network/vpn_faq.html Yale VPN FAQ], [http://www.yale.edu/its/network/wireless/faq.html Yale Wireless FAQ], [http://www.yale.edu/its/telecom/dialing.html Yale Dialing Instructions], [https://config.mail.yale.edu Configuring Yale email], [http://www.yale.edu/ris/main.html Poster Printing]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Calendar_maintainers&amp;diff=4136</id>
		<title>Calendar maintainers</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Calendar_maintainers&amp;diff=4136"/>
		<updated>2017-05-11T20:21:56Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The individuals below have responsibilities for maintaining different parts of the lab calendar. These should *also* try to turn up on their call. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
- Wormfly -------- JX (gcal)&lt;br /&gt;
- sv2anal -------- FN,XK (gcal)&lt;br /&gt;
- Entex ---------- JR,TG&lt;br /&gt;
- Psychencode ---- SL&lt;br /&gt;
- Gencode -------- PM&lt;br /&gt;
- gsp/cmg -------- AH,BW&lt;br /&gt;
- encode4 -------- JZ&lt;br /&gt;
- exrna ---------- JR&lt;br /&gt;
- ncvarg --------- SKL&lt;br /&gt;
- 1000G-SV ------- FN,XK&lt;br /&gt;
- pcawg ---------- SK&lt;br /&gt;
- special events - JR&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Calendar_maintainers&amp;diff=4133</id>
		<title>Calendar maintainers</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Calendar_maintainers&amp;diff=4133"/>
		<updated>2017-05-09T20:17:10Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: Revamped calendar maintainers&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The individuals below have responsibilities for maintaining different parts of the lab calendar. These should *also* try to turn up on their call. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
- Wormfly -------- JX (gcal)&lt;br /&gt;
- sv2anal -------- FN,XK (gcal)&lt;br /&gt;
- Entex ---------- JR,TG&lt;br /&gt;
- Psychencode ---- SL&lt;br /&gt;
- Gencode -------- PM&lt;br /&gt;
- gsp/cmg -------- AH,BW&lt;br /&gt;
- encode4 -------- JZ&lt;br /&gt;
- exrna ---------- JR&lt;br /&gt;
- ncvarg --------- SKL&lt;br /&gt;
- 1000G-SV ------- FN,XK&lt;br /&gt;
- Gencode -------- PM&lt;br /&gt;
- Psychencode ---- SL&lt;br /&gt;
- cmg unsolved --- AH,BW&lt;br /&gt;
- pcawg ---------- SK&lt;br /&gt;
- special events - JR&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Summaries&amp;diff=3833</id>
		<title>Summaries</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Summaries&amp;diff=3833"/>
		<updated>2016-03-16T18:42:45Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;See to [http://wiki.gersteinlab.org/labinfo/Uploading_documents upload documents]&lt;br /&gt;
&lt;br /&gt;
* Summaries of different research areas &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2016/03.11/Research-summary-nlmtg16-MarkGerstein-11mar16mg.docx Gerstein Lab Experience in Several Major National Biomedical Data Science Initiatives, including 1KG, ENCODE, Brainspan, psychENCODE, Cancer Genomics, Genome Data Privacy, Proteomics and Networks] uploaded by LL on 11 March &#039;16 (nlmtg16)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2016/01.20/gtechR01_summary4wiki.pdf Gerstein Lab experience in noncoding genome annotation, allelic analysis and somatic variant prioritization] uploaded by SKL on 20 January &#039;16 (gtechR01)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.15/calling-and-prioritizing.docx Gerstein Lab Experience in calling and prioritizing variants and regions ] uploaded by LS on 15 December &#039;15 (CanDisp)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/4dnbod_2_summary.docx	Gerstein Lab Experience in ChIP-Seq and RNA-Seq Analysis (brief) as of Dec. 2015] uploaded by MRS on 10 December &#039;15 (4dnbod[take2])&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.03/IBC.public.pdf Gerstein Lab Experience in breast cancer genome analysis] uploaded by XL on 3 December &#039;15 (SWOG)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.03/prvWrk_allost.pdf Gerstein Lab Experience in Predicting Allosteric Hotspots (mmls)] uploaded by DC on 3 December &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.01/RNASeqChipSeqintegrative_analysis.pdf Gerstein Lab experience in RNA-Seq, Chip-Seq and integrative analysis] uploaded by SL on 1 December &#039;15 (pec2)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.01/RNAseq_processing_pipelines_DS.docx Gerstein Lab experience creating RNAseq processing pipelines] uploaded by DS on 1 Dec &#039;15 (u19asth)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.01/Privacy_Research_Summary_11.2015.docx Gerstein lab experience in genomic privacy] uploaded by AH on Nov 30 &#039;15 (bd2kpr)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/NSF-BBSRC-grantSummaryNov2015toUSE.docx	 Gerstein lab experience in molecular characterisation of gene phenotypes] uploaded by CSDS on Nov 24 &#039;15 (bbsrc)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/Summary_efforts_big_data.docx Gerstein lab experience in data science] uploaded by KKY on Nov 23 &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/Summary_networks_tools.docx  Gerstein lab network related tools] uploaded by KKY on Nov 23 &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/09.18/Gerstein_Lab_Structural_Variation_and_Functional_Impact.pdf Gerstein Lab Structural Variation and Functional Impact (gspac)] uploaded by FN on 18 September &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/06.01/Gerstein_Lab_analyzing_disease_implications_of_variants.pdf Gerstein Lab exprience analyzing disease implications of genetic variants as of Mar 2015] uploaded by TG on 1 June &#039;15 (cmg2)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/06.01/Gerstein_Lab_ChipSeq_RNASeq_Networks.pdf Gerstein Lab experience with Chip-Seq, RNA-Seq, and network analysis as of Feb 2015] uploaded by TG on 1 June &#039;15 (4dnbod)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/12.16/Gerstein_Lab_genome_annotation&amp;amp;variation_networks_comparative_genomics.pdf Gerstein Lab contributions to genome annotation and variation, networks, and comparative genomics] uploaded by TG on 16 December &#039;14 (4dnuc)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/12.16/Gerstein_Lab_knowledgebase_design_and_analysis_of_sci_pubs.pdf Gerstein Lab expertise with knowledge-base design and analysis of scientific publications] uploaded by TG on 16 December &#039;14 (4dnoh)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/06.13/gerstein-nsf-nrt_grant_v1.docx Yeast networks and Gerstein Lab expertise] uploaded by JC on 13 June &#039;14 (nrt)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/03.06/YCC_report_2014.docx Gerstein Lab Cancer Research Accomplishments] uploaded by LL on 6 March &#039;14 &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/02.24/1/Gerstein_Lab_network.htm Gerstein Lab contributions to statistical models of gene expression, network analysis, and mapping variation onto networks up to end of 2013] uploaded by DW on 24 February &#039;14 (dddic)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/01.24/Gerstein_lab_annotation_contributions.pdf	 Annotation related contributions by Gerstein lab up to end of 2013] uploaded by SK on 24 Jan &#039;14 (ncvarg)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2013/12.02/Gerstein_lab_rna_seq_contributions_120213.pdf Gerstein Lab Contributions to RNA-Seq Analysis up to end of 2013] uploaded by MRS on 2 December &#039;13 (exprofile)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2013/11.25/GersteinLabContributions.docx Contributions to Networks &amp;amp; Genomics by Gerstein lab up to end of 2013] uploaded by ANS on 25 November &#039;13 (erggr)&lt;br /&gt;
 &lt;br /&gt;
[http://archive.gersteinlab.org/docs/2012/03.13/Summary_of_Gersteinlab_Genomics_contributions_upto_2012.doc Summary of Gersteinlab Genomics contributions upto 2012] uploaded by DW on 13 March &#039;12 (3dac)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2011/01.16/Large-scale-collaborations-GersteinLab-involved-with.doc Writeup on large collaborations involving the Gerstein Lab] uploaded by mbg on 16 January &#039;11 (3dac)&lt;br /&gt;
&lt;br /&gt;
* Progress on various projects &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2012/09.09/keck-2012-progress-report.doc Genome annotation, data integration, protein structure, sequencing costs, and privacy] uploaded by SB on 9 September &#039;12 (keck)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/05.22/nih-CEGS-progress-report-2010Q1Q2-informatics-22May10mg.doc Informatics 2010Q1Q2; Gene Regulatory Networks] uploaded by MG on 22 May &#039;10 (cegs)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/05.22/EXCERPTS-FROM--nih-CEGS-progress-report-14apr10mg.htm RNA-Seq, Chip-Seq, Human Variation, Informatics 14apr10mg] uploaded by MG on 22 May &#039;10 (cegs)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/04.20/encode2-pgenes-progress-report-for-yr3-20Apr10-web-ver.doc Encode2 pseudogenes] uploaded by YHL on 20 April &#039;10 (encode2pgenes)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/04.16/EDITFORWEB--Update-for-Annual-Mtg--NIH-CEGS-2008-2009-informatics-progress-report.doc Informatics &#039;08-&#039;09; Structural Variation, Regulatory Networks] uploaded by mg on 16 April &#039;10 (cegs)&lt;br /&gt;
&lt;br /&gt;
* Internal [https://docs.google.com/spreadsheets/d/1d4yWl-xCbvuJAY86JS5R1o91S_mQy2blyViusMTx54c/edit#gid=0 gsheet ] listing contributions&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Summaries&amp;diff=3832</id>
		<title>Summaries</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Summaries&amp;diff=3832"/>
		<updated>2016-03-14T20:57:31Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;See to [http://wiki.gersteinlab.org/labinfo/Uploading_documents upload documents]&lt;br /&gt;
&lt;br /&gt;
* Summaries of different research areas &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2016/03.11/Research-summary-nlmtg16-MarkGerstein-11mar16mg.docx Gerstein Lab Experience in Several Major National Biomedical Data Science Initiatives, including 1KG, ENCODE, Brainspan, psychENCODE, Cancer Genomics, Genome Data Privacy, Proteomics and Networks] uploaded by LL on 11 March &#039;16 &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2016/01.20/gtechR01_summary4wiki.pdf Gerstein Lab experience in noncoding genome annotation, allelic analysis and somatic variant prioritization] uploaded by SKL on 20 January &#039;16 (gtechR01)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.15/calling-and-prioritizing.docx Gerstein Lab Experience in calling and prioritizing variants and regions ] uploaded by LS on 15 December &#039;15 (CanDisp)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/4dnbod_2_summary.docx	Gerstein Lab Experience in ChIP-Seq and RNA-Seq Analysis (brief) as of Dec. 2015] uploaded by MRS on 10 December &#039;15 (4dnbod[take2])&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.03/IBC.public.pdf Gerstein Lab Experience in breast cancer genome analysis] uploaded by XL on 3 December &#039;15 (SWOG)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.03/prvWrk_allost.pdf Gerstein Lab Experience in Predicting Allosteric Hotspots (mmls)] uploaded by DC on 3 December &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.01/RNASeqChipSeqintegrative_analysis.pdf Gerstein Lab experience in RNA-Seq, Chip-Seq and integrative analysis] uploaded by SL on 1 December &#039;15 (pec2)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.01/RNAseq_processing_pipelines_DS.docx Gerstein Lab experience creating RNAseq processing pipelines] uploaded by DS on 1 Dec &#039;15 (u19asth)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.01/Privacy_Research_Summary_11.2015.docx Gerstein lab experience in genomic privacy] uploaded by AH on Nov 30 &#039;15 (bd2kpr)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/NSF-BBSRC-grantSummaryNov2015toUSE.docx	 Gerstein lab experience in molecular characterisation of gene phenotypes] uploaded by CSDS on Nov 24 &#039;15 (bbsrc)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/Summary_efforts_big_data.docx Gerstein lab experience in data science] uploaded by KKY on Nov 23 &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.10/Summary_networks_tools.docx  Gerstein lab network related tools] uploaded by KKY on Nov 23 &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/09.18/Gerstein_Lab_Structural_Variation_and_Functional_Impact.pdf Gerstein Lab Structural Variation and Functional Impact (gspac)] uploaded by FN on 18 September &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/06.01/Gerstein_Lab_analyzing_disease_implications_of_variants.pdf Gerstein Lab exprience analyzing disease implications of genetic variants as of Mar 2015] uploaded by TG on 1 June &#039;15 (cmg2)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/06.01/Gerstein_Lab_ChipSeq_RNASeq_Networks.pdf Gerstein Lab experience with Chip-Seq, RNA-Seq, and network analysis as of Feb 2015] uploaded by TG on 1 June &#039;15 (4dnbod)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/12.16/Gerstein_Lab_genome_annotation&amp;amp;variation_networks_comparative_genomics.pdf Gerstein Lab contributions to genome annotation and variation, networks, and comparative genomics] uploaded by TG on 16 December &#039;14 (4dnuc)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/12.16/Gerstein_Lab_knowledgebase_design_and_analysis_of_sci_pubs.pdf Gerstein Lab expertise with knowledge-base design and analysis of scientific publications] uploaded by TG on 16 December &#039;14 (4dnoh)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/06.13/gerstein-nsf-nrt_grant_v1.docx Yeast networks and Gerstein Lab expertise] uploaded by JC on 13 June &#039;14 (nrt)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/03.06/YCC_report_2014.docx Gerstein Lab Cancer Research Accomplishments] uploaded by LL on 6 March &#039;14 &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/02.24/1/Gerstein_Lab_network.htm Gerstein Lab contributions to statistical models of gene expression, network analysis, and mapping variation onto networks up to end of 2013] uploaded by DW on 24 February &#039;14 (dddic)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/01.24/Gerstein_lab_annotation_contributions.pdf	 Annotation related contributions by Gerstein lab up to end of 2013] uploaded by SK on 24 Jan &#039;14 (ncvarg)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2013/12.02/Gerstein_lab_rna_seq_contributions_120213.pdf Gerstein Lab Contributions to RNA-Seq Analysis up to end of 2013] uploaded by MRS on 2 December &#039;13 (exprofile)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2013/11.25/GersteinLabContributions.docx Contributions to Networks &amp;amp; Genomics by Gerstein lab up to end of 2013] uploaded by ANS on 25 November &#039;13 (erggr)&lt;br /&gt;
 &lt;br /&gt;
[http://archive.gersteinlab.org/docs/2012/03.13/Summary_of_Gersteinlab_Genomics_contributions_upto_2012.doc Summary of Gersteinlab Genomics contributions upto 2012] uploaded by DW on 13 March &#039;12 (3dac)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2011/01.16/Large-scale-collaborations-GersteinLab-involved-with.doc Writeup on large collaborations involving the Gerstein Lab] uploaded by mbg on 16 January &#039;11 (3dac)&lt;br /&gt;
&lt;br /&gt;
* Progress on various projects &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2012/09.09/keck-2012-progress-report.doc Genome annotation, data integration, protein structure, sequencing costs, and privacy] uploaded by SB on 9 September &#039;12 (keck)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/05.22/nih-CEGS-progress-report-2010Q1Q2-informatics-22May10mg.doc Informatics 2010Q1Q2; Gene Regulatory Networks] uploaded by MG on 22 May &#039;10 (cegs)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/05.22/EXCERPTS-FROM--nih-CEGS-progress-report-14apr10mg.htm RNA-Seq, Chip-Seq, Human Variation, Informatics 14apr10mg] uploaded by MG on 22 May &#039;10 (cegs)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/04.20/encode2-pgenes-progress-report-for-yr3-20Apr10-web-ver.doc Encode2 pseudogenes] uploaded by YHL on 20 April &#039;10 (encode2pgenes)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/04.16/EDITFORWEB--Update-for-Annual-Mtg--NIH-CEGS-2008-2009-informatics-progress-report.doc Informatics &#039;08-&#039;09; Structural Variation, Regulatory Networks] uploaded by mg on 16 April &#039;10 (cegs)&lt;br /&gt;
&lt;br /&gt;
* Internal [https://docs.google.com/spreadsheets/d/1d4yWl-xCbvuJAY86JS5R1o91S_mQy2blyViusMTx54c/edit#gid=0 gsheet ] listing contributions&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Documents&amp;diff=3828</id>
		<title>Documents</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Documents&amp;diff=3828"/>
		<updated>2016-02-18T21:34:03Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;(This page contains sundry public documents related to the Gerstein Lab. See to [http://wiki.gersteinlab.org/labinfo/Uploading_documents upload documents])&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/public-docs/2016/02.18/YCC_2016.docx Lab Cancer Research Summary 2015] uploaded by LL on 18 February &#039;16&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2016/02.16/Personal_Genome_Analyses.doc Personal Genome Analyses] uploaded by JR on 16 February &#039;16&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2016/02.05/6-slide-overview-of-the-lab_XL.pdf Lab_Overview] uploaded by XL on 5 February &#039;16&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/12.03/prvWrk_allost.pdf Gerstein Lab Experience in Predicting Allosteric Hotspots] uploaded by DC on 3 December &#039;15&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2015/06.09/1kg_fun_studies_revised.htm 1kg fun studies] updated by JC on 9 June &#039;15 &lt;br /&gt;
&lt;br /&gt;
Slide of [http://info.gersteinlab.org/images/5/59/CBB-US-Programs.pptx US Programs in Bioinformatics] uploaded by JAS on 11Nov 2014&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2014/03.12/Binder1_Gerstein.pdf Gerstein Lab at YINS] uploaded by KKY on 12 March &#039;14&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2011/03.15/1KGpapers-paragraphs.docx Summary of 1KG papers] uploaded by csds on 15 March &#039;11 &lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2011/01.14/people-and-space-as-of-20101231--EXTRACT-FROM--ann-rep-2010.doc People in the Lab &amp;amp; Room Space as of 31-Dec-2010] uploaded by mbg on 14 January &#039;11&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/03.17/Gersteinlab_Kline.pdf Exhibit slides] for Kline Science Library (uploaded by CS on 17 Mar &#039;10)&lt;br /&gt;
&lt;br /&gt;
Article on [http://archive.gersteinlab.org/docs/2010/03.27/Evolving-Definition-of-a-Gene--by-Karen-Hopkin--Bioscience-Dec09--Quotes-MG--bio.2009.59.11.pdf &amp;quot;Evolving Definition of a Gene&amp;quot;] (refers to Gerstein Lab &amp;amp; various ENCODE folks)&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/03.27/People-in-GersteinLab-at-end-of-2009.xls People-in-GersteinLab-at-end-of-2009.xls] uploaded by MG on 27 March &#039;10&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/04.26/M-Gerstein-CV-26Apr10 M Gerstein Full CV as of 26 Apr.] uploaded by MG on 26 April &#039;10&lt;br /&gt;
&lt;br /&gt;
[[Analysis of Lab Publications]]&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/06.24/1/metamembrane-cover-art.ai Illustrator File of Cover Art for Genome Research article on &amp;quot;metamembrane&amp;quot;] uploaded by mbg on 24 June &#039;10&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2012/02.13/Proteins_I_and_II_Rinehart.pdf Prof. Reinhart&#039;s lecture slides for CBB 752 Spring 2012] uploaded by LL on 13 February &#039;12&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/docs/2010/05.31/research-report-2009-v2-mg30May.htm research-report-2009-v2-mg30May] (2009 Research Summary) uploaded by MG on 31 May &#039;10&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Here is a [http://archive.gersteinlab.org/docs/summary.txt data file] of all uploaded things.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Lab_Address&amp;diff=3827</id>
		<title>Lab Address</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Lab_Address&amp;diff=3827"/>
		<updated>2016-02-17T19:32:25Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Main Room Numbers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==PI Contact Address==&lt;br /&gt;
Mark Gerstein&lt;br /&gt;
([http://contact.gerstein.info contact.gerstein.info])&lt;br /&gt;
&lt;br /&gt;
==Admin Assistant==&lt;br /&gt;
Lori Iannicelli&amp;lt;BR&amp;gt;&lt;br /&gt;
266 Whitney Ave, Bass 432, New Haven, CT 06520&amp;lt;BR&amp;gt;&lt;br /&gt;
adminasst at gersteinlab.org&amp;lt;BR&amp;gt;&lt;br /&gt;
203-432-8189&lt;br /&gt;
&lt;br /&gt;
==Main Room Numbers==&lt;br /&gt;
*Bass  426: 203 432 8972&lt;br /&gt;
*Bass  428: 203 432 5065&lt;br /&gt;
*Bass  434: 203 436 4442&lt;br /&gt;
*Bass  437: 203 432 5405&lt;br /&gt;
*Gibbs 352: 203 432 8771 (near window)&lt;br /&gt;
*[[Mihali Felipe]] (sysadm): 203 432 8771 (Emergencies only: 203 623-8873)&lt;br /&gt;
&lt;br /&gt;
[http://wiki.gersteinlab.org/labinfo/Key_to_Gibbs Key to rooms in Gibbs]&lt;br /&gt;
&lt;br /&gt;
==Other Useful Numbers==&lt;br /&gt;
*Gibbs 362A (Core Conference Room): 203 432 6191&lt;br /&gt;
*Gibbs  371 (Core Labs Terminal Room)): 203 432 5615&lt;br /&gt;
*Gibbs 355E (Art Perlo, Core Sysadm): 203 432 5614&lt;br /&gt;
*Gibbs 355D (David Keller, Core Sysadm2): 203 432 5743&lt;br /&gt;
*Bass 432 : (Lori Iannicelli): 203 432 8189&lt;br /&gt;
*Bass 432A (Mark Gerstein): 203 432 6105&lt;br /&gt;
*Bass 424: conference room on Bass 4th floor&lt;br /&gt;
*Bass 425: 203 432 5599&lt;br /&gt;
*AKW 204: 203 432 1209&lt;br /&gt;
*AKW 206: office in AKW that Mark occasionally uses (changed from 205)&lt;br /&gt;
&lt;br /&gt;
== Laboratory Floor Plans &amp;amp; Room Instructions ==&lt;br /&gt;
*[[Bass Fourth Floor]]&lt;br /&gt;
*[http://www.gersteinlab.org/people/figures/gibbs-core-plan-2.gif Gibbs Core Facility]&lt;br /&gt;
* West Campus&lt;br /&gt;
*[https://docs.google.com/spreadsheets/d/1sRW8vqODqW25uFiOvo1YEN86ApvGuqN6jYIZgWWnkj4/pubhtml Square Footage of Assigned Rooms]&lt;br /&gt;
*[https://docs.google.com/presentation/d/16eDPLAC76DcMaCcnMsRcqKV1ggm4sbJ0ikofTPsUaBo/pub?start=false&amp;amp;loop=false&amp;amp;delayms=3000 Current Desk Assignments]&lt;br /&gt;
*[[Room Temperatures]]&lt;br /&gt;
&lt;br /&gt;
==Directions and Maps==&lt;br /&gt;
We&#039;re mostly located in the fourth floor of the [http://www.gersteinlab.org/people/bass.jpg Bass Building], but also have some outposts in Gibbs and AKW. The Bass Center is located at 266 Whitney Avenue. It faces the Biology department in Klein Tower. Click [http://www.gersteinlab.org/people/gerstein-map.html here] for an interactive satellite image of the Science Hill area from Google maps. Follow the link for a interactive [http://www.facilities.yale.edu/campus/Imapnew.asp campus map] and information for the [http://www.facilities.yale.edu/Campus/building1.asp?lstbldg=1030&amp;amp;sec=3 Bass Center] from buildings &amp;amp; grounds.&lt;br /&gt;
&lt;br /&gt;
General [http://www.yale.edu/directions.html Directions] to Yale University are posted on the University webserver. These include a detailed University [http://www.yale.edu/yaleinfo/maps/5.html map] locating the Bass building in the Science Hill area. Directions locating the Bass building on Science Hill is available [http://www.gersteinlab.org/people/directions.html here]. (Some of these maps are locally [http://bioinfo.mbb.yale.edu/~mbg/yale-maps mirrored]; in particular, a [http://bioinfo.mbb.yale.edu/~mbg/yale-maps/ConnAndVicinityReliefMap.gif relief map] of Connecticut and a map of the [http://bioinfo.mbb.yale.edu/~mbg/yale-maps/TownsOfGreaterNewHaven.gif towns] in the New Haven area).&lt;br /&gt;
&lt;br /&gt;
For walkers, note we&#039;re : [http://goo.gl/maps/ihKii 29&#039;] from the center of the Medical School at 300 Cedar St., [http://goo.gl/maps/GtECl 23&#039;] from 300 George St., [http://goo.gl/maps/vjNKv 9&#039;] from food area at Whitney &amp;amp; Audubon (e.g. Gourmet Heaven &amp;amp; Koffee), and [http://goo.gl/maps/ANU4B 7&#039;] from Orange &amp;amp; Pearl Streets (location of Cafe Romeo).&lt;br /&gt;
&lt;br /&gt;
==Rail==&lt;br /&gt;
[http://www.mta.nyc.ny.us/mnr/ Metro-North] and [http://www.amtrak.com/ Amtrak] service to New Haven (Taxis available at station.)&lt;br /&gt;
&lt;br /&gt;
==Driving and Parking==&lt;br /&gt;
===From Boston===&lt;br /&gt;
#Take Mass Pike to Interstate 84. &lt;br /&gt;
#Take Interstate 84 South to Hartford.&lt;br /&gt;
#Take Interstate 91 South to New Haven.&lt;br /&gt;
#Take Exit 3 off of Interstate 91.&lt;br /&gt;
#Turn right at the stoplight at the end of the exit ramp onto Orange Street. &lt;br /&gt;
#Turn left onto Humphrey Street at next stoplight. &lt;br /&gt;
#Proceed through the next stoplight into the parking lot (Lot 16). &lt;br /&gt;
&lt;br /&gt;
===From New York===&lt;br /&gt;
#Take Interstate 95 North to Interstate 91 at New Haven&lt;br /&gt;
#Take exit 3 off Interstate 91. &lt;br /&gt;
#Turn right at the stoplight at the end of the exit ramp onto Orange Street.&lt;br /&gt;
#Turn left onto Humphrey Street at next stoplight.&lt;br /&gt;
#Proceed through the next stoplight into the parking lot (Lot 16). &lt;br /&gt;
(Before getting on 95, it&#039;s nice to take the Merritt. You might want to follow this [http://www.gersteinlab.org/people/ny-to-ct.gif route].)&lt;br /&gt;
&lt;br /&gt;
===Parking===&lt;br /&gt;
There is a parking lot on Edwards St available for visitors. Passing &amp;quot;lot 29&amp;quot; is [https://www.google.com/maps/place/Pierson-Sage+Garage Pierson-Sage Garage]. Enter the code to enter.&lt;br /&gt;
&lt;br /&gt;
The code can be found [http://wiki.gersteinlab.org/labinfo/Parking_lot_for_visitors on the private wiki]. Please contact your host for the code or &amp;quot;Lori&amp;quot;. You must exit the same way you entered.&lt;br /&gt;
&lt;br /&gt;
NOTE: There is also free open-surface parking after 4PM which remains open until 7AM. The lots are located on 260-266 Whitney Avenue, anywhere you seen an open gate.&lt;br /&gt;
&lt;br /&gt;
==YCMI==&lt;br /&gt;
Mark has an office at YCMI: 300 George Street, Room 522; (203) 737-6029 (main YCMI line)&lt;br /&gt;
&lt;br /&gt;
==Other Stuff==&lt;br /&gt;
[http://yp.yahoo.com/py/ypResults.py?Pyt=Tmap&amp;amp;YY=24974&amp;amp;addr=266%20Whitney%20Avenue&amp;amp;city=NEW%20HAVEN&amp;amp;state=CT&amp;amp;zip=06520&amp;amp;slt=&amp;amp;sln=&amp;amp;cs=&amp;amp;stp=y&amp;amp;stx=8903827&amp;amp;btype=default Yahoo: Nearby Restaurants], [http://maps.yahoo.com/py/maps.py?Pyt=Tmap&amp;amp;YY=28780&amp;amp;addr=266%20Whitney%20Avenue&amp;amp;city=New%20Haven&amp;amp;state=CT&amp;amp;slt=41.3180&amp;amp;sln=-72.9199&amp;amp;zip=06520&amp;amp;mag=10&amp;amp;cs=9&amp;amp;newmag=9 Map of Bass Area], [http://yp.yahoo.com/py/ypMap.py?Pyt=Tmap&amp;amp;YY=8688&amp;amp;addr=266%20Whitney%20Avenue&amp;amp;city=NEW%20HAVEN&amp;amp;state=CT&amp;amp;country=US&amp;amp;slt=41.3180&amp;amp;sln=-72.9199&amp;amp;zip=06520&amp;amp;mag=9&amp;amp;cs=9&amp;amp;stx=7737307&amp;amp;stp=y&amp;amp;ycat=7737307&amp;amp;l=11&amp;amp;tuid=566518&amp;amp;tq=8&amp;amp;btype=default Colony Inn] (nearby hotel), [http://yp.yahoo.com/py/ypResults.py?Pyt=Tmap&amp;amp;YY=32031&amp;amp;addr=266%20Whitney%20Avenue&amp;amp;city=NEW%20HAVEN&amp;amp;state=CT&amp;amp;zip=06520&amp;amp;slt=&amp;amp;sln=&amp;amp;cs=9&amp;amp;stp=y&amp;amp;stx=8014796&amp;amp;btype=default Taxi] services.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=RSEQtools/code&amp;diff=3537</id>
		<title>RSEQtools/code</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=RSEQtools/code&amp;diff=3537"/>
		<updated>2015-03-30T18:21:31Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;center&amp;gt;[http://archive.gersteinlab.org/proj/rnaseq/rseqtools &#039;&#039;&#039;RSEQtools Main Page&#039;&#039;&#039;]&amp;lt;/center&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&lt;br /&gt;
== Code ==&lt;br /&gt;
&lt;br /&gt;
=== Required Software - External ===&lt;br /&gt;
&lt;br /&gt;
# [http://www.gnu.org/software/gsl/ GSL] - GNU Scientific Library&lt;br /&gt;
# [http://hgwdev.cse.ucsc.edu/~kent/exe/linux/blatSuite.34.zip BlatSuite] - BLAT and a collection of utility programs. Note these executables have to be part of the PATH.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Download ===&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/proj/rnaseq/rseqtools RSEQtools] consists of two libraries: [http://homes.gersteinlab.org/people/lh372/SOFT/bios/index.html BIOS], which is a general C library, and &lt;br /&gt;
[http://archive.gersteinlab.org/proj/rnaseq/doc/mrf/index.html MRF] (Mapped Read Format), which contains a set of modules to analyze RNA-Seq data sets.&lt;br /&gt;
&lt;br /&gt;
==== Source code ====&lt;br /&gt;
A TAR ball that contains the source code for these two components can be downloaded here:&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.6.tar.gz RSEQtools-0.6.tar.gz] - Last updated on 11/05/2010&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.5.tar.gz RSEQtools-0.5.tar.gz] - Initial upload&lt;br /&gt;
The components should be installed in the same order as listed above. Installation instructions can be found at the respective website.&lt;br /&gt;
&lt;br /&gt;
==== Executables ====&lt;br /&gt;
Statically built binaries for UNIX and Mac OS can be found here:&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.6_UNIX.tar.gz RSEQtools-0.6_UNIX.tar.gz] - 64bit version, last updated on 11/05/2010&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.6_MacOs_10.6.tar.gz RSEQtools-0.6_MacOs_10.6.tar.gz] - MacOs 10.6, last updated on 11/05/2010&lt;br /&gt;
&lt;br /&gt;
==== Github ====&lt;br /&gt;
All of the above tarballs are also available from Github:&lt;br /&gt;
* [https://github.com/gersteinlab/RSEQtools.git https://github.com/gersteinlab/RSEQtools.git]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Important Note&lt;br /&gt;
==============&lt;br /&gt;
&lt;br /&gt;
THIS PACKAGE (RSEQtools including BIOS and MRF) IS PROVIDED &amp;quot;AS IS&amp;quot; AND WITHOUT ANY EXPRESSED OR IMPLIED&lt;br /&gt;
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES&lt;br /&gt;
OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== License information ===&lt;br /&gt;
&lt;br /&gt;
The software package is released under the [http://creativecommons.org/licenses/by-nc/2.5/legalcode Creative Commons license (Attribution-NonCommerical)]. &amp;lt;br&amp;gt;&lt;br /&gt;
For more details please refer to the [http://www.gersteinlab.org/misc/permissions.html Permissions Page] on the Gerstein Lab webpage.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=RSEQtools/code&amp;diff=3536</id>
		<title>RSEQtools/code</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=RSEQtools/code&amp;diff=3536"/>
		<updated>2015-03-30T18:15:57Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: Undo revision 3534 by Lucasl (Talk)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;center&amp;gt;[http://archive.gersteinlab.org/proj/rnaseq/rseqtools &#039;&#039;&#039;RSEQtools Main Page&#039;&#039;&#039;]&amp;lt;/center&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&lt;br /&gt;
== Code ==&lt;br /&gt;
&lt;br /&gt;
=== Required Software - External ===&lt;br /&gt;
&lt;br /&gt;
# [http://www.gnu.org/software/gsl/ GSL] - GNU Scientific Library&lt;br /&gt;
# [http://hgwdev.cse.ucsc.edu/~kent/exe/linux/blatSuite.34.zip BlatSuite] - BLAT and a collection of utility programs. Note these executables have to be part of the PATH.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Download ===&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/proj/rnaseq/rseqtools RSEQtools] consists of two libraries: [http://homes.gersteinlab.org/people/lh372/SOFT/bios/index.html BIOS], which is a general C library, and &lt;br /&gt;
[http://archive.gersteinlab.org/proj/rnaseq/doc/mrf/index.html MRF] (Mapped Read Format), which contains a set of modules to analyze RNA-Seq data sets.&lt;br /&gt;
&lt;br /&gt;
==== Source code ====&lt;br /&gt;
A TAR ball that contains the source code for these two components can be downloaded here:&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.6.tar.gz RSEQtools-0.6.tar.gz] - Last updated on 11/05/2010&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.5.tar.gz RSEQtools-0.5.tar.gz] - Initial upload&lt;br /&gt;
The components should be installed in the same order as listed above. Installation instructions can be found at the respective website.&lt;br /&gt;
&lt;br /&gt;
==== Executables ====&lt;br /&gt;
Statically built binaries for UNIX and Mac OS can be found here:&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.6_UNIX.tar.gz RSEQtools-0.6_UNIX.tar.gz] - 64bit version, last updated on 11/05/2010&lt;br /&gt;
* [http://archive.gersteinlab.org/proj/rnaseq/rseqtools/RSEQtools-0.6_MacOs_10.6.tar.gz RSEQtools-0.6_MacOs_10.6.tar.gz] - MacOs 10.6, last updated on 11/05/2010&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Important Note&lt;br /&gt;
==============&lt;br /&gt;
&lt;br /&gt;
THIS PACKAGE (RSEQtools including BIOS and MRF) IS PROVIDED &amp;quot;AS IS&amp;quot; AND WITHOUT ANY EXPRESSED OR IMPLIED&lt;br /&gt;
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES&lt;br /&gt;
OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== License information ===&lt;br /&gt;
&lt;br /&gt;
The software package is released under the [http://creativecommons.org/licenses/by-nc/2.5/legalcode Creative Commons license (Attribution-NonCommerical)]. &amp;lt;br&amp;gt;&lt;br /&gt;
For more details please refer to the [http://www.gersteinlab.org/misc/permissions.html Permissions Page] on the Gerstein Lab webpage.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=RSEQtools/code&amp;diff=3534</id>
		<title>RSEQtools/code</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=RSEQtools/code&amp;diff=3534"/>
		<updated>2015-03-24T17:10:17Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;center&amp;gt;[http://archive.gersteinlab.org/proj/rnaseq/rseqtools &#039;&#039;&#039;RSEQtools Main Page&#039;&#039;&#039;]&amp;lt;/center&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&lt;br /&gt;
== Code ==&lt;br /&gt;
&lt;br /&gt;
=== Required Software - External ===&lt;br /&gt;
&lt;br /&gt;
# [http://www.gnu.org/software/gsl/ GSL] - GNU Scientific Library&lt;br /&gt;
# [http://hgwdev.cse.ucsc.edu/~kent/exe/linux/blatSuite.34.zip BlatSuite] - BLAT and a collection of utility programs. Note these executables have to be part of the PATH.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Download ===&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/proj/rnaseq/rseqtools RSEQtools] consists of two libraries: [http://homes.gersteinlab.org/people/lh372/SOFT/bios/index.html BIOS], which is a general C library, and &lt;br /&gt;
[http://archive.gersteinlab.org/proj/rnaseq/doc/mrf/index.html MRF] (Mapped Read Format), which contains a set of modules to analyze RNA-Seq data sets.&lt;br /&gt;
&lt;br /&gt;
==== Source code &amp;amp; Executables ====&lt;br /&gt;
A TAR ball that contains the source code for these two components, as well as statically built binaries for UNIX and Mac OS, can be found in RSEQtools&#039; Github repository, which can be cloned from the following URL:&lt;br /&gt;
* [https://github.com/gersteinlab/RSEQtools.git https://github.com/gersteinlab/RSEQtools.git]&lt;br /&gt;
The components should be installed in the same order as listed above. Installation instructions can be found at the respective website.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Important Note&lt;br /&gt;
==============&lt;br /&gt;
&lt;br /&gt;
THIS PACKAGE (RSEQtools including BIOS and MRF) IS PROVIDED &amp;quot;AS IS&amp;quot; AND WITHOUT ANY EXPRESSED OR IMPLIED&lt;br /&gt;
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES&lt;br /&gt;
OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== License information ===&lt;br /&gt;
&lt;br /&gt;
The software package is released under the [http://creativecommons.org/licenses/by-nc/2.5/legalcode Creative Commons license (Attribution-NonCommerical)]. &amp;lt;br&amp;gt;&lt;br /&gt;
For more details please refer to the [http://www.gersteinlab.org/misc/permissions.html Permissions Page] on the Gerstein Lab webpage.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1843</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1843"/>
		<updated>2012-01-18T14:12:49Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Timing &amp;amp; location */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; Wed 6-7 pm in Gibbs 263 (starting Jan 18)&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Guest Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Alice Zhou (select lectures)&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| alice.zhou(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule]&#039;&#039;&#039; (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 9%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 17%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 17%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
([[Pointers on finding things on old class pages]])&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1841</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1841"/>
		<updated>2012-01-11T10:59:22Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Grade Categories */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Guest Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Alice Zhou (select lectures)&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| alice.zhou(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule]&#039;&#039;&#039; (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 9%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 17%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 17%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
([[Pointers on finding things on old class pages]])&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1835</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1835"/>
		<updated>2012-01-10T00:30:15Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Topics/Class Schedule */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Alice Zhou (select lectures)&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| alice.zhou(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule]&#039;&#039;&#039; (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1834</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1834"/>
		<updated>2012-01-05T22:13:11Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Teaching Fellows */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Alice Zhou (select lectures)&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| alice.zhou(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1833</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1833"/>
		<updated>2012-01-05T22:12:04Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Teaching Fellows */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Alice Zhou&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| alice.zhou(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1832</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1832"/>
		<updated>2012-01-05T22:11:47Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Teaching Fellows */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Alice Zhou&lt;br /&gt;
| Mason Labs&lt;br /&gt;
| alice.zhou(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1828</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1828"/>
		<updated>2012-01-02T21:44:27Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Session 5 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst (15 February 2006) 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1827</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1827"/>
		<updated>2012-01-02T21:43:16Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Session 5 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Sotiriou et al. (2006) Gene Expression Profiling in Breast Cancer: Understanding the Molecular Basis of Histologic Grade To Improve Prognosis. JNCI J Natl Cancer Inst 98 (4):262-272.doi: 10.1093/jnci/djj052 [http://jnci.oxfordjournals.org/content/98/4/262.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
Ekman D, Light S, Björklund AK, Elofsson A. (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol.;7(6):R45. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779539/pdf/gb-2006-7-6-r45.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1826</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1826"/>
		<updated>2011-12-30T05:08:40Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Session 1 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (2008) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1825</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1825"/>
		<updated>2011-12-30T01:58:04Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Discussion Sections */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Section Readings==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1824</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1824"/>
		<updated>2011-12-30T01:55:41Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Timing &amp;amp; location */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [https://docs.google.com/spreadsheet/viewform?formkey=dFpoWF92YlVQQTFFOWY0cWt6RUdueWc6MQ Poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1823</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1823"/>
		<updated>2011-12-30T01:46:51Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Timing &amp;amp; location */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; [http://www.doodle.com/pmp7z6cm76qu97vs Doodle poll] for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1822</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1822"/>
		<updated>2011-12-30T01:30:20Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements section.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Class:&#039;&#039;&#039; Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Discussion section:&#039;&#039;&#039; Doodle poll for students to indicate good times for the weekly discussion section&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1821</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1821"/>
		<updated>2011-12-30T01:16:31Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Different headings for this class */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the grade weighting schemes of each course version, see Class Requirements (below).&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1820</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1820"/>
		<updated>2011-12-30T01:15:29Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For grade weightings of each course version, see Class Requirements (below).&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Non-CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===CBB Final Project===&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1819</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1819"/>
		<updated>2011-12-30T01:13:22Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Discussion Section / Readings */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For grade weightings of each course version, see Class Requirements (below).&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly 60-minute sections with the TFs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1818</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1818"/>
		<updated>2011-12-30T01:12:44Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For grade weightings of each course version, see Class Requirements (below).&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1817</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1817"/>
		<updated>2011-12-30T01:11:20Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Different headings for this class */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For grade weightings of each course version, see Class Requirements (below).&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1816</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1816"/>
		<updated>2011-12-30T00:53:24Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Different headings for this class */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B452/MCDB452&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B752/MCDB752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, written problem sets, and a final, graduate level written project.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;CB&amp;amp;B752/CPSC752&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This version of the course consists of lectures, programming assignments, and a final programming project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For &#039;&#039;graduate students&#039;&#039; the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement):&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 753b3&#039;&#039;&#039;, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;MB&amp;amp;B 754b4&#039;&#039;&#039;, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
Each module consists of lectures, written problem sets, and a final, graduate level written project that is half the length of the full course&#039;s final project.&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1815</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1815"/>
		<updated>2011-12-30T00:23:29Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Teaching Fellows */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
MBB452 &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 452 01 (21914) &lt;br /&gt;
MCDB45&lt;br /&gt;
&lt;br /&gt;
everything + written proj + written homeworks &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B752&lt;br /&gt;
MCDB752&lt;br /&gt;
&lt;br /&gt;
everything + grad level written proj + homeworks &lt;br /&gt;
&lt;br /&gt;
CB&amp;amp;B752&lt;br /&gt;
CPSC752&lt;br /&gt;
&lt;br /&gt;
everything + programming proj&lt;br /&gt;
+ prog assignment&lt;br /&gt;
&lt;br /&gt;
For graduate students the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement.):&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 753b3, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 754b4, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
same as mbb752&lt;br /&gt;
incl the final proj. at half length&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1814</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1814"/>
		<updated>2011-12-30T00:22:47Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Instructor-in-Charge */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
MBB452 &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 452 01 (21914) &lt;br /&gt;
MCDB45&lt;br /&gt;
&lt;br /&gt;
everything + written proj + written homeworks &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B752&lt;br /&gt;
MCDB752&lt;br /&gt;
&lt;br /&gt;
everything + grad level written proj + homeworks &lt;br /&gt;
&lt;br /&gt;
CB&amp;amp;B752&lt;br /&gt;
CPSC752&lt;br /&gt;
&lt;br /&gt;
everything + programming proj&lt;br /&gt;
+ prog assignment&lt;br /&gt;
&lt;br /&gt;
For graduate students the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement.):&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 753b3, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 754b4, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
same as mbb752&lt;br /&gt;
incl the final proj. at half length&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1813</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1813"/>
		<updated>2011-12-30T00:22:01Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Instructors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
MBB452 &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 452 01 (21914) &lt;br /&gt;
MCDB45&lt;br /&gt;
&lt;br /&gt;
everything + written proj + written homeworks &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B752&lt;br /&gt;
MCDB752&lt;br /&gt;
&lt;br /&gt;
everything + grad level written proj + homeworks &lt;br /&gt;
&lt;br /&gt;
CB&amp;amp;B752&lt;br /&gt;
CPSC752&lt;br /&gt;
&lt;br /&gt;
everything + programming proj&lt;br /&gt;
+ prog assignment&lt;br /&gt;
&lt;br /&gt;
For graduate students the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement.):&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 753b3, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 754b4, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
same as mbb752&lt;br /&gt;
incl the final proj. at half length&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1812</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1812"/>
		<updated>2011-12-30T00:19:59Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
MBB452 &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 452 01 (21914) &lt;br /&gt;
MCDB45&lt;br /&gt;
&lt;br /&gt;
everything + written proj + written homeworks &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B752&lt;br /&gt;
MCDB752&lt;br /&gt;
&lt;br /&gt;
everything + grad level written proj + homeworks &lt;br /&gt;
&lt;br /&gt;
CB&amp;amp;B752&lt;br /&gt;
CPSC752&lt;br /&gt;
&lt;br /&gt;
everything + programming proj&lt;br /&gt;
+ prog assignment&lt;br /&gt;
&lt;br /&gt;
For graduate students the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement.):&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 753b3, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 754b4, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
same as mbb752&lt;br /&gt;
incl the final proj. at half length&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1811</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1811"/>
		<updated>2011-12-30T00:12:12Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Literature discussion section */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky and Jane Leng.&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
MBB452 &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 452 01 (21914) &lt;br /&gt;
MCDB45&lt;br /&gt;
&lt;br /&gt;
everything + written proj + written homeworks &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B752&lt;br /&gt;
MCDB752&lt;br /&gt;
&lt;br /&gt;
everything + grad level written proj + homeworks &lt;br /&gt;
&lt;br /&gt;
CB&amp;amp;B752&lt;br /&gt;
CPSC752&lt;br /&gt;
&lt;br /&gt;
everything + programming proj&lt;br /&gt;
+ prog assignment&lt;br /&gt;
&lt;br /&gt;
For graduate students the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement.):&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 753b3, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 754b4, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
same as mbb752&lt;br /&gt;
incl the final proj. at half length&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1810</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1810"/>
		<updated>2011-12-30T00:09:35Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/MB&amp;amp;B753/MB&amp;amp;B754/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Different headings for this class===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
MBB452 &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 452 01 (21914) &lt;br /&gt;
MCDB45&lt;br /&gt;
&lt;br /&gt;
everything + written proj + written homeworks &lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B752&lt;br /&gt;
MCDB752&lt;br /&gt;
&lt;br /&gt;
everything + grad level written proj + homeworks &lt;br /&gt;
&lt;br /&gt;
CB&amp;amp;B752&lt;br /&gt;
CPSC752&lt;br /&gt;
&lt;br /&gt;
everything + programming proj&lt;br /&gt;
+ prog assignment&lt;br /&gt;
&lt;br /&gt;
For graduate students the course is broken up into two &amp;quot;modules&amp;quot; (each counting 0.5 credit towards MB&amp;amp;B course requirement.):&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 753b3, Bioinformatics: Practical Application of Data Mining (1st half of term)&lt;br /&gt;
&lt;br /&gt;
MB&amp;amp;B 754b4, Bioinformatics: Practical Application of Simulation (2nd half of term)&lt;br /&gt;
&lt;br /&gt;
same as mbb752&lt;br /&gt;
incl the final proj. at half length&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
[http://info.gersteinlab.org/Permissions Permissions] on using website material&lt;br /&gt;
&lt;br /&gt;
[http://archive.gersteinlab.org/mark/out/log/2011/12.29/talk/intro-graphic.ppt Graphic] for course homepage&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://gersteinlab.org/jobs for further Research Opportunities&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1792</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1792"/>
		<updated>2011-12-20T04:41:53Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Instructors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1791</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1791"/>
		<updated>2011-12-20T04:39:28Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Instructors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
| TBA&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1790</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1790"/>
		<updated>2011-12-20T04:38:41Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Instructors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Jesse Rinehart&lt;br /&gt;
| 300 George St&lt;br /&gt;
| jesse.rinehart(at)yale.edu&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| James Noonan&lt;br /&gt;
| 333 Cedar St&lt;br /&gt;
| james.noonan(at)yale.edu&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
| Kei Cheung&lt;br /&gt;
| 300 George St&lt;br /&gt;
| kei.cheung(at)yale.edu&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Steven Kleinstein&lt;br /&gt;
| 300 George St&lt;br /&gt;
| steven.kleinstein(at)yale.edu&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1789</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1789"/>
		<updated>2011-12-20T04:24:53Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Teaching Fellows */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| M 2:15-3:15 PM&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1788</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1788"/>
		<updated>2011-12-20T04:23:36Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Timing &amp;amp; location */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in Bass 305. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| M 2:15-3:15 PM&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1787</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1787"/>
		<updated>2011-12-20T04:18:33Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: /* Bioinformatics quizzes */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in 305 BASS. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| M 2:15-3:15 PM&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be four short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1786</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1786"/>
		<updated>2011-12-20T04:16:59Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in 305 BASS. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| M 2:15-3:15 PM&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheet/ccc?key=0Ar2M8lIkl9T5dDN4d1Q0aDBGcThXdjV2c21ESWVxSGc Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be approximately three short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1785</id>
		<title>Cbb752b12</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Cbb752b12&amp;diff=1785"/>
		<updated>2011-12-20T04:12:44Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=CBB 752=&lt;br /&gt;
&lt;br /&gt;
==Course Information==&lt;br /&gt;
&lt;br /&gt;
===Course Description===&lt;br /&gt;
&lt;br /&gt;
Bioinformatics encompasses the analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. It represents a major practical application for modern techniques in data mining and simulation. Specific topics to be covered include sequence alignment, large-scale processing, next-generation sequencing data, comparative genomics, phylogenetics, biological database design, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, normalization of microarray data, mining of functional genomics data sets, and machine learning approaches for data integration.&lt;br /&gt;
&lt;br /&gt;
===Concise undergraduate course description===&lt;br /&gt;
&lt;br /&gt;
Techniques in data mining and simulation applied to bioinformatics, the computational analysis of gene sequences, macromolecular structures, and functional genomics data on a large scale. Sequence alignment, comparative genomics and phylogenetics, biological databases, geometric analysis of protein structure, molecular-dynamics simulation, biological networks, microarray normalization, and machine-learning approaches to data integration.&lt;br /&gt;
&lt;br /&gt;
See entry from undergraduate catalog:&lt;br /&gt;
http://students.yale.edu/oci/resultDetail.jsp?course=21914&amp;amp;term=201201 , viz:&lt;br /&gt;
&lt;br /&gt;
 MB&amp;amp;B 452 01 (21914) /MCDB452/MB&amp;amp;B752/CB&amp;amp;B752/MCDB752/CPSC752&lt;br /&gt;
 Bioinformatics: Practical Application of Simulation and Data Mining &lt;br /&gt;
 Mark Gerstein&lt;br /&gt;
 MW 1.00-2.15 BASS 305&lt;br /&gt;
 Spring 2012 &lt;br /&gt;
 No regular final examination&lt;br /&gt;
 Areas Sc&lt;br /&gt;
 Prerequisites: MB&amp;amp;B 301b and MATH 115a or b, or permission of instructor.&lt;br /&gt;
 MCDB 120a or 200b is a prerequisite for courses numbered MCDB 202 and above.&lt;br /&gt;
&lt;br /&gt;
===Quizzes and Final Project===&lt;br /&gt;
&lt;br /&gt;
There will be approximately four short quizzes during the semester and a take-home final project. For CBB and CS sections, the final project will be a programming assignment. For MB&amp;amp;B, the final project will be a paper. Further details will be announced at a later date.&lt;br /&gt;
&lt;br /&gt;
===Literature discussion section===&lt;br /&gt;
&lt;br /&gt;
One session of 60 minutes per week, time to be arranged. Student presentations of recent research papers relevant to the topics of the course. Led by Lucas Lochovsky (Bass, Rm 437; (203) 432-5405; lucas.lochovsky(at)yale.edu) and Jane Leng (Bass, Rm 437; (203) 432-5405; jing.leng(at)yale.edu).&lt;br /&gt;
&lt;br /&gt;
===Programming Projects/Problem Sets===&lt;br /&gt;
&lt;br /&gt;
Students taking this course listed under Computational Biology and Bioinformatics or Computer Science will be required to complete several short programming assignments. Further details will be discussed in the literature discussion section and during class.&lt;br /&gt;
&lt;br /&gt;
===Grade Categories===&lt;br /&gt;
&lt;br /&gt;
====CBB and CPSC Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 8.50%&lt;br /&gt;
|-&lt;br /&gt;
| Programming Assignments&lt;br /&gt;
| 25.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====MBB and MCDB Sections:====&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Category&lt;br /&gt;
! % of Total Grade&lt;br /&gt;
|-&lt;br /&gt;
| Quizzes&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Final Project&lt;br /&gt;
| 33.25%&lt;br /&gt;
|-&lt;br /&gt;
| Discussion Section&lt;br /&gt;
| 16.75%&lt;br /&gt;
|-&lt;br /&gt;
| Problem Sets&lt;br /&gt;
| 16.75%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Differences Between Class Sections===&lt;br /&gt;
&lt;br /&gt;
In general, the graduate level CS/CBB course is significantly different than MBB/MCDB (graduate and undergraduate) in several ways. Although the lectures are the same for each section, the graduate level CPSC/CBB course has additional programming assignments in addition to the work being completed by the MBB students. Homework for the MBB section centers on the completion of several problem sets without a programming component. The CPSC/CBB section forgoes these problem sets and instead requires that students implement several of the algorithms discussed in class. Also, the final project for CPSC/CBB MUST be a programming assignment rather than the final paper required for the MBB section. Due to the distinct course requirements, category weightings for final grades are also different.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Timing &amp;amp; location==&lt;br /&gt;
&lt;br /&gt;
Class: Meeting from 1:00-2:15 pm on Monday and Wednesday, in 305 BASS. (First meeting will be on 9 Jan.)&lt;br /&gt;
&lt;br /&gt;
Discussion section: TBA&lt;br /&gt;
&lt;br /&gt;
==Instructors==&lt;br /&gt;
&lt;br /&gt;
===Instructor-in-Charge===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Mark Gerstein&lt;br /&gt;
| Bass 432A&lt;br /&gt;
| (203) 432-6105&lt;br /&gt;
| mark.gerstein(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Email&lt;br /&gt;
! Office Hours&lt;br /&gt;
|-&lt;br /&gt;
| Corey O&#039;Hern&lt;br /&gt;
| Mason Laboratory&lt;br /&gt;
| corey.ohern(at)yale.edu&lt;br /&gt;
| M 2:15-3:15 PM&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Others to be listed&lt;br /&gt;
&lt;br /&gt;
===Teaching Fellows===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
! Office&lt;br /&gt;
! Office Phone&lt;br /&gt;
! Email&lt;br /&gt;
|-&lt;br /&gt;
| Lucas Lochovsky&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| lucas.lochovsky(at)yale.edu&lt;br /&gt;
|-&lt;br /&gt;
| Jane Leng&lt;br /&gt;
| Bass Rm 437&lt;br /&gt;
| (203) 432-5405&lt;br /&gt;
| jing.leng(at)yale.edu&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Topics/Class Schedule==&lt;br /&gt;
&lt;br /&gt;
[https://spreadsheets.google.com/pub?key=0AnSCkRiRBZ5KdEp1T0NuWkZHTjZKLXZyV1Y0WnFNZGc&amp;amp;hl=en&amp;amp;output=html Class Schedule] (including a list of topics and quiz dates)&lt;br /&gt;
&lt;br /&gt;
==Discussion Sections==&lt;br /&gt;
&lt;br /&gt;
===Session 1===&lt;br /&gt;
Metzker ML. &amp;quot;Sequencing technologies - the next generation” Nature Reviews Genetics. 11 (2010) [http://www.gersteinlab.org/courses/452/10-spring/pdf/ngs.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Wheeler DA et al. &amp;quot;The complete genome of an individual by massively parallel DNA sequencing,” Nature. 452:872-876 (208) [http://www.gersteinlab.org/courses/452/10-spring/pdf/WatsonGenome.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 2===&lt;br /&gt;
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.Cell. 2006 Nov 3;127(3):635-48. [http://www.pil.sdu.dk/1/MSQuant/Cell_GlobalPhosphorylationSignalingDynamics.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Nevan J. Krogan et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae Nature 440, 637-643 (30 March 2006) [http://www.nature.com/nature/journal/v440/n7084/pdf/nature04670.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 3===&lt;br /&gt;
T.F. Smith and M.S. Waterman. (1981) Identification of common molecular subsequences. Journal of Molecular Biology,147(1): 195-7. PMID: 7265238. [http://www.gersteinlab.org/courses/452/10-spring/pdf/sw.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology, 215(3):403-10. PMID: 2231712. [http://www.gersteinlab.org/courses/452/10-spring/pdf/Altschul.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 4===&lt;br /&gt;
Bailey TL, Williams N, Misleh C, Li WW. (2006) MEME: discovering and analyzing DNA and protein sequence motifs, Nucl Acids Res.34:W369-373 [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538909/?tool=pubmed PDF]&lt;br /&gt;
&lt;br /&gt;
Garnier J, Gibrat JF, Robson B. (1996) GOR method for predicting protein secondary structure from amino acid sequence.Methods in Enzymology,266: 540-53. PMID: 8743705. [http://www.gersteinlab.org/courses/452/10-spring/pdf/GOR.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 5===&lt;br /&gt;
Laura J. van &#039;t Veer et al. Gene expression profiling predicts clinical outcome of breast cancer Nature 415, 530-536 (31 January 2002) | doi:10.1038/415530a; Received 24 August 2001; Accepted 22 November 2001 [http://www.nature.com/nature/journal/v415/n6871/full/415530a.html TEXT]&lt;br /&gt;
&lt;br /&gt;
Kwang-Il Goh, Michael E. Cusick, David Vall, Barton Child, Marc Vidal, and Albert-La ́szlo ́ Barabasi (2007) The human disease network Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. Epub 2007 May 14. [http://www.pnas.org/content/104/21/8685.full.pdf+html PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 6===&lt;br /&gt;
Antezana E, Egaña M, Blondé W, Illarramendi A, Bilbao I, De Baets B, Stevens R, Mironov V, Kuiper M. (2009) The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process. Genome Biol. 2009;10(5):R58. Epub 2009 May 29. [http://genomebiology.com/content/10/5/R58 PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 7===&lt;br /&gt;
Perelson AS. Modelling viral and immune system dynamics. Nat Rev Immunol. 2002 Jan;2(1):28-36. [http://www.gersteinlab.org/courses/452/10-spring/pdf/perelson.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 8===&lt;br /&gt;
ML Connolly. (1983) Solvent-accessible surfaces of proteins and nucleic acids. Science, 221(4612): 709-13. PMID: 6879170.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Connolly.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Martin Karplus and J. Andrew McCammon. (2002) Molecular dynamics simulations of biomolecules. Nature Structural Biology,9, 646-52. PMID: 12198485.[http://www.gersteinlab.org/courses/452/10-spring/pdf/Karplus.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
===Session 9===&lt;br /&gt;
Dill KA, Ozkan SB, Shell MS, Weikl TR. (2008) The Protein Folding Problem.Annu Rev Biophys,9, 37:289-316. PMID: 2443096.[http://www.gersteinlab.org/courses/452/10-spring/pdf/proteinFolding.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
Bowman GR, Beauchamp KA, Boxer G, Pande VS. “Progress and challenges in the automated construction of Markov state models for full protein systems,” J. Chem. Phys. 131 (2009) 124101 [http://www.gersteinlab.org/courses/452/10-spring/pdf/bowman.pdf PDF]&lt;br /&gt;
&lt;br /&gt;
==Non-CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==CBB Final Project==&lt;br /&gt;
&lt;br /&gt;
[tbd]&lt;br /&gt;
&lt;br /&gt;
==Class Requirements==&lt;br /&gt;
&lt;br /&gt;
===Discussion Section / Readings===&lt;br /&gt;
Papers will be assigned throughout the course. These papers will be presented and discussed in weekly sections with the TAs. A brief summary (a half-page per article) should be submitted at the beginning of the discussion session.&lt;br /&gt;
&lt;br /&gt;
===Bioinformatics quizzes===&lt;br /&gt;
There will be approximately three short quizzes (25 minutes) in class comprising SIMPLE questions that you should be able to answer from the lectures plus the main readings.&lt;br /&gt;
&lt;br /&gt;
===Programming Assignments (CBB and CS)===&lt;br /&gt;
There will be several short programming assignments required for CBB and CS students taking this course. Acceptable languages and submission requirements will be discussed prior to the first assignment. These assignments are NOT required for students not taking the CBB or CS sections of the course.&lt;br /&gt;
&lt;br /&gt;
===Prerequisites===&lt;br /&gt;
The course is keyed towards CBB graduate students as well as advanced MB&amp;amp;B undergraduates and graduate students wishing to learn about types of large-scale quantitative analyses that whole-genome sequencing will make possible. It would also be suitable for students from other fields such as computer science or physics wanting to learn about an important new biological application for computation.&lt;br /&gt;
&lt;br /&gt;
Students should have:&lt;br /&gt;
&lt;br /&gt;
A basic knowledge of biochemistry and molecular biology.&lt;br /&gt;
A knowledge of basic quantitative concepts, such as single variable calculus, some probability and statistics, and basic programming skills.&lt;br /&gt;
These can be fulfilled by the following prerequisites statement: &amp;quot;Prerequisites: MBB 200 and Mathematics 115 or permission of the instructor.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
===Relevant Yale College Regulations===&lt;br /&gt;
&lt;br /&gt;
Students may have questions concerning end-of-term matters.  Links to further information about these regulations can be found below:&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/reading-period-and-final-examination-period&lt;br /&gt;
&lt;br /&gt;
http://yalecollege.yale.edu/content/completion-course-work&lt;br /&gt;
&lt;br /&gt;
==Pages from previous years==&lt;br /&gt;
[http://info.gersteinlab.org/Cbb752b11 2011],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/ 2010],&lt;br /&gt;
[http://www.gersteinlab.org/courses/452/10-spring/previous.html 2009 and earlier]&lt;br /&gt;
&lt;br /&gt;
==Research Opportunities==&lt;br /&gt;
&lt;br /&gt;
If you&#039;re really motivated, take a look at http://bioinfo.mbb.yale.edu/jobs/.&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Calendar_maintainers&amp;diff=1360</id>
		<title>Calendar maintainers</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Calendar_maintainers&amp;diff=1360"/>
		<updated>2011-06-22T18:29:29Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The individuals below have responsibilities for maintaining different parts of the lab calendar. These should *also* try to turn up on their call. &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
|+Editors and Calls&lt;br /&gt;
|-&lt;br /&gt;
! Name !! Call/mtg !! Call/mtg !! Call/mtg&lt;br /&gt;
|-&lt;br /&gt;
! ZL&lt;br /&gt;
| modEncode TR || ||&lt;br /&gt;
|-&lt;br /&gt;
! JR&lt;br /&gt;
|  special events || encode Elements ||&lt;br /&gt;
|-&lt;br /&gt;
! RR&lt;br /&gt;
| GT || monthly modENCODE Consortium || modEncode&lt;br /&gt;
|-&lt;br /&gt;
! RKA &lt;br /&gt;
| encode-Chip || ||&lt;br /&gt;
|-&lt;br /&gt;
! CC &lt;br /&gt;
| encode AWG || ||&lt;br /&gt;
|-&lt;br /&gt;
! LH &lt;br /&gt;
| oligo call || test subject calls ||&lt;br /&gt;
|-&lt;br /&gt;
! JJL &lt;br /&gt;
| ENCODE RNA ||  ||&lt;br /&gt;
|-&lt;br /&gt;
! JCW &lt;br /&gt;
| Brainseq (GO group #3) ||  ||&lt;br /&gt;
|-&lt;br /&gt;
! RA&lt;br /&gt;
| modEncode AWG ||  ||&lt;br /&gt;
|-&lt;br /&gt;
! MW&lt;br /&gt;
| Happy-Hr || encode-chip quarterly report deadlines||&lt;br /&gt;
|-&lt;br /&gt;
! MF&lt;br /&gt;
| gpmtg || JC ||&lt;br /&gt;
|-&lt;br /&gt;
! XJM &lt;br /&gt;
| 1000 genomes Analysis  || 1000 genomes indel calls ||&lt;br /&gt;
|-&lt;br /&gt;
! AAN &lt;br /&gt;
| encode: Variation || Marcin call ||&lt;br /&gt;
|-&lt;br /&gt;
! EK &lt;br /&gt;
| 1000SV || ||&lt;br /&gt;
|-&lt;br /&gt;
! SB &lt;br /&gt;
| encode-pgene || 1000 genomes exome call || &lt;br /&gt;
|-&lt;br /&gt;
! ASB &lt;br /&gt;
| Rubin-Gerstein Seq. Call &amp;amp; Monthly Meetings ||  || &lt;br /&gt;
|-&lt;br /&gt;
&lt;br /&gt;
! AN &lt;br /&gt;
| Yale holidays || Rotation Dates || Misc. stuff&lt;br /&gt;
|-&lt;br /&gt;
! RK&lt;br /&gt;
| YPED ||  ||&lt;br /&gt;
|-&lt;br /&gt;
! BP&lt;br /&gt;
| encode-annotation ||  ||&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
	<entry>
		<id>https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=1327</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://info.gersteinlab.org/index.php?title=Main_Page&amp;diff=1327"/>
		<updated>2011-06-07T18:37:47Z</updated>

		<summary type="html">&lt;p&gt;Lucasl: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This wiki hosts general, public information about the&lt;br /&gt;
 &lt;br /&gt;
&#039;&#039;&#039;[http://gersteinlab.org Gerstein Lab]&#039;&#039;&#039;.  &lt;br /&gt;
&lt;br /&gt;
Lab members should consult the [http://wiki.gersteinlab.org/labinfo/ private lab wiki] for lab-specific information (login as user &amp;quot;public&amp;quot;). There&#039;s also a wiki for [http://wiki.gersteinlab.org/meetings lab meetings] (login as user &amp;quot;private&amp;quot;). To edit this wiki, contact the [[User:Infoadmin | Infoadmin]]&lt;br /&gt;
&lt;br /&gt;
== Important Public Items ==&lt;br /&gt;
*Public [[Documents]] about the lab&lt;br /&gt;
*[[FAQ]] (new) on programs&lt;br /&gt;
*Lab [[Calendar]]&lt;br /&gt;
*[http://www.gersteinlab.org/index.html.1jun11 Old version] of lab homepage&lt;br /&gt;
*Prominent [[public wiki pages linked from elsewhere]]&lt;br /&gt;
*[[Permissions]]&lt;br /&gt;
&lt;br /&gt;
== Info for New People ==&lt;br /&gt;
*[[General Information for New Undergraduates]] thinking about working in the lab&lt;br /&gt;
* Information for &amp;lt;B&amp;gt;new people&amp;lt;/B&amp;gt; starting in the lab from the private wiki ([http://wiki.gersteinlab.org/labinfo/Group_Meeting_and_JC_Procedure], [http://wiki.gersteinlab.org/labinfo/Bad_Times_and_Contact_Info], [http://wiki.gersteinlab.org/labinfo/New_People_Entering_Lab], [http://wiki.gersteinlab.org/labinfo/Staff])&lt;br /&gt;
&lt;br /&gt;
== Info for Lab Members and Collaborators ==&lt;br /&gt;
*[http://wiki.gersteinlab.org/labinfo/Lab_Computing_Resources Lab Computing Resources] page. &lt;br /&gt;
*[http://wiki.gersteinlab.org/labinfo/Making_conference_calls_or_lab_related_calls Making phone calls] (SKYPE information included) page.&lt;br /&gt;
*[http://wiki.gersteinlab.org/labinfo/Using_copier Using copier] page.&lt;br /&gt;
*[http://www.facebook.com/group.php?gid=54856837624 Gerstein Lab Facebook Page]&lt;br /&gt;
*[http://bioinfo.mbb.yale.edu/pipermail/web/ Public WEB Mailing List] (no longer in use)&lt;br /&gt;
* [[Streamlining Draft Flow]] (Ideas for streamlining the process of drafting and submitting papers)&lt;br /&gt;
** Way to list Mark&#039;s [[Affiliation]] on papers &lt;br /&gt;
** See [[xxmg at gersteinlab.org | xxmg@gersteinlab.org]] for correct address to use for paper correspondence.&lt;br /&gt;
** Note that &#039;&#039;&#039;xxmg at bioinfo.mbb.yale.edu&#039;&#039;&#039; for paper correspondence is deprecated. &lt;br /&gt;
* [[Pointers on Powerpoints]] and [[Pointers on Grant Sections]]&lt;br /&gt;
* [[Recommendation Letters]]&lt;br /&gt;
* Lab [[Resources Document]] (NIH form)&lt;br /&gt;
* Google Groups: [http://group.gersteinlab.org homepage], for [http://docs.google.com/a/gersteinlab.org DOCS]&lt;br /&gt;
* Some Useful University Policies&lt;br /&gt;
** Snippets from [[MB&amp;amp;B Policy for Graduate Students on Vacation and Travel]] and [[Policy on Postdoc Appointments]]&lt;br /&gt;
** Policies on visitors: [http://provost.yale.edu/minors-in-labs Policy on Minors in Labs] ([http://archive.gersteinlab.org/docs/2010/06.02/Policy-on-Minors-in-Labs.pdf  old]), [http://provost.yale.edu/policy-access-university-labs-and-research-facilities Policy on Access to the Lab]&lt;br /&gt;
** Travel per diem information: [http://www.yale.edu/ppdev/Guides/bluepages.pdf Yale Blue Pages]&lt;br /&gt;
* Useful links: [http://www.yale.edu/its/accounts/netid.html Yale NetID System], [http://www.yale.edu/its/network/vpn_faq.html Yale VPN FAQ], [http://www.yale.edu/its/network/wireless/faq.html Yale Wireless FAQ], [http://www.yale.edu/its/telecom/dialing.html Yale Dialing Instructions], [https://config.mail.yale.edu Configuring Yale email], [http://www.yale.edu/ris/main.html Poster Printing]&lt;br /&gt;
* [http://maguro.cs.yale.edu:8000/Center_for_High_Performance_Computation_in_Biology_and_Biomedicine Yale High Performance Computing Center]&lt;br /&gt;
* Information on [[what grant to charge something to]]&lt;br /&gt;
* Information on [[Meeting Invites]]&lt;br /&gt;
* [http://araman.mbgnet/nagios/ System Status] MBGNet LAN access only.&lt;br /&gt;
* [http://info.gersteinlab.org/Tools Tools] page&lt;/div&gt;</summary>
		<author><name>Lucasl</name></author>
	</entry>
</feed>