FusionSeq

From GersteinInfo

(Difference between revisions)
Jump to: navigation, search
(How to execute FusionSeq)
(Software Requirements)
Line 4: Line 4:
However, this is not a plug-and-play program, but it requires the user to compile, install and run a set of programs. Please read the requirements before [[#Download|downloading]] FusionSeq.
However, this is not a plug-and-play program, but it requires the user to compile, install and run a set of programs. Please read the requirements before [[#Download|downloading]] FusionSeq.
-
==Software Requirements==
+
==[[Software Requirements]]==
-
FusionSeq requires several additional packages to be installed in order to carry out the analysis and visualize the results. Moreover, since its modularity, different programs would need specific libraries. Moreover, some data set are also required for the analysis (see [[#Data Requirements|Data Requirements]]). Here we describe the complete set of tools that one would need to run the analysis as we do in our lab. The modules should be installed in the listed order.
+
-
 
+
-
===Alignment tools===
+
-
* [http://bowtie-bio.sourceforge.net/index.shtml bowtie] (64bit)
+
-
* [http://users.soe.ucsc.edu/~kent/src/ Blat (source)] [http://genome-test.cse.ucsc.edu/~kent/exe/ (binaries)]
+
-
Please make sure that blat and bowtie executables are part of the PATH, i.e. they can be accessed and executed from any location on your file system. Moreover, make sure that twoBit2fa is also downloaded from the blat package and part of the PATH.
+
-
 
+
-
===Scientific and bioinformatics libraries===
+
-
* [http://www.gnu.org/software/gsl/ GNU Scientific Library]
+
-
* [http://rnaseq.gersteinlab.org/doc/bios/ BIOS]
+
-
 
+
-
===Drawing tools===
+
-
* [http://www.libgd.org/Main_Page GD library]
+
-
 
+
-
===Data analysis===
+
-
* [http://root.cern.ch/drupal/ ROOT]
+
==Data Requirements==
==Data Requirements==

Revision as of 12:56, 19 August 2010

FusionSeq main web page

Contents

Introduction

This page provides the source code for FusionSeq. Please note that these tools were tested on a multi-node cluster of computing nodes with Linux Red Hat as operating system and PBS as scheduler system. FusionSeq programs are written in C and should likely compile to most Unix/Linux platforms. We used the gcc complier (version 3.4.6 20060404) to compile the source code. However, this is not a plug-and-play program, but it requires the user to compile, install and run a set of programs. Please read the requirements before downloading FusionSeq.

Software Requirements

Data Requirements

Here is the list of required data for a comprehensive use of FusionSeq tools.

External

The human genome needs to be properly indexed to be used by bowtie. Please see the instruction of bowtie for performing this operation. Indicatevely, you would need to run something like:

$ bowtie-build -f hg18_nh.fa /path2bowtieIndex/hg18_nh/

where hg18_nh.fa corresponds to the concatenation of all human chromosomes from chromFa.zip without the different haplotypes and "random" stuff.

Provided

The following data sets, bundled in a tarball, can be downloaded here.

  • knownGeneAnnotation.txt
  • knownGeneAnnotationTranscriptCompositeModel.txt
  • knownGeneAnnotationTranscriptCompositeModel.fa
  • kgXref.txt
  • knownToTreefam.txt

The composite model needs to be indexed by bowtie:

$ bowtie-build -f knownGeneAnnotationTranscriptCompositeModel.fa /path2bowtieIndex/hg18_knownGeneAnnotationTranscriptCompositeModel/hg18_knownGeneAnnotationTranscriptCompositeModel

Please make sure that the correct filenames are used.

Download

Installation and Configuration of FusionSeq

How to execute FusionSeq




last revision: 12:53, 19 August 2010 (UTC)

Personal tools