http://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&feed=atom&action=historyHow to execute FusionSeq - Revision history2024-03-28T17:34:11ZRevision history for this page on the wikiMediaWiki 1.15.4http://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=1105&oldid=prevAsboner: /* Junction-Sequence Identifier module */2011-04-05T17:59:52Z<p><span class="autocomment">Junction-Sequence Identifier module</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 17:59, 5 April 2011</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 61:</td>
<td colspan="2" class="diff-lineno">Line 61:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Junction-Sequence Identifier module==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Junction-Sequence Identifier module==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>The junction-sequence identifier uses the high-confidence gfr file and look for the sequence of the junction for each candidate. This is the most computationally expensive part of FusionSeq. In order to run it efficiently we exploit a parallel computing architecture. Typically, one would execute:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>The junction-sequence identifier uses the high-confidence gfr file and look for the sequence of the junction for each candidate. This is the most computationally expensive part of FusionSeq. In order to run it efficiently we exploit a parallel computing architecture. Typically, one would execute:</div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div> $ gfr2bpJunctions data.confidence.gfr 40 200 </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div> $ gfr2bpJunctions data.confidence.gfr 40 200 <ins class="diffchange diffchange-inline">1.0</ins></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>where 40 correspond to the tile size and 200 is the number of nucleotides flanking the exons. For example, with 50-mers, having 40 as tile size ensures that at least 10 nucleotides of the reads will be mapped to either of the tiles. This program generates the fusion junction library in fasta format for each candidate, split in several files each one containing at most 2M entries. It also creates two additional files: one (data_jobList1.txt) including the instructions to index the library and align the reads to the files, and the second (data_jobList2.txt) including the instructions to aggregate the results of the alignment. How to run those jobs depends on the user architecture of the cluster. It is worth noting that data_jobList1.txt expects a compressed file with all the single end reads called: data_allReads.txt.gz. The auxiliary program [[FusionSeq_List of programs#export2mrf|export2mrf]] generates this file automatically from the output files (export) of the Illumina platform. Otherwise, the user should provide a file with all the reads, one sequence per line. One way to do this from the fastq data is:</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>where 40 correspond to the tile size and 200 is the number of nucleotides flanking the exons. For example, with 50-mers, having 40 as tile size ensures that at least 10 nucleotides of the reads will be mapped to either of the tiles. This program generates the fusion junction library in fasta format for each candidate <ins class="diffchange diffchange-inline">with DASPER>1.0</ins>, split in several files each one containing at most 2M entries. It also creates two additional files: one (data_jobList1.txt) including the instructions to index the library and align the reads to the files, and the second (data_jobList2.txt) including the instructions to aggregate the results of the alignment. How to run those jobs depends on the user architecture of the cluster. It is worth noting that data_jobList1.txt expects a compressed file with all the single end reads called: data_allReads.txt.gz. The auxiliary program [[FusionSeq_List of programs#export2mrf|export2mrf]] generates this file automatically from the output files (export) of the Illumina platform. Otherwise, the user should provide a file with all the reads, one sequence per line. One way to do this from the fastq data is:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> cat data_1.fastq data_2.fastq | gawk '$1 ~/^@/ { getline; print }' - > data_allReads.txt</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> cat data_1.fastq data_2.fastq | gawk '$1 ~/^@/ { getline; print }' - > data_allReads.txt</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>where data_1.fastq data_2.fastq are the fastq data of the two ends.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>where data_1.fastq data_2.fastq are the fastq data of the two ends.</div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:11 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=836&oldid=prevAsboner at 11:14, 17 February 20112011-02-17T11:14:30Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 11:14, 17 February 2011</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 4:</td>
<td colspan="2" class="diff-lineno">Line 4:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>For data generated by the Genome Analyzer II and aligned with Eland, the program [[FusionSeq_List_of_programs#export2mrf|export2mrf]] provided with FusionSeq can be used to perform the conversion. All programs provide a brief explanation of their usage when run without parameters. A typical analysis session may look like the following:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>For data generated by the Genome Analyzer II and aligned with Eland, the program [[FusionSeq_List_of_programs#export2mrf|export2mrf]] provided with FusionSeq can be used to perform the conversion. All programs provide a brief explanation of their usage when run without parameters. A typical analysis session may look like the following:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.gfr.log</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf <ins class="diffchange diffchange-inline">| gfrClassify </ins>> data.1.gfr 2> data.1.gfr.log</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ (gfrMitochondrialFilter < data.1.gfr | gfrRepeatMaskerFilter repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ (gfrMitochondrialFilter < data.1.gfr | gfrRepeatMaskerFilter repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:11 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=834&oldid=prevAsboner: /* Auxilliary data */2011-02-16T11:18:05Z<p><span class="autocomment">Auxilliary data</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 11:18, 16 February 2011</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 55:</td>
<td colspan="2" class="diff-lineno">Line 55:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>If the auxilliary modules are installed, then, the fourth command generates the corresponding files and print the gfr file to standard output. </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>If the auxilliary modules are installed, then, the fourth command generates the corresponding files and print the gfr file to standard output. </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> $ (gfr2images < data.confidence.gfr | gfr2bed | gfr2fasta | gfr2gff) 2> data.aux.log</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> $ (gfr2images < data.confidence.gfr | gfr2bed | gfr2fasta | gfr2gff) 2> data.aux.log</div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>The files needs to be properly located into the directory structure described in Installing CGIs.</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>The files needs to be properly located into the directory structure described in Installing CGIs<ins class="diffchange diffchange-inline">. Also, note that [[#gfrCountPairTypes|gfrCountPairTypes]] should be executed before [[#gfr2images|gfr2images]]</ins>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><center>[[#top|Top]]</center></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><center>[[#top|Top]]</center></div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:11 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=736&oldid=prevAsboner at 06:44, 3 December 20102010-12-03T06:44:11Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 06:44, 3 December 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.gfr.log</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.gfr.log</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ (gfrMitochondrialFilter < data.1.gfr | gfrRepeatMaskerFilter <del class="diffchange diffchange-inline">/path/to/</del>repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ (gfrMitochondrialFilter < data.1.gfr | gfrRepeatMaskerFilter repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAbnormalInsertSizeFilter 0.01 | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | gfrAnnotationConsistencyFilter ribosomal | </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAbnormalInsertSizeFilter 0.01 | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | gfrAnnotationConsistencyFilter ribosomal | </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAnnotationConsistencyFilter pseudogenes | gfrBlackListFilter blackList.txt | gfrLargeScaleHomologyFilter | </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAnnotationConsistencyFilter pseudogenes | gfrBlackListFilter blackList.txt | gfrLargeScaleHomologyFilter | </div></td></tr>
<tr><td colspan="2" class="diff-lineno">Line 36:</td>
<td colspan="2" class="diff-lineno">Line 36:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>The second command:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>The second command:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ (gfrMitochondrialFilter < data.1.gfr | gfrRepeatMaskerFilter <del class="diffchange diffchange-inline">/path/to/</del>repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ (gfrMitochondrialFilter < data.1.gfr | gfrRepeatMaskerFilter repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAbnormalInsertSizeFilter 0.01 | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | gfrAnnotationConsistencyFilter ribosomal | </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAbnormalInsertSizeFilter 0.01 | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | gfrAnnotationConsistencyFilter ribosomal | </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAnnotationConsistencyFilter pseudogenes | gfrBlackListFilter blackList.txt | gfrLargeScaleHomologyFilter | </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> gfrAnnotationConsistencyFilter pseudogenes | gfrBlackListFilter blackList.txt | gfrLargeScaleHomologyFilter | </div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:11 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=730&oldid=prevAsboner: /* Filtration cascade module */2010-12-03T06:37:18Z<p><span class="autocomment">Filtration cascade module</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 06:37, 3 December 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 34:</td>
<td colspan="2" class="diff-lineno">Line 34:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Filtration cascade module==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Filtration cascade module==</div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>The second command</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>The second command<ins class="diffchange diffchange-inline">:</ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline"> </del>$ (<del class="diffchange diffchange-inline">gfrAbnormalInsertSizeFilter 0.01 </del>< data.1.gfr | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ (<ins class="diffchange diffchange-inline">gfrMitochondrialFilter </ins>< data.1.gfr | <ins class="diffchange diffchange-inline">gfrRepeatMaskerFilter /path/to/repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </ins></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline"> </del>gfrAnnotationConsistencyFilter ribosomal | gfrBlackListFilter blackList.txt | gfrLargeScaleHomologyFilter | </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> gfrAbnormalInsertSizeFilter 0.01 | </ins>gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | <ins class="diffchange diffchange-inline"> </ins>gfrAnnotationConsistencyFilter ribosomal <ins class="diffchange diffchange-inline">| </ins></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline"> </del>gfrRibosomalFilter | gfrSmallScaleHomologyFilter) > data.gfr 2> data.log</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> gfrAnnotationConsistencyFilter pseudogenes </ins>| gfrBlackListFilter blackList.txt | <ins class="diffchange diffchange-inline"> </ins>gfrLargeScaleHomologyFilter | </div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> </ins>gfrRibosomalFilter | gfrSmallScaleHomologyFilter) > data.gfr 2> data.log</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div></pre></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div></pre></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>runs several filters to remove potential artifacts and generate a high-confidence list of fusion candidates. The order of the filters affects only the computation time of the processing. However, [[FusionSeq_List of programs#gfrAddInfo|gfrAddInfo]] needs to be executed prior to [[FusionSeq_List of programs#gfrAnnotationConsistencyFilter|gfrAnnotationConsistencyFilter]] or [[FusionSeq_List of programs#gfrBlacklistFilter|gfrBlackListFilter]], because they require gene symbols and descriptions. </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>runs several filters to remove potential artifacts and generate a high-confidence list of fusion candidates. The order of the filters affects only the computation time of the processing. However, [[FusionSeq_List of programs#gfrAddInfo|gfrAddInfo]] needs to be executed prior to [[FusionSeq_List of programs#gfrAnnotationConsistencyFilter|gfrAnnotationConsistencyFilter]] or [[FusionSeq_List of programs#gfrBlacklistFilter|gfrBlackListFilter]], because they require gene symbols and descriptions. </div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:11 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=729&oldid=prevAsboner at 06:36, 3 December 20102010-12-03T06:36:44Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 06:36, 3 December 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.gfr.log</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.gfr.log</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ (<del class="diffchange diffchange-inline">gfrAbnormalInsertSizeFilter 0.01 </del>< data.1.gfr | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ (<ins class="diffchange diffchange-inline">gfrMitochondrialFilter </ins>< data.1.gfr | <ins class="diffchange diffchange-inline">gfrRepeatMaskerFilter /path/to/repeatMasker.interval 5 | gfrCountPairTypes | gfrExpressionConsistencyFilter | </ins></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline"> </del>gfrAnnotationConsistencyFilter ribosomal | gfrBlackListFilter blackList.txt | gfrLargeScaleHomologyFilter | </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> gfrAbnormalInsertSizeFilter 0.01 | </ins>gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | <ins class="diffchange diffchange-inline"> </ins>gfrAnnotationConsistencyFilter ribosomal <ins class="diffchange diffchange-inline">| </ins></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline"> </del>gfrRibosomalFilter | gfrSmallScaleHomologyFilter) > data.gfr 2> data.log</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> gfrAnnotationConsistencyFilter pseudogenes </ins>| gfrBlackListFilter blackList.txt | <ins class="diffchange diffchange-inline"> </ins>gfrLargeScaleHomologyFilter | </div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> </ins>gfrRibosomalFilter | gfrSmallScaleHomologyFilter) > data.gfr 2> data.log</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ gfrConfidenceValues data < data.gfr > data.confidence.gfr</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ gfrConfidenceValues data < data.gfr > data.confidence.gfr</div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:12 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=719&oldid=prevAsboner at 06:51, 21 November 20102010-11-21T06:51:49Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 06:51, 21 November 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 18:</td>
<td colspan="2" class="diff-lineno">Line 18:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ validateBpJunctions < data_00002_AB.bp | bpFilter 4 4 100 0.01 30 > data_00002_AB.filtered.bp </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ validateBpJunctions < data_00002_AB.bp | bpFilter 4 4 100 0.01 30 > data_00002_AB.filtered.bp </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ bp2alignment data_00002_AB.filtered.bp > data_00002_breakPointAlignments.txt</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ bp2alignment <ins class="diffchange diffchange-inline">< </ins>data_00002_AB.filtered.bp > data_00002_breakPointAlignments.txt</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ bp2wig data_00002_AB.filtered.bp </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ bp2wig data_00002_AB.filtered.bp </div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:12 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=718&oldid=prevAsboner at 06:50, 21 November 20102010-11-21T06:50:29Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 06:50, 21 November 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 4:</td>
<td colspan="2" class="diff-lineno">Line 4:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>For data generated by the Genome Analyzer II and aligned with Eland, the program [[FusionSeq_List_of_programs#export2mrf|export2mrf]] provided with FusionSeq can be used to perform the conversion. All programs provide a brief explanation of their usage when run without parameters. A typical analysis session may look like the following:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>For data generated by the Genome Analyzer II and aligned with Eland, the program [[FusionSeq_List_of_programs#export2mrf|export2mrf]] provided with FusionSeq can be used to perform the conversion. All programs provide a brief explanation of their usage when run without parameters. A typical analysis session may look like the following:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><pre></div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.log</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1<ins class="diffchange diffchange-inline">.gfr</ins>.log</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ (gfrAbnormalInsertSizeFilter 0.01 < data.1.gfr | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ (gfrAbnormalInsertSizeFilter 0.01 < data.1.gfr | gfrPCRFilter 4 4 | gfrProximityFilter 1000 | gfrAddInfo | </div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:12 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=717&oldid=prevAsboner: /* Fusion detection module */2010-11-21T06:50:02Z<p><span class="autocomment">Fusion detection module</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 06:50, 21 November 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 24:</td>
<td colspan="2" class="diff-lineno">Line 24:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Fusion detection module==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Fusion detection module==</div></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>The first command ([[FusionSeq_List of programs#geneFusions|geneFusions]]) will create the first list of candidate fusion transcripts. The <del class="diffchange diffchange-inline">parameters </del>"data" <del class="diffchange diffchange-inline">and "4" correspond </del>to the prefix <del class="diffchange diffchange-inline">(data) </del>that will be used to generate the IDs of the candidates, namely data_00001, data_00002, etc. "4" is the minimum number of PE reads needed to call a candidate. The program reads data.mrf from standard input and save the output to data.1.gfr as well as logging information on data.1.log. Note that all programs will generate some logging information in this format:</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>The first command ([[FusionSeq_List of programs#geneFusions|geneFusions]]) will create the first list of candidate fusion transcripts<ins class="diffchange diffchange-inline">:</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline"> $ geneFusions data 4 < data.mrf > data.1.gfr 2> data.1.gfr</ins>.<ins class="diffchange diffchange-inline">log</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>The <ins class="diffchange diffchange-inline">parameter </ins>"data" <ins class="diffchange diffchange-inline">corresponds </ins>to the prefix that will be used to generate the IDs of the candidates, namely data_00001, data_00002, etc.<ins class="diffchange diffchange-inline">, whereas </ins>"4" is the minimum number of <ins class="diffchange diffchange-inline">inter-transcript </ins>PE reads needed to call a candidate. The program reads data.mrf from standard input and save the output to data.1.gfr as well as logging information on data.1.log. Note that all programs will generate some logging information in this format:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> WARNING: <program.name>_<type.of.information>: <value></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div> WARNING: <program.name>_<type.of.information>: <value></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>Hence we recommend to always capture the LOG information in a file by redirecting the STDERR with '2>'.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>Hence we recommend to always capture the LOG information in a file by redirecting the STDERR with '2>'.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>The <del class="diffchange diffchange-inline">output in </del>data.1.gfr <del class="diffchange diffchange-inline">can be considered </del>the most comprehensive list of fusion candidates. </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>The <ins class="diffchange diffchange-inline">content of </ins>data.1.gfr <ins class="diffchange diffchange-inline">is </ins>the most comprehensive list of fusion candidates.</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div> </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Filtration cascade module==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>==Filtration cascade module==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>The second command</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>The second command</div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:12 -->
</table>Asbonerhttp://info.gersteinlab.org/index.php?title=How_to_execute_FusionSeq&diff=715&oldid=prevAsboner: /* Junction-Sequence Identifier module */2010-11-20T18:37:10Z<p><span class="autocomment">Junction-Sequence Identifier module</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 18:37, 20 November 2010</td>
</tr>
<tr><td colspan="2" class="diff-lineno">Line 65:</td>
<td colspan="2" class="diff-lineno">Line 65:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ validateBpJunctions < data_00002_AB.bp | bpFilter 4 4 100 0.01 30 > data_00002_AB.filtered.bp </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ validateBpJunctions < data_00002_AB.bp | bpFilter 4 4 100 0.01 30 > data_00002_AB.filtered.bp </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'>-</td><td style="background: #ffa; color:black; font-size: smaller;"><div>$ bp2alignment data_00002_AB.filtered.bp > data_00002_breakPointAlignments.txt</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>$ bp2alignment <ins class="diffchange diffchange-inline">< </ins>data_00002_AB.filtered.bp > data_00002_breakPointAlignments.txt</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ bp2wig data_00002_AB.filtered.bp </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>$ bp2wig data_00002_AB.filtered.bp </div></td></tr>
<!-- diff generator: internal 2024-03-28 17:34:12 -->
</table>Asboner