SEQanswers

01-27-2012, 05:17 PM   #1
vinay052003
Member
 
Location: Atlanta, US

Join Date: Jan 2010
Posts: 59
How to merge multiple sequencing runs

I have multiple FASTQ files generated by sequencing a single-end library on multiple lanes of an Illumina GA. I am not sure how to merge them to generate a single consolidated reference-genome alignment file using TopHat.
If I had multiple FASTQ files of paired-end reads, I would run TopHat with two comma-separated lists of files. But I am not sure whether I can do the same with single-end reads.
Any thoughts?


Thanks.
01-27-2012, 05:39 PM   #2
adaptivegenome
Super Moderator
 
Location: US

Join Date: Nov 2009
Posts: 437

You can map each FASTQ file separately and then merge the resulting BAM files with samtools.
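
As a rough sketch of this approach (not a tested pipeline), the Python below only builds the command lines for one TopHat run per lane plus a final `samtools merge`; it does not execute anything. The function name, the index name `genome`, the lane file names, and the output directory layout are all made up for illustration. It also assumes a TopHat version that writes `accepted_hits.bam` into its output directory.

```python
from pathlib import Path


def per_lane_commands(index_base, fastqs, out_root="tophat_out", merged="merged.bam"):
    """Build (but do not run) one TopHat command per FASTQ and one samtools merge.

    All names here are hypothetical; adjust paths and options to your setup.
    """
    tophat_cmds = []
    bams = []
    for fq in fastqs:
        # Use the file name up to the first dot as a lane label, e.g. "lane1".
        lane = Path(fq).name.split(".")[0]
        out_dir = f"{out_root}/{lane}"
        # Single-end usage: tophat [options] <index_base> <reads>
        tophat_cmds.append(["tophat", "-o", out_dir, index_base, fq])
        # Assumption: TopHat writes its alignments here as BAM.
        bams.append(f"{out_dir}/accepted_hits.bam")
    # samtools merge <out.bam> <in1.bam> <in2.bam> ...
    merge_cmd = ["samtools", "merge", merged] + bams
    return tophat_cmds, merge_cmd


if __name__ == "__main__":
    cmds, merge = per_lane_commands("genome", ["lane1.fastq", "lane2.fastq"])
    for cmd in cmds + [merge]:
        print(" ".join(cmd))
```

Each argument list could be handed to `subprocess.run(cmd, check=True)` once the paths are real.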
01-30-2012, 09:20 AM   #3
westerman
Rick Westerman
 
Location: Purdue University, Indiana, USA

Join Date: Jun 2008
Posts: 1,104

You should also be able to just 'cat' the FASTQ files together into one large file.
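
For anyone who prefers to do the concatenation from a script, here is a minimal Python sketch of what `cat lane1.fastq lane2.fastq > combined.fastq` does. The function and file names are hypothetical, and it assumes every input file ends with a newline so that records do not run together.

```python
import shutil


def concat_fastq(inputs, output):
    """Append each input file to `output` in order, byte for byte (like `cat`)."""
    with open(output, "wb") as out:
        for path in inputs:
            with open(path, "rb") as src:
                # Stream in chunks so large FASTQ files are not loaded into memory.
                shutil.copyfileobj(src, out)


if __name__ == "__main__":
    # Hypothetical lane files; replace with your own.
    concat_fastq(["lane1.fastq", "lane2.fastq"], "combined.fastq")
```

As far as I know, TopHat's usage line also takes a comma-separated list of read files for single-end data, so concatenating first may not be strictly necessary.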
01-30-2012, 02:22 PM   #4
jbrwn
Member
 
Location: Denver, CO

Join Date: Mar 2011
Posts: 37

Map each file individually in TopHat, then use samtools merge to combine the resulting alignments.
01-31-2012, 03:34 AM   #5
pbluescript
Senior Member
 
Location: Boston

Join Date: Nov 2009
Posts: 224

I would first map them all separately, as others have suggested, and then calculate RPKMs for each lane. Compare the RPKMs between lanes; if they all look similar, you can be reasonably sure there are no lane-specific issues with your run. After that, I would use the cat command to combine the FASTQ files and rerun the mapping on the combined file.
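
To make the comparison step concrete, here is a small Python sketch under some assumptions: per-gene read counts and gene lengths are already available as dictionaries (how you obtain them is up to you), the total mapped reads defaults to the sum of the counts, and "similar" is judged by a Pearson correlation on log2(RPKM + 1). The function names and the choice of correlation are mine, not a standard recipe.

```python
import math


def rpkm(counts, lengths, total_mapped=None):
    """Reads per kilobase of gene per million mapped reads, per gene."""
    if total_mapped is None:
        # Assumption: use the sum of gene counts as the mapped-read total.
        total_mapped = sum(counts.values())
    return {g: counts[g] * 1e9 / (lengths[g] * total_mapped) for g in counts}


def pearson(xs, ys):
    """Plain Pearson correlation of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


def lane_similarity(rpkm_a, rpkm_b):
    """Correlate log2(RPKM + 1) over the genes present in both lanes."""
    genes = sorted(set(rpkm_a) & set(rpkm_b))
    xs = [math.log2(rpkm_a[g] + 1) for g in genes]
    ys = [math.log2(rpkm_b[g] + 1) for g in genes]
    return pearson(xs, ys)


if __name__ == "__main__":
    # Toy numbers, purely illustrative.
    lengths = {"g1": 1000, "g2": 2000, "g3": 500}
    lane1 = rpkm({"g1": 100, "g2": 300, "g3": 50}, lengths)
    lane2 = rpkm({"g1": 210, "g2": 590, "g3": 95}, lengths)
    print(lane_similarity(lane1, lane2))
```

What counts as "similar enough" is a judgment call; a scatter plot of one lane against another is often more informative than a single number.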
Tags
runs multiple rna-seq
