Seqanswers Leaderboard Ad

**shurjo** · 09-28-2010, 01:42 PM

I've used Picard and it works fine for me.

**maubp** · 09-29-2010, 01:55 AM

You may want to filter the BAM file to remove any non-primary mappings (otherwise you could get duplicate entries in the FASTQ file). The tools may do that for you.

You may also want to append /1 and /2 to the forward and reverse read names (this information isn't currently stored in SAM/BAM format but there is a proposed tag for the read name suffix in the draft standard update).

Also double check that any reads mapped to the reverse stand get reverse complemented when writing the FASTQ file since you want to recover the input sequences.

There are also DIY approaches, for example BAM to SAM and then a Perl/Python script. I have some experimental code for Biopython to do this too.

There was a thread on this on the samtools-help mailing list in August 2010, "BAM to fastq how?"

**ekg** · 09-29-2010, 12:58 PM

Bamtools (http://github.com/pezmaster31/bamtools) can convert BAM to FASTQ.

bamtools convert -in file1.bam -in file2.bam ... -format fastq >reads.fq

**ElMichael** · 11-15-2010, 10:38 AM

Hi,

For BamtoFastq convertion I use Bamtools.
But when I try to convert one of my bam files to fastq I get the following error message
"BGZF ERROR: read block failed - could not read data from block"
The problem is that after this step bamtools exits. Is it possible to avoid it? I don't know, somehow to tell bamtools just to skip such block and continue. Or, like in the picard, is there any VALIDATION_STRIGENCY option that could be set lenient or silent?
Just to mention, these bam files contain unmapped PE reads.
thanks

**KevinLam** · 11-15-2011, 11:57 PM

On Picard,
my service provider mentioned this
"Using picard tools directly has one significant drawback. Picard tools will read in sequence from the BAM
line by line and cache it until it has both reads. Once it has both reads it will print them out and free the
memory. Unfortunately this means that every read which doesn't have the pairs near each other will
take memory. In the example above it took 2.5GB of memory for 120GB of sequence but this is not
guaranteed and will get worse on larger builds.
"

Sounds terrible to me..

fortunately there's method 2

'You can specify samtools memory usage (it'll use temporary files) so if you sort the BAM by name prior
to running picard tools on it you guarantee the reads are next to each other and picard tools will barely
use any memory. '

side question, was there anything in the original fastq one might want to keep that you can't find in the sorted bams? I am inclined to retrieve the original fastq files but data storage might be a problem for me.

**swbarnes2** · 11-16-2011, 09:39 AM

I've use Picard on .bams generated by bwa/samtools, and it definately keeps the unmapped reads. But that's because the .bam has them. If you used an aligner that tossed them, or put them in another .bam (didn't bowtie used to do that be default?) Then there's nothing any software can do about that.

I've never tried to get them back out as paired reads. I assume that it uses the flag to know which is read 1 and which is read 2, but it might not know to order them properly. If your .bam has all the reads sorted by name, and you haven't filtered out any single reads, I bet the fastqs would be in the right order.

**tsucheta** · 02-13-2012, 12:46 PM

Try using bam2fastq from hudsonalpha at http://www.hudsonalpha.org/gsl/software/bam2fastq.php. It is very quick (processed my bam files size ranging from 0.5 - 4 GB(8 files) in less than 10 minutes in a standard 2 core linux machine.)

**Johnnyalive** · 02-12-2013, 07:04 AM

Help using bamtools

I'm new to this and looking for help too - when I use bamtools to convert my .bam file to fastq, I only get one output file. Is it possible to split pair-ended reads into two output files? Can someone suggest a method?
Many thanks,
Johnny.

**vivek_** · 02-12-2013, 08:02 AM

You just specify two different output files like:

java picard-tools/SamToFastq.jar I=Input.bam F=seq1_1.fastq F2=seq1_2.fastq

You can also split these by read groups using additional command line arguments.

**abhinay** · 03-05-2013, 12:42 AM

TopHat

The following command in Tophat can convert bam to fastq (with basic settings)

bam2fastx -q -Q -A -o output.fastq input.bam

for more manipulation

bam2fastx [--fasta|-a|--fastq|-q] [--color] [-Q] [--sam|-s|-t]
[-M|--mapped-only|-A|--all] [-o <outfile>] [-P|--paired] [-N] <in.bam>

Note: By default, reads flagged as not passing quality controls are
discarded; the -Q option can be used to ignore the QC flag.

Use the -N option if the /1 and /2 suffixes should be appended to
read names according to the SAM flags

**amarth** · 03-05-2013, 08:01 AM

Originally posted by abhinay View Post

The following command in Tophat can convert bam to fastq (with basic settings)

bam2fastx -q -Q -A -o output.fastq input.bam

for more manipulation

bam2fastx [--fasta|-a|--fastq|-q] [--color] [-Q] [--sam|-s|-t]
[-M|--mapped-only|-A|--all] [-o <outfile>] [-P|--paired] [-N] <in.bam>

Note: By default, reads flagged as not passing quality controls are
discarded; the -Q option can be used to ignore the QC flag.

Use the -N option if the /1 and /2 suffixes should be appended to
read names according to the SAM flags

I second that

**nahalm63** · 11-26-2014, 09:19 AM

Hi, I am new here. Can any one tell me what script you use to convert BAM files to FASTQ in PICARD? tnx

Originally posted by malachig View Post

After a quick search I found these:

Hydra
Picard (SAMToFastq)
HudsonAlpha
Possibly EMBOSS

Any comments on these? Any other options for BAM-to-FASTQ conversion?

Basically I want to recover all paired-end reads (both R1 and R2) that were fed into the alignment that produced the BAM file, whether they mapped successfully or not.

**blancha** · 11-26-2014, 09:28 AM

Code:

java -jar /usr/local/tools/picard-tools-1.114/SamToFastq.jar \
VALIDATION_STRINGENCY=SILENT \
INPUT=HI.1965.007.Index_1.FL_K562-110k-A.bam \
FASTQ=HI.1965.007.Index_1.FL_K562-110k-A_R1.fastq \
SECOND_END_FASTQ=HI.1965.007.Index_1.FL_K562-110k-A_R2.fastq \
&> bamtofastq.sh.log

**Thorondor** · 03-26-2015, 08:47 AM

found this thread and decided to revive it.
Did anyone tried to get back to several fastq pairs r1 and r2 merged into one bam file. Alignment was done with bwa mem, merging with biobambam.
3 seperately sequenced lanes where the input.
Right now I use picard bam2fastq are there any other feasible options?
And do I really get back to the 100% identical fastq files which where the original input?

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Convert BAM file to FASTQ

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News