SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
A strange size difference of fastq file qinhao13 Bioinformatics 1 01-06-2015 01:56 AM
Get unique reads when merging FastQ Files from BAM rkizen Bioinformatics 4 08-07-2013 06:23 AM
wheter combine FASTq or bam files after alignment honey Bioinformatics 2 02-19-2013 12:19 AM
Reverse engineering BAM files: BAM -> FASTQ gene coder Bioinformatics 3 01-03-2012 02:42 PM
Difference in fasta files created by fastq-dump meetasunil Introductions 0 09-27-2011 02:08 AM

Reply
 
Thread Tools
Old 05-04-2015, 02:37 PM   #1
genomics
Junior Member
 
Location: Palo Alto, CA

Join Date: Sep 2011
Posts: 1
Default Why there is a difference in size between R1 and R2 fastq files from BAM?

Hi,

I downloaded bunch of bam files from TCGA projects and converted them to fastq files. Even though these BAM files are pair-end WES data, the size of fastq_R1 is different with the size of fastq_R2 in many cases. Sometime, the difference are 1 or several lines as well as almost two fold differences in file size. I used SamToFastq.jar of Picard. The command I used is as follows;

java -Xmx16g -jar SamToFastq.jar TMP_DIR=. INPUT=$file FASTQ=fastqR2 SECOND_END_FASTQ=fastqR2 VALIDATION_STRINGENCY=LENIENT VERBOSITY=ERROR";

For example,
TCGA-50-5946-01A_R1.fastq -> 3504190138
TCGA-50-5946-01A_R2.fastq -> 3504224169

TCGA-A7-A0CE-11A_R1.fastq -> 40393996094
TCGA-A7-A0CE-11A_R2.fastq -> 21008959121

It would be great if you give a clue about this.
genomics is offline   Reply With Quote
Old 05-04-2015, 02:51 PM   #2
Brian Bushnell
Super Moderator
 
Location: Walnut Creek, CA

Join Date: Jan 2014
Posts: 2,695
Default

R2 is generally lower quality and more likely to be trimmed if quality-trimming was done, and less likely to be mapped (if for example the bam only contains mapped reads).
Brian Bushnell is offline   Reply With Quote
Reply

Tags
bam, fastq, picard samtofastq

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 06:37 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2017, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO