Hi, I have a question about fastq and fastq.gz files. I understand that fastq.gz is the compressed version of a fastq file. Can I combine all the R1.fastq.gz files, and separately all the R2.fastq.gz files, before feeding them to FastQC? Thanks.
-
Looks like fastqc uses one thread per file. The fastqc --help text says of --threads: "Specifies the number of files which can be processed simultaneously". Perhaps set --threads equal to the number of fastq files you have, then give it all the fastq files as inputs? Divide-and-conquer is what you need here.
The problem with cat'ing all your fastq files is that the statistics would be calculated over all reads pooled together: you might not catch an anomaly in one subset of reads, especially if it gets averaged out by the other three sets of reads.
Last edited by winsettz; 10-15-2013, 12:00 PM.
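For what it's worth, the per-lane approach could look roughly like the sketch below. The lane file names are the ones used in this thread, the output directory name is made up, and the dry-run branch is only there so the snippet does something sensible when fastqc or the files are missing.

```shell
# Sketch only: one FastQC report per lane file, one thread per file.
# File names follow this thread; "fastqc_per_lane" is an assumed directory name.
set -- L001_R1.fastq.gz L002_R1.fastq.gz L003_R1.fastq.gz L004_R1.fastq.gz
nfiles=$#
outdir=fastqc_per_lane

if command -v fastqc >/dev/null 2>&1 && [ -e "$1" ]; then
    # FastQC expects the output directory to exist already.
    mkdir -p "$outdir"
    fastqc --threads "$nfiles" --outdir "$outdir" "$@"
else
    # fastqc or the input files are not available: just show the command.
    echo "dry run: fastqc --threads $nfiles --outdir $outdir $*"
fi
```

Each lane then gets its own report, so a problem confined to one lane is not diluted by the others.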
-
Originally posted by lala2013:
Sorry, I didn't make it clear. Is it possible to do the following and feed R1.fastq.gz to FastQC?
cat L001_R1.fastq.gz L002_R1.fastq.gz L003_R1.fastq.gz L004_R1.fastq.gz > R1.fastq.gz
Code:
zcat L001_R1.fastq.gz L002_R1.fastq.gz L003_R1.fastq.gz L004_R1.fastq.gz | gzip -c - > R1.fastq.gz
-
Originally posted by vivek_:
I think cat does concatenate gzipped files. You don't need to gzip again either.
O.K. I just went to the FastQC site, and the Change Log says:
3-5-12: Version 0.10.1 released
Added a workaround to allow the analysis of concatenated gzipped files (emphasis mine)
Fixed a bug when FastQC was installed in a path containing characters needing to be escaped in a URL
Added an option to specify the location of the java interpreter on the command line
Last edited by kmcarr; 10-15-2013, 01:33 PM. Reason: Should have read the release notes before posting
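A quick way to convince yourself that plain cat produces a valid gzip file is the small check below. It is only a sketch: the two one-read FASTQ files are made up for the test and written to a temporary directory, and it uses gzip -dc rather than zcat because zcat behaves differently on some systems.

```shell
# Work in a throwaway directory so nothing real is overwritten.
tmpdir=$(mktemp -d)
cd "$tmpdir" || exit 1

# Two tiny, made-up FASTQ records standing in for two lanes.
printf '@read1\nACGT\n+\nIIII\n' | gzip -c > L001_R1.fastq.gz
printf '@read2\nTTGA\n+\nIIII\n' | gzip -c > L002_R1.fastq.gz

# Plain cat: the result is a multi-member gzip file, no recompression needed.
cat L001_R1.fastq.gz L002_R1.fastq.gz > R1.fastq.gz

# Integrity check, then count the decompressed lines (4 per record).
gzip -t R1.fastq.gz && echo "gzip stream OK"
nlines=$(gzip -dc R1.fastq.gz | wc -l | tr -d ' ')
echo "lines: $nlines"   # prints "lines: 8"
```

Both records come back out, which is why re-compressing through zcat | gzip is not required. Whether a given tool reads past the first gzip member is a separate question, and that is what the 0.10.1 change log entry above addresses for FastQC.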