SEQanswers

Old 12-13-2013, 10:43 AM   #1
kkggc
Junior Member
 
Location: SC

Join Date: Dec 2013
Posts: 3
Illumina MiSeq FASTQ files

When you receive two FASTQ files from an Illumina run, are they always paired-end reads?
Old 12-13-2013, 10:58 AM   #2
mastal
Senior Member
 
Location: uk

Join Date: Mar 2009
Posts: 667

It depends on what the files/reads are called.

Depending on how many reads you have, you might have more than one file per sample, with each file holding about 4 million reads.

If the files hold the R1 and R2 reads of a paired-end run, then the header lines of corresponding reads should have identical read identifiers, except that one file has 1:N:0 (R1) and the other has 2:N:0 (R2).
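
As a rough illustration of the header check described above, here is a minimal Python sketch. It assumes well-formed four-line FASTQ records and headers of the form `@<read id> 1:N:0...` or `@<read id> 2:N:0...`; the function names (`open_fastq`, `split_header`, `looks_paired`) and the `max_reads` limit are made up for this example and are not part of any Illumina tool. Older header styles (for example a `/1` and `/2` suffix) are not handled.

```python
import gzip
from itertools import islice, zip_longest


def open_fastq(path):
    """Open a plain or gzip-compressed FASTQ file in text mode."""
    return gzip.open(path, "rt") if str(path).endswith(".gz") else open(path, "rt")


def split_header(line):
    """Split '@<read id> <read>:<filtered>:<control>...' into (read id, read number)."""
    read_id, _, comment = line.strip().partition(" ")
    return read_id, comment.split(":")[0]


def looks_paired(r1_path, r2_path, max_reads=10000):
    """Return True if the first max_reads headers match apart from the read number."""
    checked = 0
    with open_fastq(r1_path) as f1, open_fastq(r2_path) as f2:
        # Assumption: every record is exactly 4 lines, so headers are lines 0, 4, 8, ...
        headers1 = islice(f1, 0, max_reads * 4, 4)
        headers2 = islice(f2, 0, max_reads * 4, 4)
        for a, b in zip_longest(headers1, headers2):
            if a is None or b is None:
                return False  # one file ran out of reads before the other
            id1, n1 = split_header(a)
            id2, n2 = split_header(b)
            if id1 != id2 or (n1, n2) != ("1", "2"):
                return False
            checked += 1
    return checked > 0
```

Calling `looks_paired("sample_R1.fastq.gz", "sample_R2.fastq.gz")` (hypothetical file names) only samples the start of each file, so a `True` result is a strong hint rather than proof.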
Old 12-13-2013, 11:01 AM   #3
kkggc
Junior Member
 
Location: SC

Join Date: Dec 2013
Posts: 3

Okay, but if there are two FASTQ files for the same sample, does that mean they are paired-end reads? The files I have now are definitely paired-end reads, and they do have R1 and R2 in their file names. I am just curious whether there is ever a case where one sample has two FASTQ files that are not paired-end reads.
Old 12-13-2013, 11:29 AM   #4
GenoMax
Senior Member
 
Location: East Coast USA

Join Date: Feb 2008
Posts: 7,031

File names alone are not a fully reliable indicator that two files hold paired-end reads (unless you are sure that the sequence provider followed standard protocols, that identical sample_ID/barcodes were included in the file names, and that the only difference between the names is R1 versus R2, as Maria stated before).
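
To make the file-name comparison concrete, here is a small Python sketch that groups files whose names differ only in R1 versus R2. It assumes the common Illumina naming pattern `<sample>_S<n>_L<lane>_R<1|2>_<chunk>.fastq[.gz]`; if your provider renames files, the pattern will need adjusting. `NAME_RE` and `group_mates` are hypothetical names chosen for this example.

```python
import re
from collections import defaultdict

# Assumed pattern: <sample>_S<n>_L<lane>_R<1|2>_<chunk>.fastq or .fastq.gz
NAME_RE = re.compile(
    r"^(?P<sample>.+)_S(?P<snum>\d+)_L(?P<lane>\d{3})"
    r"_R(?P<read>[12])_(?P<chunk>\d{3})\.fastq(\.gz)?$"
)


def group_mates(filenames):
    """Group file names by everything except the R1/R2 field."""
    groups = defaultdict(dict)
    for name in filenames:
        m = NAME_RE.match(name)
        if m is None:
            continue  # name does not follow the assumed convention
        key = (m["sample"], m["snum"], m["lane"], m["chunk"])
        groups[key]["R" + m["read"]] = name
    return dict(groups)
```

A group holding both an `R1` and an `R2` entry is a candidate pair; a group with only `R1` suggests single-end data or a missing file. As noted above, matching names are only a hint, so the read headers should still be checked.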

Last edited by GenoMax; 12-13-2013 at 11:33 AM.
Old 12-13-2013, 11:33 AM   #5
kkggc
Junior Member
 
Location: SC

Join Date: Dec 2013
Posts: 3

That is understandable. However, I am 100% sure that my two FASTQ files are from the same sample and are paired-end reads. I am just curious whether it is ever possible to have two FASTQ files for the same sample that are NOT paired-end reads.
Old 12-13-2013, 11:37 AM   #6
GenoMax
Senior Member
 
Location: East Coast USA

Join Date: Feb 2008
Posts: 7,031

Sure. If you ran a sample multiple times on separate MiSeq runs, the resulting file names (with the standard post-processing protocol) will be identical, down to the R1 and R2 in the names if they are paired-end reads.

Sample file names use the sample_ID inherited from the run samplesheet. So a mistake in the samplesheet could even assign the same name to files that come from different samples on two separate runs.
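
One way to tell apart identically named files from separate runs is to look inside them: in the standard Illumina header layout the read identifier starts with `<instrument>:<run number>:<flowcell ID>:...`. The Python sketch below compares those three fields from the first read of each file. The function names (`first_header`, `run_key`, `same_run`) are made up for this example, and checking only the first read is a simplifying assumption; a file concatenated from several runs would need a scan of every header.

```python
import gzip


def first_header(path):
    """Return the first line (the first read's header) of a FASTQ file."""
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt") as handle:
        return handle.readline().strip()


def run_key(header):
    """Return (instrument, run number, flowcell ID) from a header line."""
    read_id = header.split(" ", 1)[0].lstrip("@")
    instrument, run_number, flowcell = read_id.split(":")[:3]
    return instrument, run_number, flowcell


def same_run(path_a, path_b):
    """True if the first reads of both files come from the same run and flowcell."""
    return run_key(first_header(path_a)) == run_key(first_header(path_b))
```

If `same_run` returns `False` for two files that share a name, they were most likely produced by different runs and should not be treated as R1/R2 mates of each other.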