Seqanswers Leaderboard Ad

**mastal** · 01-10-2013, 11:12 AM

Demystifying the MiSeq!

Hi microgirl123,

There are various websites as well as Illumina documents with helpful diagrams showing where the TruSeq index sequences go.

Some of the ones I have found most helpful are:

The 'Illumina - all flavors' web page from the U. Texas at Austin Genome Sequencing Facility:

Illumina - all flavors - Genomic Sequencing and Analysis Facility User Support Wiki - UT Austin Wikis

https://wikis.utexas.edu/display/GSAF/Illumina+-+all+flavors

The Tufts University Core Facility have a document called "Illumina TruSeq DNA Adapters De-Mystified" by James Schiemer, at

http://genomics.med.tufts.edu/documents/protocols/TUCF_Understanding_Illumina_TruSeq_Adapters.pdf

Best wishes,
Maria

**microgirl123** · 01-10-2013, 11:40 AM

Thanks, Maria. I hadn't seen the U. Texas info. The Tufts adapter info has been really helpful.

Also, does anyone know what the FASTQ files really are? Are they the sequence of bases that are fluorescing or are they the reverse complement of those bases, i.e. the bases actually present on the strand being sequenced?

**krobison** · 01-10-2013, 12:13 PM

Originally posted by microgirl123 View Post

Also, does anyone know what the FASTQ files really are? Are they the sequence of bases that are fluorescing or are they the reverse complement of those bases, i.e. the bases actually present on the strand being sequenced?

Quality scores increase from left to right; therefore the calls are what fluorescent nucleotide was added.

**microgirl123** · 01-10-2013, 12:31 PM

Quality scores increase from left to right; therefore the calls are what fluorescent nucleotide was added.

I'm sure I'm being dense here, but I have no idea what this means. What I'm trying to ask is, when the MiSeq makes its basecalls and creates FASTQ files, is it telling me what the fluorescent molecule was (let's say a T) or is it telling me what the fluorescent molecule bound to (an A). Of course, there's two steps here, the basecalling and the FASTQ file creation, so either of those steps could process the fluorescence to report what is actually on the strand (the A).

I'm really having trouble deciding which strand of DNA is being sequenced when and what the software is actually reporting!

**JackieBadger** · 01-10-2013, 01:10 PM

Illumina support are really good and will have no problem explaining this all to you over the phone

**mastal** · 01-10-2013, 02:01 PM

Demystifying the MiSeq!

I think the nucleotides reported in the fastq files are the nucleotides being incorporated.

But just at the moment I can't remember if I have any reference that specifically states this.

Maria

**kmcarr** · 01-10-2013, 07:29 PM

MG,

Have a look at this document from Illumina which pretty clearly describes the structure of the library molecules and relative position/orientation of the sequencing primers.

After cluster generation, linearization and blocking the Read 1 sequencing primer (SP) is annealed. After completion of read 1 sequencing the nascent strand is denatured and the Index SP is annealed; it anneals to the same strand as the read 1 SP. After the index read is complete, and IF you are performing a paired end protocol, the resynthesis (a.k.a. cluster regeneration or turn around) chemistry is performed. After resynthesis the Read 2 SP is annealed. The Read 2 SP is actually the reverse complement if the Index SP.

**microgirl123** · 01-11-2013, 05:53 AM

Thanks, kmcarr and Maria. I never thought to look for Illumina documents on anything but the MiSeq! Also, my second query to Illumina tech support yielded better results. The FASTQ files are reporting the sequence of the fluorescent, incorporated nucleotides, which are really the sequence of the original strand because what is being sequenced is the reverse complement! This all makes my brain hurt!

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Demystifying the MiSeq!

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News