Seqanswers Leaderboard Ad

**xquan** · 05-12-2011, 06:07 AM

The query genome size (3Gb) is much larger than the reference genome (500mb). And reference genome is from de novo assembly contigs. But still mapped reads should not be so low.

**blackjimmy** · 05-19-2011, 12:22 AM

We've also met this question, except that our tag size was 36bp.
21M reads passed filtering, when aligned using bowtie, only about 10k reads mapped to the reference genome. We also found huge duplicate reads in our FASTQ file.
Does Illumina has officially quality control results to tell us whether our sequencing process is OK? Thanks a lot!

**zee** · 05-19-2011, 03:23 AM

I highly recommend you doing your own QC with a program like FASTQC or FASTX and analyzing the quality metrics in each lane.

**xquan** · 05-19-2011, 07:15 AM

Originally posted by blackjimmy View Post

We've also met this question, except that our tag size was 36bp.
21M reads passed filtering, when aligned using bowtie, only about 10k reads mapped to the reference genome. We also found huge duplicate reads in our FASTQ file.
Does Illumina has officially quality control results to tell us whether our sequencing process is OK? Thanks a lot!

The only quality control by Illumina I know is the chastity filtering process. Too lanes of our data completely failed to pass the filtering. And I didn't use any of the reads failed to pass the chastity filtering process. Does anyone know other Illumina quality control?

**xquan** · 05-19-2011, 07:17 AM

Originally posted by zee View Post

I highly recommend you doing your own QC with a program like FASTQC or FASTX and analyzing the quality metrics in each lane.

I have done QC check with FASTQC, and our reads after my QC from all lanes got good results except the k-mer analysis (which gave yellow warning sign).

**glacerda** · 05-19-2011, 07:33 AM

The Illumina mate pair libraries used to be in reverse-forward orientation ( --rf parameter ), Unless something has changed in the mate pair protocol, this could be the cause of the bad mapping.

**xquan** · 05-19-2011, 10:01 AM

Originally posted by glacerda View Post

The Illumina mate pair libraries used to be in reverse-forward orientation ( --rf parameter ), Unless something has changed in the mate pair protocol, this could be the cause of the bad mapping.

Do you mean that I should use --rf instead of --fr for the pair-end reads? I thought mate pair should be forward-forward orientation and use --ff.

**glacerda** · 05-19-2011, 10:35 AM

Hi xquan,

Illumina mate pair libraries are supposed contain outwards facing reads ( <-- --> ) and we should use --rf in bowtie. Illumina Mate Pair libraries are used to long insert lengths, greater than 2 Kbp usually.

Illumina paired end libraries are supposed to contain inwards facing reads ( --> <-- ) and we should use --fr in bowtie. Illumina Paired Ends are used to short insert lengths (at most 500 bp) usually.

As far as I can remeber, 454 and SOLiD use forward-forward ( --> --> )

**xquan** · 05-19-2011, 12:05 PM

Originally posted by glacerda View Post

Hi xquan,

Illumina mate pair libraries are supposed contain outwards facing reads ( <-- --> ) and we should use --rf in bowtie. Illumina Mate Pair libraries are used to long insert lengths, greater than 2 Kbp usually.

Illumina paired end libraries are supposed to contain inwards facing reads ( --> <-- ) and we should use --fr in bowtie. Illumina Paired Ends are used to short insert lengths (at most 500 bp) usually.

As far as I can remeber, 454 and SOLiD use forward-forward ( --> --> )

Thanks very much! I will confirm this with the sequencing company (who told me that their library preparation for mate pair is forward-forward) and try to run bowtie with --rf again.

**DZhang** · 05-22-2011, 06:28 AM

Hi,

In this case, I usually try aligning the data from one end to the reference as single-fragment to see what percentage of reads are mapped.

Douglas

https://www.contigexpress.com

**chadn737** · 05-22-2011, 01:01 PM

Have you tried Blasting some of the reads? You will sometimes be surprised by what you find when doing this.

**DZhang** · 05-22-2011, 01:19 PM

Hi chadn737,

That's a great point. Oftentimes the simplest approach is the best one. In one project, I randomly chose 10 reads and BLASTed it. They all came back mapping to an rRNA gene. No other approach is faster than BLAST to find this out.

Douglas

https://www.contigexpress.com

**stoker** · 05-22-2011, 11:31 PM

I have observed that Illumina instruments have different filters configurations. If your filters has been mounted incorrectly - in wrong positions (this is possible when you have a new device or you have a service repairs) then you may need to change bases in your reads. A to C, G to T and vice versa. We have met this problem in our lab.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 18 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 17 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 49 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Very Bad Mapping Results with several mapping softwares

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News