Seqanswers Leaderboard Ad

**Skiaphrene** · 11-17-2013, 02:21 PM

Doesn't anybody have any ideas? I'm sorry for the long post, but I wanted to make sure I had "done my homework" before posting for help... The question boils down to:

"What could these Illumina reads with very FASTQC high duplication levels be after eliminating all the most obvious answers?"

Thanks,

-- Alex

**choishingwan** · 12-18-2013, 01:02 AM

From my experience, RNA Sequencing reads does have a relatively high duplication rate base on its nature and most of the time I don't read into the duplication rate from the FastQC report and focus mainly on the sequence score.

**Skiaphrene** · 12-18-2013, 03:25 PM

Hi choishingwan,

Thank you for your answer! I guess maybe I'm looking too much into this... Sometimes you just get problems in the data, in the tools, or both, and you just have to work with them anyway. In this case I would like to point out that some of these replicates had very high read quality scores, while others didn't, so I didn't find any pattern there.

Either way, I'm going to find another practice dataset!

Best regards,

-- Alex

**choishingwan** · 12-18-2013, 06:21 PM

Try and see if those reads are all coming from the same lane or if those are the second read of the read pair. Usually the lane will fail together or in general, the second read pair usually have a relatively lower quality score. If I remember correctly, you should be aiming for q30>80%, you can check illumina for the specification. Another thing to look for is to see if there is a high amount of over represented sequence at the beginning of your reads, that might be adapters that require trimming, though I haven't got a data that require to do so yet.

Topics	Statistics	Last Post
Genomics-Driven Care in Neurodevelopmental Disorders Shows Promising Results by seqadmin Started by seqadmin, 01-09-2025, 04:04 PM	0 responses 443 views 0 likes	Last Post by seqadmin 01-09-2025, 04:04 PM
Study Questions Accuracy of Genetic Testing for Opioid Use Disorder Risk by seqadmin Started by seqadmin, 01-09-2025, 09:42 AM	0 responses 444 views 0 likes	Last Post by seqadmin 01-09-2025, 09:42 AM
New Algorithm Brings Precision and Scalability to Single-Cell RNA Analysis by seqadmin Started by seqadmin, 01-08-2025, 03:17 PM	0 responses 459 views 0 likes	Last Post by seqadmin 01-08-2025, 03:17 PM
Nanopores as Precision Diagnostic Tools in Molecular Biology by seqadmin Started by seqadmin, 01-03-2025, 11:18 AM	1 response 50 views 1 like	Last Post by Tonia 01-05-2025, 12:15 PM

Seqanswers Leaderboard Ad

Announcement

Problem with FASTQC on Trinity Mouse DC reads example dataset

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News