
**Brian Bushnell** · 04-23-2015, 11:32 PM

That's correct. You should be fine, though it does look like you gained a weird bias on your last 1bp. However, FastQC makes it hard to tell whether that's 1bp or several. I assume that you are overtrimming the reads by removing even 1bp of sequence that matches the adapter, or something like that.

**rhinoceros** · 04-24-2015, 12:12 AM

Originally posted by Brian Bushnell

That's correct. You should be fine, though it does look like you gained a weird bias on your last 1bp. However, FastQC makes it hard to tell whether that's 1bp or several. I assume that you are overtrimming the reads by removing even 1bp of sequence that matches the adapter, or something like that.

Thanks for the reply. About the 3'-end, I'm indeed most likely overtrimming. In trim_galore:

Code:

--stringency <INT> Overlap with adapter sequence required to trim a sequence. Defaults to a very stringent setting of 1, i.e. even a single bp of overlapping sequence will be trimmed off from the 3' end of any read.

Anyway, relatively little information is lost because of this, so there's no harm. I was more concerned about the 5'-end, since I had never seen what trimmed Nextera reads are supposed to look like. I also thought it was a little bit weird how the Nextera transposase sequence started appearing from the middle of the reads onwards, but I guess that's normal.
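To make the stringency setting quoted above concrete, here is a minimal Python sketch of 3'-end adapter trimming with a minimum-overlap threshold. It is not how trim_galore or Cutadapt is implemented (those tolerate mismatches; this uses exact matching only), and the function name `trim_3prime_adapter` is hypothetical. The adapter string is assumed to be the Nextera transposase sequence CTGTCTCTTATA.

```python
def trim_3prime_adapter(read: str, adapter: str, min_overlap: int = 1) -> str:
    """Remove an adapter, or a prefix of it, from the 3' end of a read.

    Simplified illustration: exact matching only, min_overlap assumed >= 1.
    """
    # Case 1: the whole adapter occurs inside the read; cut at its first occurrence.
    idx = read.find(adapter)
    if idx != -1 and len(adapter) >= min_overlap:
        return read[:idx]

    # Case 2: the read ends partway into the adapter. Look for the longest
    # read suffix that equals an adapter prefix of at least min_overlap bases.
    max_len = min(len(read), len(adapter) - 1)
    for k in range(max_len, min_overlap - 1, -1):
        if read.endswith(adapter[:k]):
            return read[:len(read) - k]

    # No acceptable overlap: leave the read untouched.
    return read


NEXTERA = "CTGTCTCTTATA"  # assumed adapter sequence for this sketch

# A read that merely ends in "C" loses that base when min_overlap is 1 ...
print(trim_3prime_adapter("ACGTACGTC", NEXTERA, min_overlap=1))  # ACGTACGT
# ... but is left alone when at least 3 matching bases are required.
print(trim_3prime_adapter("ACGTACGTC", NEXTERA, min_overlap=3))  # ACGTACGTC
```

With a minimum overlap of 1, any read that ends in the adapter's first base is shortened by one base, adapter or not, which may be the kind of last-base bias mentioned above.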

**nucacidhunter** · 04-24-2015, 12:42 AM

I also thought it was a little bit weird how the Nextera transposase sequence started appearing from the middle of the reads onwards but I guess that's normal

Your plots indicate that around 15% of library fragments had inserts shorter than the number of sequencing cycles. If the library size cut-off had been a bit larger, the Nextera adapters would not have been sequenced, which would have provided more useful data.
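The relationship between insert length and sequencing cycles can be sketched in a few lines of Python. The insert lengths and the read length below are hypothetical values chosen only for illustration; they are not taken from the plots discussed in this thread, and the function names are made up for this sketch.

```python
def adapter_bases_read(insert_len: int, read_len: int) -> int:
    """Number of cycles that run past the insert and into the adapter."""
    return max(0, read_len - insert_len)


def read_through_fraction(insert_lens: list[int], read_len: int) -> float:
    """Fraction of fragments whose insert is shorter than the read length."""
    if not insert_lens:
        return 0.0
    short = sum(1 for n in insert_lens if n < read_len)
    return short / len(insert_lens)


# Hypothetical library: insert lengths in bp, sequenced for 150 cycles.
inserts = [80, 120, 140, 200, 250, 300, 350, 400, 500, 600]
read_len = 150

print(adapter_bases_read(80, read_len))          # 70 adapter bases in that read
print(read_through_fraction(inserts, read_len))  # 0.3 (3 of 10 inserts < 150 bp)
```

Raising the size cut-off removes the shortest inserts from the list, which lowers the read-through fraction.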

**rhinoceros** · 04-24-2015, 01:34 AM

Originally posted by nucacidhunter

Your plots indicate that around 15% of library fragments had inserts shorter than the number of sequencing cycles. If the library size cut-off had been a bit larger, the Nextera adapters would not have been sequenced, which would have provided more useful data.

I have to admit that I'm a little bit ignorant of the technical details of the sequencing part. Is the library size cut-off related to the fragment size distribution? Also, what is the relationship between fragment sizes and the number of sequencing cycles? The DNA comes from a metagenomic sample that was very small (6.8 ng), so overall I'm quite happy with the output.

**yueluo** · 04-24-2015, 02:32 AM

If your fragment size is smaller than the read length, you will get adapter sequence at the 3'-end of your reads. I think you can try running Brian's BBMerge to merge the read pairs by overlap.
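As a rough illustration of why short fragments can be merged by overlap, the sketch below computes how many insert bases are covered by both reads of a pair. It is only the geometry, assuming both reads have the same length; it says nothing about how BBMerge actually detects overlaps, and `pair_overlap` is a hypothetical helper name.

```python
def pair_overlap(insert_len: int, read_len: int) -> int:
    """Insert bases covered by both reads of a pair (equal read lengths assumed)."""
    # Insert shorter than a read: both reads span the whole insert.
    # Insert between one and two read lengths: the reads overlap in the middle.
    # Insert of two read lengths or more: no overlap.
    return max(0, min(insert_len, 2 * read_len - insert_len))


# Hypothetical 150-cycle paired-end run.
for insert_len in (100, 250, 400):
    print(insert_len, pair_overlap(insert_len, 150))
# 100 100  -> reads fully overlap and also run into the adapter
# 250 50   -> 50 bp overlap, mergeable in principle
# 400 0    -> no overlap, nothing to merge
```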

Cleaning paired-end Nextera prepped reads
