**blancha** · 12-05-2015, 05:54 AM

The bias in the first bases of reads from libraries generated with random hexamer primers has been documented and discussed over and over again. Do not cut them! You will just be discarding perfectly good bases.

Nucleotide bias in RNASeq data (initial 12-13 bp) - SEQanswers

http://seqanswers.com/forums/showthread.php?t=64396


With Trimmomatic, you can set the minimum quality of the leading or trailing bases with the options LEADING and TRAILING. It's true that there doesn't seem to be an option to cut a specified number of bases off the tail; there is only an option for the head, HEADCROP. But it makes much more sense to trim by quality score anyway, unless you are using an aligner that absolutely requires all reads to have the same length.
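To make the idea of trimming by end quality concrete, here is a minimal Python sketch of what a LEADING/TRAILING-style trim does to a single read. It is an illustration only, not Trimmomatic's code: the function name `trim_leading_trailing`, the default thresholds, and the Phred+33 offset are all assumptions made for the example.

```python
def trim_leading_trailing(seq, qual, leading=3, trailing=3, phred_offset=33):
    """Drop bases from both ends while their quality is below the thresholds.

    seq and qual are the sequence and quality strings of one FASTQ record.
    phred_offset=33 is an assumption (Sanger / Illumina 1.8+ encoding).
    """
    scores = [ord(c) - phred_offset for c in qual]
    start, end = 0, len(seq)
    # Walk in from the 5' end past bases below the leading threshold.
    while start < end and scores[start] < leading:
        start += 1
    # Walk in from the 3' end past bases below the trailing threshold.
    while end > start and scores[end - 1] < trailing:
        end -= 1
    return seq[start:end], qual[start:end]


if __name__ == "__main__":
    # '#' encodes Q2, '!' encodes Q0 and 'I' encodes Q40 under Phred+33.
    print(trim_leading_trailing("ACGTACGT", "##IIII#!"))
```

Unlike a fixed crop, the number of bases removed depends on each read's own qualities, so good bases at the ends are kept.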

Frankly, I would just use the example command given in the Trimmomatic manual, and only change the minimum length, given that you will want to keep only reads long enough to do a proper assembly.

With Cutadapt, you do have the option --cut, which lets you specify the number of bases you want to trim off the 5' and 3' ends. Again, it is preferable to trim by quality unless your assembler requires all reads to be of the same length, which is generally not the case.
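As far as I know, Cutadapt's --cut takes a signed length: a positive value removes bases from the start (5' end) of the read, and a negative value removes them from the end (3' end). The sketch below only mimics that sign convention on a plain string; `cut_fixed` is a made-up helper name and does not call Cutadapt.

```python
def cut_fixed(seq, n):
    """Mimic a signed fixed-length cut on one read.

    n > 0 removes n bases from the 5' end;
    n < 0 removes |n| bases from the 3' end;
    n == 0 leaves the read unchanged.
    """
    if n >= 0:
        return seq[n:]
    return seq[:n]


if __name__ == "__main__":
    read = "ACGTACGT"
    print(cut_fixed(read, 2))   # two bases removed from the 5' end
    print(cut_fixed(read, -3))  # three bases removed from the 3' end
```

Applying both a positive and a negative cut to the same read trims both ends, which is what the original question asked for.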

There is also BBDuk, written by Brian Bushnell, an active member of this forum, which seems to have just about every option imaginable.

**mastal** · 12-05-2015, 07:17 AM

The Trimmomatic step CROP removes bases from the 3' end of the reads.

**blancha** · 12-05-2015, 07:40 AM

@mastal is correct.

The parameter works a bit differently from HEADCROP, though. Rather than specifying the number of bases to cut, you specify the read length after cutting.
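The difference between the two parameters is easy to get wrong, so here is a small Python sketch of the semantics described above. The helper names `headcrop`, `crop` and `crop_length_for_tail_cut` are invented for illustration; they model the behaviour on a string and are not Trimmomatic code.

```python
def headcrop(seq, n):
    """HEADCROP-style: remove the first n bases of the read."""
    return seq[n:]


def crop(seq, length):
    """CROP-style: keep only the first `length` bases of the read."""
    return seq[:length]


def crop_length_for_tail_cut(read_length, tail_bases):
    """Length to give a CROP-style step so that `tail_bases` are removed
    from the 3' end of reads that are all `read_length` bases long."""
    return read_length - tail_bases


if __name__ == "__main__":
    read = "ACGTACGTAC"  # 10 bases
    # Remove 3 bases from the tail first, then 2 from the head.
    keep = crop_length_for_tail_cut(len(read), 3)  # 7
    print(headcrop(crop(read, keep), 2))
```

Note that the order matters in this model: if the head is cropped first, the read is already shorter, so the length passed to the crop step has to be reduced by the number of head bases removed.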

**sebl** · 12-08-2015, 02:48 AM

Thank you for the reply.

I guess you are right and I should work on quality trimming.

Our libraries however were prepared by mechanical shearing of gDNA, not the usual random hexamer protocol in Nextera, if that is what you meant.

cut 5' and 3' ends of paired-ends reads
