Unconfigured Ad

**maubp** · 01-27-2010, 09:08 AM

Most people I've talked to use their own Perl or Python scripts for this, although EMBOSS are looking at adding this kind of tool to their suite.

**bbimber** · 01-27-2010, 10:32 AM

ok, thanks for the reply. that's the impression I was getting from google.

in case anyone else reads this, i did come across this:

FASTX-Toolkit

http://hannonlab.cshl.edu/fastx_toolkit/index.html

**bioenvisage** · 01-27-2010, 11:00 AM

hi .. is there any script to mask or remove the low quality repeats and also simple repeats..I would also
like to know whether this will create problems in denovo assembly...

**Zigster** · 01-28-2010, 12:44 PM

Originally posted by bbimber View Post

ok, thanks for the reply. that's the impression I
http://hannonlab.cshl.edu/fastx_toolkit/index.html

is the FASTQ Quality Filter a variable trimmer?

**gabriel.lichtenstein** · 03-16-2010, 04:25 AM

updates on this thread?

would you suggest me a perl script for quality trimming illumina 1.3 reads?

what do you think about clc trim sequences tool? and abyss -q option?

**maubp** · 03-16-2010, 04:34 AM

I don't have any perl examples, but there are some very simple Python examples in the Biopython Tutorial (search for FASTQ):

Page Redirection

http://biopython.org/DIST/docs/tutorial/Tutorial.html

Page not found · GitHub Pages

http://biopython.org/DIST/docs/tutorial/Tutorial.pdf

and here:

404 Page not found

http://news.open-bio.org/news/2009/09/biopython-fast-fastq/

**Simon Anders** · 03-16-2010, 04:48 AM

I am unconvinced that trimming low quality reads is necessary at all. After all, most aligners (e.g., Maq, Bowtie, BWA; but not Eland!) take into account the quality score and disregard or downweight low quality reads automatically.

Simon

**bbimber** · 03-16-2010, 05:11 AM

if you are interested in illumina, look into fastx toolkit (link above). there's a command line tool to do it. they might also have a web interface for it, but i'm not 100% positive. the logic behind their trimming is probably good for short reads, but not as optimal for longer ones like 454.

**xuer** · 03-25-2010, 01:40 PM

Originally posted by mard View Post

Yes it tells you the number of reads that have been marked as duplicates, as well as the total number of reads. But note that reads that Picard marks as duplicates do not necessarily have identical sequence they just map to the same chromosomal location.

so , it looks that Picard is not good choice for that

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, Yesterday, 11:08 AM	0 responses 6 views 0 reactions	Last Post by SEQadmin2 Yesterday, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 11 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 53 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

Quality trimmming / Mask low quality bases?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News