Seqanswers Leaderboard Ad

**simonandrews** · 12-04-2010, 02:46 AM

All of the unusual profiles are the result of the overrepresented sequence in your library. Having the same sequence make up 33% of the library will affect the overall base composition, Kmer composition and overall GC content.

As you said, the quality looks OK so there's no technical problem with the sequencing. The duplication level plot will tell you whether your problem is a small number of isolated sequences, or a generally high level of duplication in your library.

What you do about this will largely depend on what the overrepresented sequence is. If it's a small RNA then it just means you original sample is really biased, but if it's something like an adapter or primer then you may be able to improve your sample prep to get rid of it in future runs.

**bioinfosm** · 12-04-2010, 10:31 PM

simon, I think you are the developer of fastqc?

It would be awesome to have sample good fastqc plots for the regular applications: dna re-sequencing, rna-seq, chip-seq, miRNA-seq... etc just to get a good idea for comparison, and your expert comments would definitely help as well!

**simonandrews** · 12-05-2010, 03:04 AM

This is actually something we've been looking into. Setting up a repository with example datasets from different techniques and platforms, along with QC reports and annotations of any known problems which were found. Still trying to figure out the practicalities of hosting this though...

**debjit_ray** · 10-24-2013, 10:21 AM

FASTQC on my small RNA sequences identifies several overrepresented sequences. It might be because of the adapter sequences. I do a trimming for the adapter ('ACTA') using the command
>fastx_clipper -C -v -i SRR519779.fastq -Q 33 -a ACTA -o SRR519779_trimmed.fastq
The out put for this is:
Clipping Adapter: ACTA Min. Length: 5 Clipped reads - discarded. Input: 4484151 reads. Output: 4440775 reads. discarded 0 too-short reads. discarded 0 adapter-only reads. discarded 0 clipped reads. discarded 43376 N reads.

Seems there is no effect of this trimming, the FASTQC shows similar results on the trimmed sequence.
Am I doing something wrong? Please suggest.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Fastqc results small RNA run

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News