Unconfigured Ad

**GenoMax** · 04-21-2017, 07:14 AM

What kind of analysis are you trying to do? In general I have never worried about k-mer warnings from FastQC.

**Vinn** · 04-21-2017, 07:17 AM

Originally posted by GenoMax View Post

What kind of analysis are you trying to do? In general I have never worried about k-mer warnings from FastQC.

Hi GenoMax, thanks for your reply. I would like to do de novo assembly.

**GenoMax** · 04-21-2017, 07:43 AM

Take a look at @Brian's suggestions in this thread. I have provided a link for a specific post but take a look at the whole thread. He should be along with more later.

**Vinn** · 04-21-2017, 07:48 AM

Thank you, I will read the thread through.

**Brian Bushnell** · 04-24-2017, 10:16 AM

Kmer-content spikiness at the beginning of the read is normal for many fragmentation methodologies and should not be removed. I'm not sure what's going on at the end, though...

**Vinn** · 04-25-2017, 06:48 AM

Thanks for your reply Brian. Just to be on a safe side, do you think it is better to trim the end off?

**Brian Bushnell** · 04-25-2017, 09:58 AM

Excessive trimming reduces accuracy, and will degrade the results of any experiment. If you want to be confident that bases are genomic rather than artificial, I suggest you follow this methodology:

1) Map the reads to the reference (if you don't have a reference, you can make a quick assembly with Tadpole) with BBMap like this:

Code:

bbmap.sh in=reads.fq ref=ref.fa mhist=mhist.txt qhist=qhist.txt

2) Plot mhist with R or Excel with a log-scale Y-axis to look at the positional error rates.

If there is not an increased error rate in a region of the read, there is no reason to trim it. And conversely, it is prudent to trim if there is a high error rate at one end or the other.

**Vinn** · 04-26-2017, 01:56 PM

Thanks so much Brian for your advice. I will try as you suggested.

Topics	Statistics	Last Post
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, Today, 05:37 AM	0 responses 5 views 0 reactions	Last Post by SEQadmin2 Today, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 49 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 109 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM

Unconfigured Ad

K-mer content failed on 5' end - advice needed

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News