If one has demultiplexed dual index reads on a MiSeq AND included "adaptor removal" as part of the demultiplexing (on instrument), should one also run the FASTQ files through an adaptor removal programme, or is this just overkill?
Unconfigured Ad
Collapse
X
-
@cement_head: If there are no remaining adapters then all you lost is some time. For miseq datasets you would need less than 30 min to scan/trim data with bbduk.sh from BBMap. You can then be sure that there would be no extraneous sequences remaining in your data. Especially important if you were doing any de novo work.
Comment
-
-
I suggest fastp to do automatic adapter trimming, read filtering and quality control. fastp is developed in C++ with multi-threading support, it's ultra-fast.
fastp has following features:
1, filter out bad reads (too low quality, too short, or too many N...)
2, cut low quality bases for per read in its 5' and 3' by evaluating the mean quality from a sliding window (like Trimmomatic but faster).
3, trim all reads in front and tail
4, cut adapters. Adapter sequences can be automatically detected,which means you don't have to input the adapter sequences to trim them.
5, correct mismatched base pairs in overlapped regions of paired end reads, if one base is with high quality while the other is with ultra low quality
6, preprocess unique molecular identifer (UMI) enabled data, shift UMI to sequence name.
7, report JSON format result for further interpreting.
8, visualize quality control and filtering results on a single HTML page (like FASTQC but faster and more informative).
9, split the output to multiple files (0001.R1.gz, 0002.R1.gz...) to support parallel processing. Two modes can be used, limiting the total split file number, or limitting the lines of each split file.
10, support long reads (data from PacBio / Nanopore devices).
fastp creates reports in both HTML and JSON format.
HTML report: http://opengene.org/fastp/fastp.html
JSON report: http://opengene.org/fastp/fastp.json
fastp is an open source project at github: https://github.com/OpenGene/fastpOpenGene(Libraries and tools for NGS data analysis),AfterQC(Fastq Filtering and QC)
FusionDirect.jl( Detect gene fusion), SeqMaker.jl(Next Generation Sequencing simulation)
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
13 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
24 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
28 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
22 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
Comment