Seqanswers Leaderboard Ad

**GenoMax** · 02-21-2016, 01:08 PM

Take a look at Sabre: https://github.com/najoshi/sabre otherwise you could split the samples first before doing the adapter trimming.

**sells78** · 02-22-2016, 03:53 AM

Hi genomax - appreciate the response.

could you possibly expand on what you mean by splitting them?

Many thanks

Jamie

**GenoMax** · 02-22-2016, 04:06 AM

@Jamie: You have inline barcodes at beginning of read 2, which I presume will be used to separate the multiplexed samples?

I was saying that you could use that information to first bin your R1/R2 reads into sample pools and then trim them afterwards as separate pools. At that point you could trim the barcode using fixed length trimming (followed by barcode trimming, if necessary) as a two pass operation.

**sells78** · 02-22-2016, 05:17 AM

Hi genomax,
so samples have already been demultiplexed into libraries of 24 individuals and we have 30 libraries each with separate read 1 and read 2 files. I'm trying to run adapter removal on the 60 total files individually to remove the barcode/adapter read through at the end of read 2's, where the order of components on the read is wanted genomic sequence - restriction site - 8 bp barcode - adapter sequence.

**GenoMax** · 02-22-2016, 05:42 AM

I see. So you don't want to separate the 24 individuals further as discriminated by that inline barcode?

Any trimming should be done with both pairs of files (so in case a read gets dropped from read 2 then the corresponding read would be removed from read 1 keeping the order of reads in R1/R2 files in sync).

I am not a regular cutadapt user but I can think of how you could do this with bbduk.sh. You could add all possible combinations of restriction site and the 8bp barcode (is that one per individual) in a file (or even to the adapters.fa file in the "resources" directory and then use that as input to scan against your data.

Let me know if I am still missing something.

**Jessica_L** · 02-22-2016, 12:07 PM

Your solution sounds right to me, GenoMax. I think with cutadapt, you can specifiy different files for either a forward or reverse primer/adapter sequence, but the idea is the same. I use cutadapt with two fasta files for processing one of our sample types since the kit vendor does the same thing and I'm comparing data with them.

bbduk is on my list of things to investigate this year, though.

**sells78** · 03-04-2016, 08:17 AM

Hi Geno and Jessica,

Apologies for the late thanks for your inputs into the query - very much apreciated.

It seems the issue I've had is the default match in cutadapt is 3 between sequence and barcode and so obviously 3 N's match with every sequence

. So the answer i smuch as you suggets but with increased match values to reduce random 3mer match

Best to you both

I've experimented with barcodes as a file and I think the answer will be to forget about using N's, use the multiple barcodes in a file and increase the default match value

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 39 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 41 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

cutadapt and barcode sequences

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News