**JackieBadger** · 10-17-2012, 11:03 AM

Use Trimmomatic... much more versatile and mate-pair aware. Or just use the trim function in text manipulation in Galaxy.

**sklages** · 10-17-2012, 11:07 AM

Why don't you ask your sequencing provider to demultiplex the data for you? It is not more work for them, as it is part of the fastq processing; you just need to provide some kind of sample description, a samplesheet.

All our customers are quite happy that this work has already been done when they get their data :-)

Sven

**JackieBadger** · 10-17-2012, 01:25 PM

BEWARE! You must always be able to check any processing a provider does for you. For example, the trimming script in the MiSeq software should be avoided, as it is VERY promiscuous, even in the "remedied" latest update. I have also come across significant errors in the MiSeq demultiplexing.
I would do everything with a program where you set the parameters yourself and know what is going in and what should come out.
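To illustrate what "set the parameters and know what is going in and what should come out" can look like, here is a minimal sketch of a do-it-yourself demultiplexer. It assumes the barcodes sit at the very start of each read, all have the same length, and must match exactly; the function names and the barcode-to-sample mapping are hypothetical and not taken from Trimmomatic, Galaxy, or QIIME.

```python
from collections import defaultdict
from itertools import islice


def read_fastq(lines):
    """Yield (header, sequence, quality) tuples from an iterable of FASTQ lines."""
    it = iter(lines)
    while True:
        record = [line.rstrip("\n") for line in islice(it, 4)]
        if len(record) < 4:  # end of input (or a truncated final record)
            return
        header, seq, _plus, qual = record
        yield header, seq, qual


def demultiplex(records, barcode_to_sample):
    """Bin reads by an exact-match leading barcode and trim the barcode off.

    Reads whose prefix matches no barcode go to "undetermined", untrimmed.
    """
    # Assumption: every barcode has the same length.
    lengths = {len(barcode) for barcode in barcode_to_sample}
    if len(lengths) != 1:
        raise ValueError("all barcodes must have the same length")
    (n,) = lengths

    bins = defaultdict(list)
    for header, seq, qual in records:
        sample = barcode_to_sample.get(seq[:n])
        if sample is None:
            bins["undetermined"].append((header, seq, qual))
        else:
            # Trim the barcode from both the bases and the quality string.
            bins[sample].append((header, seq[n:], qual[n:]))
    return dict(bins)
```

Because unmatched reads are kept in their own bin, you can see how many reads were lost and why, which is exactly the kind of check that is hard to do on provider-processed data.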

**newBioinfo** · 10-17-2012, 01:40 PM

Originally posted by JackieBadger

Use Trimmomatic... much more versatile and mate-pair aware. Or just use the trim function in text manipulation in Galaxy.

Thanks JackieBadger,
I will try it!

**newBioinfo** · 10-17-2012, 01:42 PM

Originally posted by JackieBadger

BEWARE! You must always be able to check any processing a provider does for you. For example, the trimming script in the MiSeq software should be avoided, as it is VERY promiscuous, even in the "remedied" latest update. I have also come across significant errors in the MiSeq demultiplexing.
I would do everything with a program where you set the parameters yourself and know what is going in and what should come out.

Thanks JackieBadger,
I am planning to use QIIME, so hopefully I will not encounter such issues.

Thanks for the help!!!

**newBioinfo** · 10-17-2012, 01:44 PM

Originally posted by sklages

Why don't you ask your sequencing provider to demultiplex the data for you? It is not more work for them, as it is part of the fastq processing; you just need to provide some kind of sample description, a samplesheet.

All our customers are quite happy that this work has already been done when they get their data :-)

Sven

Thanks Sven,
It would be good if they demultiplexed the data before sending it, but in my case they did not.

**sklages** · 10-17-2012, 11:52 PM

Originally posted by JackieBadger

BEWARE! You must always be able to check any processing a provider does for you. For example, the trimming script in the MiSeq software should be avoided, as it is VERY promiscuous, even in the "remedied" latest update. I have also come across significant errors in the MiSeq demultiplexing.
I would do everything with a program where you set the parameters yourself and know what is going in and what should come out.

Hmm, it's pretty easy to check what the provider does if you also have the "Undetermined_indices" data files. MiSeq is another matter ... the trimming issue is known, and the trimming should not be used (currently). You could also ask for some (demultiplexing) stats to see whether the results are "good" or as expected.
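One rough way to run such a check yourself is to tally the index sequences in the undetermined reads. The sketch below assumes CASAVA 1.8-style headers, where the index sequence is the last colon-separated field after the space (for example `... 1:N:0:ATCACG`); the function name is hypothetical, and older header formats would need different parsing.

```python
from collections import Counter


def index_counts(fastq_lines):
    """Count index sequences found at the end of FASTQ header lines."""
    counts = Counter()
    for i, line in enumerate(fastq_lines):
        if i % 4 != 0:  # only every fourth line is a header
            continue
        # Assumed layout: "@<instrument fields> <read>:<filtered>:<control>:<index>"
        _, _, comment = line.strip().partition(" ")
        index = comment.rsplit(":", 1)[-1] if ":" in comment else ""
        counts[index or "(none)"] += 1
    return counts
```

Calling `index_counts(...).most_common(10)` on an undetermined file shows whether one of your expected indices dominates the "undetermined" reads, which would point to a samplesheet or demultiplexing problem rather than sequencing noise.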

If you don't trust your sequencing provider at all, you should look for another one ;-)

What "significant errors" did you encounter in the MiSeq demultiplexing?
We are not multiplexing MiSeq libs, so I am just curious :-)

Sven

**NextGenSeq** · 10-18-2012, 07:08 AM

Are you sure they did an index read? The names of your files say No_Index. The only time we ever get data named like this is if an index read wasn't done.

**sklages** · 10-18-2012, 08:59 AM

Originally posted by NextGenSeq

Are you sure they did an index read? The names of your files say No_Index. The only time we ever get data named like this is if an index read wasn't done.

No, you'll always get that naming when no index is specified in the samplesheet for that run (irrespective of whether an index read was run).
You cannot safely deduce from the naming whether an index read was run or not (at least for the "_NoIndex_" case).
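For anyone sorting files by name, here is a small sketch that pulls the index field out of a file name. It assumes the CASAVA 1.8-style pattern `<sample>_<index>_L<lane>_R<read>_<set>.fastq.gz`; the regular expression and function name are hypothetical. Note that the parsed value only reflects what was entered in the samplesheet, not whether an index read was actually run.

```python
import re

# Assumed naming pattern: <sample>_<index or NoIndex>_L<lane>_R<read>_<set>.fastq[.gz]
FASTQ_NAME = re.compile(
    r"^(?P<sample>.+)_(?P<index>NoIndex|[ACGTN]+(?:-[ACGTN]+)?)"
    r"_L(?P<lane>\d{3})_R(?P<read>\d)_(?P<chunk>\d{3})\.fastq(?:\.gz)?$"
)


def parse_fastq_name(name):
    """Return the name's fields as a dict, or None if the pattern does not match."""
    match = FASTQ_NAME.match(name)
    return match.groupdict() if match else None
```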

Sven

**GenoMax** · 10-18-2012, 11:38 AM

Originally posted by sklages

No, you'll always get that naming when no index is specified in the samplesheet for that run (irrespective of whether an index read was run).
You cannot safely deduce from the naming whether an index read was run or not (at least for the "_NoIndex_" case).

Sven

Let us hope that is the case. If the facility did not run this as a multiplex sample then OP is out of luck. This run will have to be repeated.

**sklages** · 10-18-2012, 12:04 PM

Originally posted by GenoMax

Let us hope that is the case. If the facility did not run this as a multiplex sample then OP is out of luck. This run will have to be repeated.

Sure, you are absolutely right.

This problem might arise if the customer doesn't mention any indices that need to be demultiplexed in their "order" (whatever this order looks like), maybe assuming that this is relevant only for the post-processing and not for the sequencing run itself.
... and the sequencing core organizes their FCs with respect to read length and MP/no MP ...

We had a similar post a while ago, where the OP had hand-written a little note on the "order sheet" and as a result the sequencing core didn't recognize it as "please do an index read, as my libraries have indices" ...

Sven

**LVAndrews** · 12-04-2012, 03:26 PM

Originally posted by newBioinfo

Thanks JackieBadger,
I am planning to use QIIME, so hopefully I will not encounter such issues.

Thanks for the help!!!

If you are using QIIME, then you will have the option to remove unwanted sequences (such as barcodes) during the split_libraries_fastq.py step. See http://qiime.org/scripts/split_libraries_fastq.html for more information.

**LVAndrews** · 12-04-2012, 03:27 PM

Originally posted by AKrohn

If you are using QIIME, then you will have the option to remove unwanted sequences (such as barcodes) during the split_libraries_fastq.py step. See http://qiime.org/scripts/split_libraries_fastq.html for more information.

Oops, for this application you want the split_libraries.py script instead.


Demultiplex Illumina reads
