Unconfigured Ad
Collapse
X
-
I had a very similar problem which was very helpfully fixed using Trimmomatic and TrimGalore as detailed in this thread:
The author of TrimGalore was particularly accommodating in modifying the script to allow different trimming of R1 and R2.
Comment
-
-
Does discarding the size estimate affect anything with the read data, the quality, or any potential variant calls?
I am trying to determine if I should use the -A option for all of my data or if there is a way to dynamically determine that sampe will take forever and the -A option should be used.
Thanks.
Comment
-
-
Originally posted by rskr View PostI have seen it when one of the pairs was quality filtered but the other then it gets replaced with whatever was next in the file so, it not longer matches.
1.1 1.2
2.1 2.2
3.1 3.2
4.1 5.2 <--4.2 was omitted, they are no longer in parity.
5.1 6.2
I have a question regarding using the -A option in the case above. If the reads are out of sync, as is the case between 4.1 and 5.2, bwa will not perform SW on the unmapped mate. What happens after that? will 5.1 and 6.2 be thrown away also bc they do not match...etc? I guess what I am asking is, is it dangerous to use -A and force bwa to throw away unmatched pairs. Are we losing important data by doing this? And is the mismatch something that carries on to all the reads after the mismatch?
Comment
-
-
-A should really only be used if you know that your files are lined up right, and you know that the insert sizes won't properly match what bwa is expecting.
Fix your fastqs. You can pull out the singletons, align them separately, then combine the bams.
Comment
-
-
Thank you so much for the advice. As tempting as it is to use -A as a quick solution, I am not completely comfortable with the idea because I don't completely understand what is being tossed away: the true "orphaned" read, or a read that does have a mate but simply does not line up correctly with its mate due to the presence of these singleton "orphans".Originally posted by swbarnes2 View Post-A should really only be used if you know that your files are lined up right, and you know that the insert sizes won't properly match what bwa is expecting.
Fix your fastqs. You can pull out the singletons, align them separately, then combine the bams.
I am looking for more details on this but havent found it yet. If anyone can confirm that only the true singletons are ignored, then I guess -A would be a good solution. In the meantime, I think barnes' advice is the safest.Last edited by dGho; 07-11-2013, 05:43 AM.
Comment
-
-
Does anyone know how to pull out the singletons from paired end fastqs separated into two fastq (read1.fastq and read2.fastq)? I haven't found a tool that does this yet. Is this something I should write a script for?Originally posted by swbarnes2 View Post-A should really only be used if you know that your files are lined up right, and you know that the insert sizes won't properly match what bwa is expecting.
Fix your fastqs. You can pull out the singletons, align them separately, then combine the bams.
Comment
-
-
Could you describe a method for identifying singletons between one read.fq file and its mate? Thanks.Originally posted by swbarnes2 View Post-A should really only be used if you know that your files are lined up right, and you know that the insert sizes won't properly match what bwa is expecting.
Fix your fastqs. You can pull out the singletons, align them separately, then combine the bams.
Comment
-
-
Any results with files out of parity are invalid(in addition to being a waste of time waiting for the results). If the files are in parity, and the mate doesn't map, is a different question.Originally posted by dGho View PostI have a question regarding using the -A option in the case above. If the reads are out of sync, as is the case between 4.1 and 5.2, bwa will not perform SW on the unmapped mate. What happens after that? will 5.1 and 6.2 be thrown away also bc they do not match...etc? I guess what I am asking is, is it dangerous to use -A and force bwa to throw away unmatched pairs. Are we losing important data by doing this? And is the mismatch something that carries on to all the reads after the mismatch?
Comment
-
-
answering my own question, but if anyone else is looking for a way remove singletons, check out this thread. I am trying this out now. azneto shared his script for making sure that two fastqs are in sync. It seems to use a whole lot of ram though
Comment
-
-
I just wanted to confirm that azneto's script worked well. It removed singletons and ordered the two fastq files so reads were synchronized. Running bwa sampe on the resulting fastqs produced no errors and had runtimes that feel within the expected range
.
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, Today, 11:58 AM
|
0 responses
9 views
0 reactions
|
Last Post
by SEQadmin2
Today, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
25 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
34 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
56 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
Comment