Dear all,

I am still on the learning curve with the GATK tools, but I encountered an error at the duplicate-marking step with Picard. The procedure I followed was this:
I generated a BAM file for each sample using TopHat 1.33 and reordered each BAM file against the hg19 reference genome using Picard's ReorderSam.jar. After that I added read-group information using Picard's AddOrReplaceReadGroups.jar. Then I tried to mark pair duplicates using Picard's MarkDuplicates.jar, but I ran into an error at this step and failed to generate the duplicate-marked files.
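The commands I ran were of roughly this form (a sketch, not my exact invocations; paths are shortened and the read-group values shown are placeholders):

```shell
# Reorder each TopHat BAM so its contigs match the hg19 reference dictionary
java -jar ReorderSam.jar \
    INPUT=accepted_hits.bam \
    OUTPUT=sorted_GP.bam \
    REFERENCE=hg19.fasta

# Attach read-group information (RG values below are placeholders)
java -jar AddOrReplaceReadGroups.jar \
    INPUT=sorted_GP.bam \
    OUTPUT=sorted_GP.rg.bam \
    RGID=1 RGLB=lib1 RGPL=illumina RGPU=unit1 RGSM=sample1

# Mark pair duplicates -- this is the step that fails
java -jar MarkDuplicates.jar \
    INPUT=sorted_GP.rg.bam \
    OUTPUT=marked.bam \
    METRICS_FILE=metrics
```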
The error I received is the following:
[Thu Feb 16 15:06:56 EST 2012] net.sf.picard.sam.MarkDuplicates
INPUT=[/media/FreeAgent GoFlex
Drive/RNAseq-coloncancer/LID46437/tophat_out/sorted_GP.bam]
OUTPUT=/media/FreeAgent GoFlex
Drive/RNAseq-coloncancer/LID46437/tophat_out/marked.bam
METRICS_FILE=/media/FreeAgent GoFlex
Drive/RNAseq-coloncancer/LID46437/tophat_out/metrics
REMOVE_DUPLICATES=false ASSUME_SORTED=false
MAX_SEQUENCES_FOR_DISK_READ_ENDS_MAP=50000
MAX_FILE_HANDLES_FOR_READ_ENDS_MAP=8000
SORTING_COLLECTION_SIZE_RATIO=0.25
READ_NAME_REGEX=[a-zA-Z0-9]+:[0-9]:([0-9]+):([0-9]+):([0-9]+).*
OPTICAL_DUPLICATE_PIXEL_DISTANCE=100 VERBOSITY=INFO QUIET=false
VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5
MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
[Thu Feb 16 15:06:56 EST 2012] Executing as
slowsmile@slowsmile-HP-xw8600-Workstation on Linux 3.0.0-15-generic
amd64; OpenJDK 64-Bit Server VM 1.7.0_147-icedtea-b147; Picard
version: 1.60(1086)
INFO 2012-02-16 15:06:56 MarkDuplicates Start of doWork freeMemory:
124147272; totalMemory: 125698048; maxMemory: 1866006528
INFO 2012-02-16 15:06:56 MarkDuplicates Reading input file and
constructing read end information.
INFO 2012-02-16 15:06:56 MarkDuplicates Will retain up to 7404787 data
points before spilling to disk.
INFO 2012-02-16 15:07:14 MarkDuplicates Read 1000000 records. Tracking
129157 as yet unmatched pairs. 6550 records in RAM. Last sequence
index: 0
INFO 2012-02-16 15:07:19 MarkDuplicates Read 2000000 records. Tracking
136196 as yet unmatched pairs. 9506 records in RAM. Last sequence
index: 0
INFO 2012-02-16 15:07:24 MarkDuplicates Read 3000000 records. Tracking
190648 as yet unmatched pairs. 61032 records in RAM. Last sequence
index: 0
INFO 2012-02-16 15:07:29 MarkDuplicates Read 4000000 records. Tracking
144992 as yet unmatched pairs. 9135 records in RAM. Last sequence
index: 0
INFO 2012-02-16 15:07:34 MarkDuplicates Read 5000000 records. Tracking
180193 as yet unmatched pairs. 36398 records in RAM. Last sequence
index: 0
INFO 2012-02-16 15:07:39 MarkDuplicates Read 6000000 records. Tracking
186193 as yet unmatched pairs. 35242 records in RAM. Last sequence
index: 0
[Thu Feb 16 15:07:42 EST 2012] net.sf.picard.sam.MarkDuplicates done.
Elapsed time: 0.78 minutes.
Runtime.totalMemory()=1352466432
Exception in thread "main" net.sf.picard.PicardException: Value was
put into PairInfoMap more than once. 1:
HT29.LANE1:HWI-ST978:1370AHMACXX:5:1207:8810:84360
at net.sf.picard.sam.CoordinateSortedPairInfoMap.ensureSequenceLoaded(CoordinateSortedPairInfoMap.java:124)
at net.sf.picard.sam.CoordinateSortedPairInfoMap.remove(CoordinateSortedPairInfoMap.java:78)
at net.sf.picard.sam.DiskReadEndsMap.remove(DiskReadEndsMap.java:61)
at net.sf.picard.sam.MarkDuplicates.buildSortedReadEndLists(MarkDuplicates.java:343)
at net.sf.picard.sam.MarkDuplicates.doWork(MarkDuplicates.java:122)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:177)
at net.sf.picard.sam.MarkDuplicates.main(MarkDuplicates.java:106)
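From what I can tell, MarkDuplicates keys paired reads by read name plus the first/second-of-pair flag, so this error would suggest the same key occurs twice in my input, for example if TopHat reported multiple alignments for the same read end. As a rough check (my own stdlib sketch, not part of Picard), I counted repeated (name, end) keys over SAM-format lines:

```python
from collections import Counter

def count_duplicate_name_ends(sam_lines):
    """Return (read_name, end) keys that occur more than once.

    MarkDuplicates keys paired reads by read name plus the
    first/second-of-pair flag, so a key appearing twice would
    trigger the "Value was put into PairInfoMap more than once" error.
    """
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):           # skip SAM header lines
            continue
        fields = line.rstrip("\n").split("\t")
        name, flag = fields[0], int(fields[1])
        if not flag & 0x1:                 # unpaired read: not relevant here
            continue
        end = 1 if flag & 0x40 else 2      # first vs. second of pair
        counts[(name, end)] += 1
    return {key: n for key, n in counts.items() if n > 1}

# Minimal example: "read1" has two first-of-pair records (flag 99),
# as a multi-mapping aligner might emit.
sam = [
    "@HD\tVN:1.0",
    "read1\t99\tchr1\t100\t60\t50M\t=\t200\t150\tACGT\tFFFF",
    "read1\t147\tchr1\t200\t60\t50M\t=\t100\t-150\tACGT\tFFFF",
    "read1\t99\tchr1\t500\t0\t50M\t=\t600\t150\tACGT\tFFFF",
]
print(count_duplicate_name_ends(sam))  # {('read1', 1): 2}
```

In practice I would feed it the real alignments (e.g. the text output of samtools view on sorted_GP.bam); a nonzero result would confirm that the TopHat output contains repeated read ends.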
I read the log carefully but cannot figure out the source of the error. What does "Value was put into PairInfoMap more than once" mean here? Can you help me resolve this problem?

Thanks a lot.