Unconfigured Ad

**ginolhac** · 04-02-2012, 01:04 AM

Hello,

I had the same issue, does someone has any clues about this?

Thanks,

**ginolhac** · 04-02-2012, 07:52 AM

I am answering myself,
it was due to fake read mapped with bwa such as:
(null) 73 chr21 48313514 25 0M = 48313514 0 * * XT:A:U NM:i:0 SM:i:25 AM:i:0 X0:i:1 X1:i:0 XM:i:1 XO:i:0 XG:i:0 MD:Z:0
(null) 73 chr21 48313514 25 0M = 48313514 0 * * XT:A:U NM:i:0 SM:i:25 AM:i:0 X0:i:1 X1:i:0 XM:i:1 XO:i:0 XG:i:0 MD:Z:0
(null) 65 chr21 48313514 25 0M chr18 18626503 0 * * XT:A:U NM:i:0 SM:i:25 AM:i:25 X0:i:1 X1:i:0 XM:i:1 XO:i:0 XG:i:0 MD:Z:0

(null) was found more than twice and MarkDuplicates complained. By increasing the mapping quality to 26 we can get rid of them or using samtools view -f 0x2 since they are not properly paired.

**shawpa** · 05-01-2012, 09:44 AM

I am running into the same error with picardmarkduplicates. My alignment was done with bowtie2. I have run this script before on different data sets and didn't see this error. Since you figured out what was wrong with your data I was hoping you could let me know how you did that. Here's the error I get.

Exception in thread "main" net.sf.picard.PicardException: Value was put into PairInfoMap more than once. 1: L3:MWR-PRG-0014:74:C0E94ACXX:3:1206:11809:158670

**upendra_35** · 07-09-2012, 02:00 PM

Hi ginolhac,

I encountered the same problem as you when i tried to use MarkDuplicate command and when i looked at the problematic read i found that the Mapping Quality of those two reads were more than 25. Then how do we remove those reads? Thanks in advance for your help......

**ginolhac** · 07-09-2012, 11:00 PM

He,

actually the issue came from fastq files that were not in sync. Some reads were missing at the end of one of the file. That explained those reads with a (null) name.
To remove those, I used:

Code:

samtools view -h file.bam | grep -v null | samtools view -bS - > file_clean.bam

hope this helps

**JezSupreme** · 04-17-2013, 07:23 PM

I encountered the same error using picard tools MarkDuplicates and it was related to the alignment I had done using BWA (BWA MEM).

I had failed to use the -M option when running the alignment which enables compatibility with picard-tools MarkDuplicates function. I went back and re-ran the alignment with that option and it fixed the error.

From the BWA manual site:
-M Mark shorter split hits as secondary (for Picard compatibility).

**bwubb** · 07-10-2013, 07:21 AM

I have been struggling with this issue. I have sample data merged from a Illumina PE runs. When trying to find other information/solutions it was suggested to modify the read group ID to include lane or run identification and then re-merge.

I have done that, but I still receive this error. Has anyone been able to resolve this issue? I could try to remove the offending read, but Im concerned there will be many more after.

**Heisman** · 07-10-2013, 07:37 AM

Originally posted by bwubb View Post

I have been struggling with this issue. I have sample data merged from a Illumina PE runs. When trying to find other information/solutions it was suggested to modify the read group ID to include lane or run identification and then re-merge.

I have done that, but I still receive this error. Has anyone been able to resolve this issue? I could try to remove the offending read, but Im concerned there will be many more after.

What is the actual cause of your problem? In this thread there were different causes posted (ie, some fastq files with lines truncated or using bwa without -M).

**bwubb** · 07-10-2013, 07:50 AM

Ah I am having issues with:

Code:

Exception in thread "main" net.sf.picard.PicardException: Value was put into PairInfoMap more than once.  1: E0005-FGC0298:HWI-ST970:298:C0MUAACXX:4:1201:13786:41745
	at net.sf.picard.sam.CoordinateSortedPairInfoMap.ensureSequenceLoaded(CoordinateSortedPairInfoMap.java:124)
	at net.sf.picard.sam.CoordinateSortedPairInfoMap.remove(CoordinateSortedPairInfoMap.java:78)
	at net.sf.picard.sam.DiskReadEndsMap.remove(DiskReadEndsMap.java:61)
	at net.sf.picard.sam.MarkDuplicates.buildSortedReadEndLists(MarkDuplicates.java:418)
	at net.sf.picard.sam.MarkDuplicates.doWork(MarkDuplicates.java:161)
	at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:177)
	at net.sf.picard.sam.MarkDuplicates.main(MarkDuplicates.java:145)

Driving me crazy because this is repeat analysis, but adding yet another hi-seq run to it. I use bwa-sw (bwa aln) for alignment. Is it recommended to use bwa-mem instead with the -M option?

EDIT:

There must be something greater at work here. I cannot even run ValidateSamFile without running into this error...

**thedamian** · 09-13-2013, 01:27 AM

Hi All,
I have the same problem with BWA mem. I used -M option but still I get:

Code:

.PicardException: Value was put into PairInfoMap more than once.  1: null:M00840:39:000000000-A5TE9:1:2103:11538:25521

I have tried a trick with

Code:

samtools view -h before.bam | grep -v null | samtools view -bS - > cleaned.bam

but it didn't help me.

With BWA aln everthing is ok, but it's not recommened for my data since reads are ~251 bases long.

Did anyone solve this problem?

**Clown_Bassie** · 09-01-2014, 01:00 AM

Originally posted by thedamian View Post

Hi All,
I have the same problem with BWA mem. I used -M option but still I get:

Code:

.PicardException: Value was put into PairInfoMap more than once.  1: null:M00840:39:000000000-A5TE9:1:2103:11538:25521

I have tried a trick with

Code:

samtools view -h before.bam | grep -v null | samtools view -bS - > cleaned.bam

but it didn't help me.

With BWA aln everthing is ok, but it's not recommened for my data since reads are ~251 bases long.

Did anyone solve this problem?

This is exactly the same issue I'm running into! Does someone has the answer already?

**AdrianP** · 09-03-2014, 08:32 AM

Originally posted by JezSupreme View Post

From the BWA manual site:
-M Mark shorter split hits as secondary (for Picard compatibility).

I am going to try this solution now as I have the same issue.

**zhkzhou** · 11-01-2015, 04:16 AM

JezSupreme and AdrianP are right!
The BWA-MEM algorithm performs local alignment. It may produce multiple primary alignments for different part of a query sequence. This is a crucial feature for long sequences. However, some tools such as Picard’s markDuplicates does not work with split alignments. One may consider to use option -M to flag shorter split hits as secondary.

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, Yesterday, 06:09 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 Yesterday, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 37 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 42 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 49 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Error with MarkDuplicates in Picard

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News