**nilshomer** · 04-02-2012, 03:42 PM

See the warning message for a better solution.

**oliviajm** · 04-02-2012, 11:22 PM

Thank you for answering so soon.
I've seen the warning, but my problem is no longer about the running time. I would like to understand why it stops working when I add an option.

I ran another test. I ran SRMA with other options and got an error again:

[Mon Apr 02 16:37:57 CEST 2012] srma.SRMA INPUT=[blabla.bfast.allBest.sort.bam.onTarget.bam] OUTPUT=[blabla_SRMArealigned_MHS100000_O100_MTC10000_MMQ10.bam] REFERENCE=hg19-ordre-valide.fa OFFSET=100 MIN_MAPQ=10 MAXIMUM_TOTAL_COVERAGE=10000 MAX_HEAP_SIZE=100000 MAX_QUEUE_SIZE=32768 MINIMUM_ALLELE_PROBABILITY=0.1 MINIMUM_ALLELE_COVERAGE=3 CORRECT_BASES=false USE_SEQUENCE_QUALITIES=true QUIET_STDERR=false GRAPH_PRUNING=false NUM_THREADS=1 TMP_DIR=/tmp/olivia VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
Allele coverage cutoffs:
coverage: 1 minimum allele coverage: 0
coverage: 2 minimum allele coverage: 0
coverage: 3 minimum allele coverage: 0
coverage: 4 minimum allele coverage: 1
coverage: 5 minimum allele coverage: 1
coverage: 6 minimum allele coverage: 1
coverage: 7 minimum allele coverage: 2
coverage: 8 minimum allele coverage: 2
coverage: 9 minimum allele coverage: 3
coverage: >9 minimum allele coverage: 3
Records processsed: 1151528 (last chr1:115260225-115260274)java.lang.Exception: SAMRecord contig does not match the current reference sequence contig
at srma.Graph.addSAMRecord(Graph.java:54)
at srma.SRMA$GraphThread.run(SRMA.java:708)
Please report bugs to [email protected]

What bothers me is why it behaves so differently with the same input file. I would understand some differences in running time, but I don't get why it reports "SAMRecord contig does not match the current reference sequence contig" when the input file and the reference file stay the same.

I will run it again with RANGE to see if that solves the problem.

**oliviajm** · 04-10-2012, 06:33 AM

Hello again,

I noticed that I get an error every time I run several SRMA jobs on the same input file at the same time, with different options and different output files.
Is it possible that the error comes from several SRMA processes working on the same input file at the same time? I thought it would not be a problem, because SRMA only reads the input file and does not modify it, but now I'm wondering whether the error is related.

**nilshomer** · 04-10-2012, 07:19 AM

What version are you using? Can you give me a small test case (just a few SAM records) that reproduces the error? Can you try just running it on one chromosome at a time?

**oliviajm** · 04-11-2012, 12:30 AM

I'm using srma-0.1.15.jar.

You can download a file containing the chr1 lines of the SAM file I'm using here: http://dl.free.fr/vcBPNsKpn
SRMA crashed at this point on my last try (Records processsed: 1152262 (last chr1:115260225-115260274)).

I have tried running SRMA on one chromosome at a time, and it worked. But I find this approach more complicated, and it needs more steps and therefore more time.
Thanks for spending time on this issue.

**colindaven** · 04-11-2012, 03:56 AM

Why not write a Perl or shell/grep script to split your file by chromosome and run SRMA on each one?
I don't know how easy it is to recombine the output, though.
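
A minimal sketch of that split / realign / recombine idea, written in Python rather than Perl or shell. It only builds the command lines and does not run them. Several things here are assumptions and not taken from this thread: the helper name `per_chromosome_commands`, the output file names, the `java -jar` invocation of the SRMA jar, and the use of `samtools view` with a region (which needs a coordinate-sorted, indexed BAM) and `samtools merge` for recombining. Only the INPUT, OUTPUT and REFERENCE options come from the log above.

```python
from pathlib import Path


def per_chromosome_commands(bam, reference, chromosomes,
                            jar="srma-0.1.15.jar", workdir="."):
    """Build (but do not run) commands to split a BAM by chromosome,
    realign each piece with SRMA, and merge the realigned pieces.

    Hypothetical helper: file names and the 'java -jar' call are assumptions.
    """
    workdir = Path(workdir)
    commands = []
    realigned = []
    for chrom in chromosomes:
        part = workdir / f"{chrom}.bam"
        out = workdir / f"{chrom}.srma.bam"
        # Extract one chromosome; region queries need an indexed, sorted BAM.
        commands.append(["samtools", "view", "-b", "-o", str(part), str(bam), chrom])
        # Realign that piece; any other SRMA options would be appended here.
        commands.append(["java", "-jar", jar,
                         f"INPUT={part}", f"OUTPUT={out}", f"REFERENCE={reference}"])
        realigned.append(str(out))
    # Recombine the per-chromosome outputs into a single BAM.
    commands.append(["samtools", "merge", str(workdir / "merged.srma.bam"), *realigned])
    return commands


if __name__ == "__main__":
    for cmd in per_chromosome_commands("input.bam", "hg19-ordre-valide.fa",
                                       ["chr1", "chr2"]):
        print(" ".join(cmd))
```

Each inner list could be passed to `subprocess.run(cmd, check=True)` once the commands look right for your setup. Keeping command construction separate from execution makes it easy to inspect what would be run first.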

**oliviajm** · 04-13-2012, 01:18 AM

Hi,

I have run some other tests, with and without the option MINIMUM_ALLELE_PROBABILITY=1, and I'm not sure what it does.

When the minimum allele probability is 1 instead of the default value of 0.1, does that mean more bases will be considered, because those with a probability less than 1 are included? Or should MINIMUM_ALLELE_PROBABILITY be seen as a threshold, so that when I increase it, fewer bases are considered?

**nilshomer** · 04-13-2012, 09:06 PM

Basically, this is trying to determine, if the coverage is X, the minimum number of times you have to see the variant allele. See the AlleleCoverageCutoffs class for the exact computation. Also check out the "minimum edge probability" in the paper: http://dx.doi.org/10.1186/gb-2010-11-10-r99.
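
To make the "coverage X, minimum count" idea concrete, here is a small Python sketch of one model that reproduces the "Allele coverage cutoffs" table printed in the log earlier in this thread (MINIMUM_ALLELE_PROBABILITY=0.1, MINIMUM_ALLELE_COVERAGE=3). It assumes a binomial model with allele probability 0.5 and takes the smallest count whose lower-tail probability reaches the threshold, capped at the minimum allele coverage. This is an assumption that happens to fit the logged numbers, not a transcription of the AlleleCoverageCutoffs class; the function name and parameters are hypothetical.

```python
from math import comb


def min_allele_coverage(coverage, min_prob=0.1, max_cutoff=3):
    """Smallest k with P(X <= k) >= min_prob for X ~ Binomial(coverage, 0.5),
    capped at max_cutoff. Assumed model, chosen to match the logged table."""
    total = 2 ** coverage          # number of equally likely outcomes at p = 0.5
    cumulative = 0
    for k in range(coverage + 1):
        cumulative += comb(coverage, k)
        if cumulative >= min_prob * total:
            return min(k, max_cutoff)
    return min(coverage, max_cutoff)


if __name__ == "__main__":
    for n in range(1, 10):
        print(f"coverage: {n} minimum allele coverage: {min_allele_coverage(n)}")
```

Under this assumed model the option acts as a threshold: raising `min_prob` can only raise the required count, and with `min_prob=1.0` the cutoff becomes `min(coverage, max_cutoff)`, so a variant allele has to be seen more often before it is considered.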

SRMA problem with NUM_THREADS

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News