Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • baohua100
    Senior Member
    • Jun 2008
    • 103

    SOAP2 -v doesn't work

    SOAP2 -v maximum number of mismatches allowed on a read

    I have tried -v 3 -v 2 -v 4 for 50bp read, but the maximum number of mismatch in the output is always 2. there are no read wish mismatch 3 or more.



    my command:
    soap -a reads/PT0012.3_1.fastq -b reads/PT0012.3_2.fastq -D genome.fa.index -m 0 -x 500000 -u 3.unmap -o 3.paired -2 3.unpaired -v 3 -r 2 -p 2




    Usage: soap [options]
    -a <str> query a file, *.fq, *.fa
    -b <str> query b file
    -D <str> reference sequences indexing table, *.index format
    -o <str> output alignment file(txt)
    -M <int> match mode for each read or the seed part of read, which shouldn't contain more than 2 mismaches, [4]
    0: exact match only
    1: 1 mismatch match only
    2: 2 mismatch match only
    4: find the best hits
    -u <str> output unmapped reads file
    -t output reads id instead reads name, [none]
    -l <int> align the initial n bps as a seed [256] means whole length of read
    -n <int> filter low-quality reads containing >n Ns before alignment, [5]
    -r [0,1,2] how to report repeat hits, 0=none; 1=random one; 2=all, [1]
    -m <int> minimal insert size allowed, [400]
    -x <int> maximal insert size allowed, [600]
    -2 <str> output file of unpaired alignment hits
    -v <int> maximum number of mismatches allowed on a read. [5] bp
    -s <int> minimal alignment length (for soft clip) [255] bp
    -g <int> one continuous gap size allowed on a read. [0] bp
    -R for long insert size of pair end reads RF. [none](means FR pair)
    -e <int> will not allow gap exist inside n-bp edge of a read, default=5
    -p <int> number of processors to use, [1]

    -h this help
  • sediener
    Junior Member
    • Aug 2009
    • 3

    #2
    You have to use it with some combination of -M # . I'm not sure why but that's what I've been told.

    - Steve

    Comment

    • baohua100
      Senior Member
      • Jun 2008
      • 103

      #3
      -M <int> match mode for each read or the seed part of read, which shouldn't contain more than 2 mismaches, [4]
      0: exact match only
      1: 1 mismatch match only
      2: 2 mismatch match only
      4: find the best hits

      what‘ that mean? If i want to find alignment with 3 mismatch, how to set the parameter M?
      Last edited by baohua100; 08-26-2009, 08:32 PM.

      Comment

      • sediener
        Junior Member
        • Aug 2009
        • 3

        #4
        Sorry I can't help more, a co-worker told me of that difference btw SOAP and SOAP2 so that's about all I know. I'll ask him later today to see if he's figured out more of the rules.

        It would be nice if there was a better standard for documentation of Bioinformatics apps. Seems like the only way to use some of these programs correctly is to contact the author.

        Comment

        • karthikprabu
          Junior Member
          • Sep 2009
          • 6

          #5
          I would like to know
          Last edited by karthikprabu; 11-25-2009, 07:45 AM.

          Comment

          • Insane
            Junior Member
            • Oct 2009
            • 3

            #6
            Hello everybody,

            Does anybody knows how to deal with that problem?
            I would like to work with 4 mismatches per read and I don't know how to do...

            Comment

            • zhongj
              Member
              • Dec 2011
              • 13

              #7
              I have the same problem,have anyone figured it out? pls let me know.Thanks!

              Comment

              Latest Articles

              Collapse

              • GATTACAT
                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by GATTACAT
                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                Today, 11:43 AM
              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Yesterday, 05:37 AM
              0 responses
              7 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              17 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              51 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              110 views
              0 reactions
              Last Post SEQadmin2  
              Working...