  • KaiYe
    Senior Member
    • Jun 2009
    • 133

    Pindel: improved version for indels and structural variants

    Hi all,

    I have just put an improved version of Pindel on my website, https://trac.nbic.nl/pindel/, together with a wiki, a mailing list, and a user manual.

    Instructions for using it starting from BWA mapping are provided. You can use it to detect indels and SVs at single-base resolution from SLX paired-end short reads.

    Currently, deletions of 1 bp to 1 Mbp and insertions of 1 bp to (read length - 20) bp can be detected. You can also find non-template insertions within deletions.

    I am working on inversions and large insertions, as well as on using Pindel for RNA-Seq data.
    Please comment on Pindel and suggest additional functions.

    Kai
    [email protected]
    Last edited by KaiYe; 10-28-2011, 07:26 AM.
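
As a rough illustration of the size limits quoted in the post above, the arithmetic can be written out as follows. This is only a sketch of the stated ranges; the function and constant names are made up for the example, and nothing here calls Pindel or reflects its internals.

```python
# Size limits as quoted in the post; hypothetical names, not part of Pindel.
MIN_EVENT_BP = 1
MAX_DELETION_BP = 1_000_000   # "1M bp" in the post
INSERTION_MARGIN_BP = 20      # insertions go up to (read length - 20) bp


def max_insertion_bp(read_length: int) -> int:
    """Largest insertion size (bp) inside the quoted range for this read length."""
    return max(read_length - INSERTION_MARGIN_BP, 0)


def in_quoted_range(kind: str, size_bp: int, read_length: int) -> bool:
    """Check whether an event size falls inside the ranges quoted in the post."""
    if kind == "deletion":
        return MIN_EVENT_BP <= size_bp <= MAX_DELETION_BP
    if kind == "insertion":
        return MIN_EVENT_BP <= size_bp <= max_insertion_bp(read_length)
    raise ValueError(f"unknown event kind: {kind!r}")


if __name__ == "__main__":
    # With 36 bp reads, the quoted insertion range tops out at 36 - 20 bp.
    print(max_insertion_bp(36))
    print(in_quoted_range("insertion", 17, 36))
```

The point of the example is simply that the insertion limit scales with read length, while the deletion limit is a fixed window size.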
  • sdvie
    Member
    • Jul 2010
    • 68

    #2
    Hi Kai,

    Just trying out pindel for the first time...one question on the bam2pindel step: in the user manual, the input for this script is described as "aln.NameSorted.MateFixed.bam". Does this mean I have to do something additional to the bam generated by samtools? If yes, what?

    Thanks,
    Sophia

    • KaiYe
      Senior Member
      • Jun 2009
      • 133

      #3
      Hi Sophia,

      If you generate bam from sam directly after mapping with BWA, you don't have to do anything else.

      Kai
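
For anyone who wants to check their own file, here is a minimal sketch of one possible sanity check. It assumes that "NameSorted" in the manual's file name means the two mates of a pair sit next to each other in the file, which the thread does not spell out. The function name is hypothetical; the input is plain SAM text, for example the output of `samtools view -h`.

```python
def mates_are_adjacent(sam_lines):
    """Return True if all records sharing a read name are contiguous.

    Header lines (starting with '@') and blank lines are skipped. A read
    name that reappears after a different name has been seen means the
    file is not grouped by read name.
    """
    seen = set()
    previous = None
    for line in sam_lines:
        if not line.strip() or line.startswith("@"):
            continue
        name = line.split("\t", 1)[0]  # QNAME is the first SAM column
        if name != previous:
            if name in seen:
                return False
            seen.add(name)
            previous = name
    return True


if __name__ == "__main__":
    grouped = [
        "@SQ\tSN:1\tLN:1000",
        "read1\t99\t1\t100\t60\t36M\t=\t300\t236\t*\t*",
        "read1\t147\t1\t300\t60\t36M\t=\t100\t-236\t*\t*",
        "read2\t99\t1\t500\t60\t36M\t=\t700\t236\t*\t*",
        "read2\t147\t1\t700\t60\t36M\t=\t500\t-236\t*\t*",
    ]
    print(mates_are_adjacent(grouped))
```

A coordinate-sorted file will usually fail this check, because mates are separated by other reads that map between them.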

      • sdvie
        Member
        • Jul 2010
        • 68

        #4
        Originally posted by KaiYe
        Hi Sophia,

        If you generate bam from sam directly after mapping with BWA, you don't have to do anything else.

        Kai
        cool, thanks.

        • raela
          Member
          • Apr 2010
          • 39

          #5
          I must be missing something.. it isn't producing any output for me, but it also isn't giving an error. I'm trying to convert my BAM file to the pindel format:

          [heather@frankie (Mon Jul 26 13:33:04)]% bam2pindel.pl -i aln.sort.pindel.bam -o out.pindel -s retina -om -pi 150
          [heather@frankie (Mon Jul 26 13:41:02)]% ls
          . aln.sort.fix.bam.bai horse_genome_v2_all.fa.ann horse_genome_v2_all.fa.rsa
          .. aln.sort.pindel.bam horse_genome_v2_all.fa.bwt horse_genome_v2_all.fa.sa
          align.sort.bam aln_read1.sai horse_genome_v2_all.fa.fai tag_trim_6_1.fq
          aln.bam aln_read2.sai horse_genome_v2_all.fa.pac tag_trim_6_2.fq
          aln.sam horse_genome_v2_all.fa horse_genome_v2_all.fa.rbwt
          aln.sort.fix.bam horse_genome_v2_all.fa.amb horse_genome_v2_all.fa.rpac

          • wuhoucdc
            Member
            • Oct 2009
            • 14

            #6
            Hi Kai,

            Can Pindel call large structural variants (>1M) now?

            Thanks.

            Wu
            Last edited by wuhoucdc; 08-27-2010, 02:55 PM.

            • tinacai
              Member
              • Apr 2010
              • 18

              #7
              Dear Kai Ye,
              I've used the Pindel software recently. I have heard that you have released a new version of the software. Could you please give me the link?




              Best,
              Cong chen
              Wenzhou Medical College

              • KaiYe
                Senior Member
                • Jun 2009
                • 133

                #8
                Originally posted by tinacai
                Dear Kai Ye,
                I've used the Pindel software recently. I have heard that you have released a new version of the software. Could you please give me the link?




                Best,
                Cong chen
                Wenzhou Medical College
                Hi Cong Chen,

                I believe I have already sent you my latest version of Pindel for testing. Have you experienced any problems using it?

                Kai

                • KaiYe
                  Senior Member
                  • Jun 2009
                  • 133

                  #9
                  Originally posted by wuhoucdc
                  Hi Kai,

                  Can Pindel call large structural variants (>1M) now?

                  Thanks.

                  Wu
                  Pindel can detect variants of any size as long as they are not inter-chromosomal events. The only thing I worry about is speed: the runtime is linear in the maximum SV size.

                  I am currently testing a new version with the following additional functions:
                  1. Allow sequence errors/SNPs in the same reads containing INDELs/SVs
                  2. non-template sequence in deletions
                  3. inversions
                  4. tandem duplications
                  5. breakpoints of large insertions

                  Please send me an email to ask for it if you want to test it.

                  Cheers,

                  Kai

                  • KaiYe
                    Senior Member
                    • Jun 2009
                    • 133

                    #10
                    Originally posted by raela (post #5)
                    Would you please send me your email address? I have C++ code to extract reads from SAM files for Pindel.

                    Thanks.

                    • jtjli
                      Member
                      • Nov 2008
                      • 21

                      #11
                      help

                      Hi KaiYe

                      I'm having a problem running Pindel. Here's what I've done:
                      1) Downloaded all files from http://www.ebi.ac.uk/~kye/pindel/v_0.2.0/
                      2) Ran bam2pindel.pl on one paired-end sample (aligned using BWA). My BAM file is sorted, but it does not have the header expected by your program, so I used -om to force the script to run.
                      A number of files were generated, e.g. myprefix.1.txt (chr1).
                      3) Then I tried running pindel_x86_64, but I got this error message: ./pindel_x86_64: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.9' not found (required by ./pindel_x86_64)
                      4) I tried upgrading some packages on my Red Hat Linux system, but the error remained.
                      5) I then downloaded your source code from SourceForge (with svn) and compiled Pindel from scratch. It seems to work.
                      6) I find the "-i" parameter confusing, as it says "-i, --config-file: the bam file later to be a config file;" in the script but "Input: the unmapped reads in a modified fastq format" in your PowerPoint manual.
                      7) I assumed -i refers to the files generated by bam2pindel.pl, so I tested the command on some chromosomes, e.g.
                      pindel_64 -f hg19.fasta -i myprefix.1.txt -o otherprefix -c 1 -b empty.txt
                      8) But whichever chromosome I try, I always get "There are no reads for this chromosome":

                      Processing chromosome 1
                      Processing chromosome 2
                      Skip the rest of chromosomes.
                      1 249250621 269250621
                      26926 10000
                      BreakDancer events: 0
                      There are no reads for this chromosome.


                      What have I done wrong?

                      my email is jason.li @ petermac.org

                      Thanks
                      Jason
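
For the `GLIBCXX_3.4.9' not found` error in step 3 above, one way to see what the installed library actually offers is to list the `GLIBCXX_*` version tags embedded in it. The sketch below does roughly what `strings /usr/lib64/libstdc++.so.6 | grep GLIBCXX` would do; the function names are hypothetical, and it only diagnoses the mismatch. It does not fix it (compiling from source, as in step 5, is what worked in this thread).

```python
import re
from pathlib import Path

# Matches version tags such as GLIBCXX_3.4 or GLIBCXX_3.4.9 in raw bytes.
_VERSION_TAG = re.compile(rb"GLIBCXX_(\d+(?:\.\d+)*)")


def glibcxx_versions(library_path):
    """Return the GLIBCXX version numbers found in a shared library, sorted."""
    data = Path(library_path).read_bytes()
    found = {m.group(1).decode("ascii") for m in _VERSION_TAG.finditer(data)}
    # Sort numerically so that 3.4.10 comes after 3.4.9.
    return sorted(found, key=lambda v: tuple(int(p) for p in v.split(".")))


def provides(library_path, wanted):
    """True if the library lists the wanted version, e.g. '3.4.9'."""
    return wanted in glibcxx_versions(library_path)


if __name__ == "__main__":
    lib = "/usr/lib64/libstdc++.so.6"  # path taken from the error message
    if Path(lib).exists():
        print(glibcxx_versions(lib))
        print(provides(lib, "3.4.9"))
```

If the required version is missing from the list, the prebuilt binary was linked against a newer libstdc++ than the one installed.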

                      • rwenang
                        Member
                        • Jan 2009
                        • 31

                        #12
                        Hi Kaiye,

                        Interesting tool you've got there. Have you published the method? I am curious about one thing: say you have one read. You grow the pattern until you can no longer get a match, then you look for the rest of the read within the next 1-1M bp. What if there are several matches in that 1-1M bp region? Which one do you use, and what criteria do you use to choose it?

                        • KaiYe
                          Senior Member
                          • Jun 2009
                          • 133

                          #13
                          Originally posted by rwenang
                          Hi Kaiye,

                          Interesting tool you've got there. Have you published the method? I am curious about one thing: say you have one read. You grow the pattern until you can no longer get a match, then you look for the rest of the read within the next 1-1M bp. What if there are several matches in that 1-1M bp region? Which one do you use, and what criteria do you use to choose it?
                          Yes, Pindel has been published (http://www.ncbi.nlm.nih.gov/pubmed/19561018) and it was awarded best paper at ISMB 2009 Special Interest Group on Short Read Sequencing.


                          Only unique hits are considered here.
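
To make the question and answer above concrete, here is a toy sketch of the idea as described in this exchange: grow a prefix of the read until it stops matching, then look for the rest of the read downstream, and give up unless each part has exactly one hit. It uses naive string search on a small in-memory reference and is not Pindel's actual implementation; all function names and the example sequences are invented for illustration.

```python
def find_all(text, pattern, start, end):
    """Start positions of pattern lying entirely within text[start:end]."""
    hits = []
    pos = text.find(pattern, start, end)
    while pos != -1:
        hits.append(pos)
        pos = text.find(pattern, pos + 1, end)
    return hits


def longest_unique_prefix(reference, read, start, end):
    """Grow the read prefix until it no longer matches; keep the last unique hit."""
    best = None
    for length in range(1, len(read) + 1):
        hits = find_all(reference, read[:length], start, end)
        if not hits:
            break  # the pattern cannot grow any further
        if len(hits) == 1:
            best = (hits[0], length)
    return best


def call_deletion(reference, read, window_start, max_event):
    """Return (breakpoint, deleted_length) or None if any part is not unique."""
    window_end = min(len(reference), window_start + len(read) + max_event)
    anchor = longest_unique_prefix(reference, read, window_start, window_end)
    if anchor is None:
        return None
    pos, length = anchor
    rest = read[length:]
    if not rest:
        return None  # the whole read matches, so there is no split
    left_end = pos + length
    search_end = min(len(reference), left_end + max_event + len(rest))
    hits = find_all(reference, rest, left_end, search_end)
    if len(hits) != 1:
        return None  # zero or several candidate positions: skip this read
    gap = hits[0] - left_end
    return (left_end, gap) if gap > 0 else None


if __name__ == "__main__":
    reference = "AAGATTACAGCTCTCATGGCGTACC"
    read = "TTACAGATGGCG"  # spans a 5 bp deletion of "CTCTC"
    print(call_deletion(reference, read, 0, 10))  # (10, 5)
    # Appending a second copy of the read's right half makes the hit ambiguous.
    print(call_deletion(reference + "ATGGCG", read, 0, 20))  # None
```

The second call shows the behaviour described in the reply: when the remainder of the read matches at more than one place inside the search window, the read is simply not used.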

                          • KaiYe
                            Senior Member
                            • Jun 2009
                            • 133

                            #14
                            Originally posted by jtjli (post #11)
                            I will send you my source code via email.

                            • Fabrice ODEFREY
                              Member
                              • May 2010
                              • 21

                              #15
                              Hi KaiYe,

                              I'm working with SOLiD data and would like to use Pindel, but I couldn't find anything about SOLiD support. Is Pindel only for Illumina data?
                              Thanks in advance for your reply.
                              Fabrice
