![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Structural Variants | jkozubek | Bioinformatics | 4 | 08-22-2016 08:38 AM |
Calling structural variants (CNVs) with single-end reads | agwe | Genomic Resequencing | 5 | 01-18-2016 08:30 AM |
Strangely high proportion of variants are INDELs | PeteH | Bioinformatics | 2 | 05-09-2012 01:13 AM |
Calling structural variants from capture data | Heisman | Bioinformatics | 3 | 04-16-2012 08:01 AM |
cufflinks 1.2.0 version got me significantly different results than the old version | slowsmile | Bioinformatics | 9 | 02-01-2012 02:26 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]()
hi all
Just put an improved Pindel on my website https://trac.nbic.nl/pindel/ with wiki, mail list, user manual. An instruction using it from BWA mapping is provided. You can use it to detect indels and SVs at single-base resolution from SLX paired-end short reads. Currently 1bp-1M bp deletions and 1bp-(read length -20)bp insertions can be detected. You can also find events of non-template insertion in deletions. I am working on inversions and large insertions as well as using pindel for RNA-Seq data. Please comment on Pindel and suggest additional functions. Kai k.ye@lumc.nl Last edited by KaiYe; 10-28-2011 at 08:26 AM. |
![]() |
![]() |
![]() |
#2 |
Member
Location: Spain Join Date: Jul 2010
Posts: 68
|
![]()
Hi Kai,
Just trying out pindel for the first time...one question on the bam2pindel step: in the user manual, the input for this script is described as "aln.NameSorted.MateFixed.bam". Does this mean I have to do something additional to the bam generated by samtools? If yes, what? Thanks, Sophia |
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]()
Hi Sophia,
If you generate bam from sam directly after mapping with BWA, you don't have to do anything else. Kai |
![]() |
![]() |
![]() |
#4 |
Member
Location: Spain Join Date: Jul 2010
Posts: 68
|
![]() |
![]() |
![]() |
![]() |
#5 |
Member
Location: Ithaca, NY Join Date: Apr 2010
Posts: 39
|
![]()
I must be missing something.. it isn't producing any output for me, but it also isn't giving an error. I'm trying to convert my BAM file to the pindel format:
[heather@frankie (Mon Jul 26 13:33:04)]% bam2pindel.pl -i aln.sort.pindel.bam -o out.pindel -s retina -om -pi 150 [heather@frankie (Mon Jul 26 13:41:02)]% ls . aln.sort.fix.bam.bai horse_genome_v2_all.fa.ann horse_genome_v2_all.fa.rsa .. aln.sort.pindel.bam horse_genome_v2_all.fa.bwt horse_genome_v2_all.fa.sa align.sort.bam aln_read1.sai horse_genome_v2_all.fa.fai tag_trim_6_1.fq aln.bam aln_read2.sai horse_genome_v2_all.fa.pac tag_trim_6_2.fq aln.sam horse_genome_v2_all.fa horse_genome_v2_all.fa.rbwt aln.sort.fix.bam horse_genome_v2_all.fa.amb horse_genome_v2_all.fa.rpac |
![]() |
![]() |
![]() |
#6 |
Member
Location: Nashville Join Date: Oct 2009
Posts: 14
|
![]()
Hi Kai,
Can Pintel call large structural variants (>1M) now? Thanks. Wu Last edited by wuhoucdc; 08-27-2010 at 03:55 PM. |
![]() |
![]() |
![]() |
#7 |
Member
Location: china Join Date: Apr 2010
Posts: 18
|
![]()
Dear Kai Ye,
I've used the pindel software recently. I have heared that you have published a new version software, will you please give me the linkage please. Best, Cong chen Wenzhou Medical College |
![]() |
![]() |
![]() |
#8 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
It seems to me that I have sent you my latest Pindel for test. Have you experienced any problems in using it? Kai |
|
![]() |
![]() |
![]() |
#9 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
I am currently testing a new version with the following additional functions: 1. Allow sequence errors/SNPs in the same reads containing INDELs/SVs 2. non-template sequence in deletions 3. inversions 4. tandem duplications 5. breakpoints of large insertions Please send me an email for ask for it in case you want to test it. Cheers, Kai |
|
![]() |
![]() |
![]() |
#10 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
Thanks. |
|
![]() |
![]() |
![]() |
#11 |
Member
Location: australia Join Date: Nov 2008
Posts: 21
|
![]()
Hi KaiYe
I'm having problem running Pindel. Here's what I've done: 1) Download all files from http://www.ebi.ac.uk/~kye/pindel/v_0.2.0/ 2) ran bam2pindel.pl on one paired-end samples (aligned using BWA). My bam file is sorted but it does not have the header expected by your program, so i used the -om to force the script to run. a number of files is generated: e.g. myprefix.1.txt (chr1) 3) then I tried running pindel_x86_64, but i then got this error message: ./pindel_x86_64: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.9' not found (required by ./pindel_x86_64) 4) i tried upgrading some packages in my redhat linux, but still the same. 5) i then downloaded your source code from sourceforge (with svn) and compiled your pindel from scratch. It seems to work. 6) I find the "-i" parameter confusing as it says "-i, --config-file: the bam file later to be a config file;" in the script but "Input: the unmapped reads in a modified fastq format" in your powerpoint manual. 7) I assumed -i refers to the files generated by bam2pindel.pl, so i tested the command on some chromosomes. E.g. pindel_64 -f hg19.fasta -i myprefix.1.txt -o otherprefix -c 1 -b empty.txt 8) but whichever chromosome i try, i always get "There are no reads for this chromosome": Processing chromosome 1 Processing chromosome 2 Skip the rest of chromosomes. 1 249250621 269250621 26926 10000 BreakDancer events: 0 There are no reads for this chromosome. What have i done wrong? my email is jason.li @ petermac.org Thanks Jason |
![]() |
![]() |
![]() |
#12 |
Member
Location: Singapore Join Date: Jan 2009
Posts: 31
|
![]()
Hi Kaiye,
interesting tool you got there. anyway, have you publish the method? I am curious about one thing, say you have 1 read, you will grow the pattern until you cannot get a match, then you find the rest of the read within the next 1-1M bps. What if there are several matches in the 1-1M bps region, which one do you use and what kind of consideration do you use to choose it? |
![]() |
![]() |
![]() |
#13 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
Only unique hit will be considered here. |
|
![]() |
![]() |
![]() |
#14 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
#15 |
Member
Location: Melbourne Join Date: May 2010
Posts: 21
|
![]()
Hi KaiYe,
I'm working with SOLiD data...and would like to use Pindel but couldn't find anything about it. is Pindel only for Illumina data? thanks in advance for your reply. Fabrice |
![]() |
![]() |
![]() |
#16 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
I don't have a procedure with SOLiD data but would explore this together with you. First you need to convert the data from color space to sequence space. Second, convert the sequence to the correct strand. Pindel assume the data is paired-end as illumina so that the reads are facing each other rather than on the same strand. You may then try my sam2pindel.cpp to extract reads and run Pindel. Please visit https://trac.nbic.nl/pindel and register as a Pindel user. Kai |
|
![]() |
![]() |
![]() |
#17 |
Member
Location: Melbourne Join Date: May 2010
Posts: 21
|
![]()
thanks a lot Kai for your quick reply.
for the second step is there a tool to do that? thanks again! Fabrice |
![]() |
![]() |
![]() |
#18 | |
Senior Member
Location: amsterdam Join Date: Jun 2009
Posts: 133
|
![]() Quote:
You may need to write a script to do that. Kai |
|
![]() |
![]() |
![]() |
#19 |
Member
Location: Melbourne Join Date: May 2010
Posts: 21
|
![]()
alright, that's what I thought, thanks!
Fabrice |
![]() |
![]() |
![]() |
#20 |
Member
Location: Spain Join Date: Jun 2010
Posts: 56
|
![]()
Hi KaiYe,
I am having the same problem as jtjli (http://seqanswers.com/forums/showthr...0820#post30820). I did as follows: 1) Download all files from http://www.ebi.ac.uk/~kye/pindel/v_0.2.0/. I aligned with BWA, processed with samtools and filtered by MAPQ quality (<30). 2) ran bam2pindel.pl on one paired-end samples (aligned using BWA). My bam file is sorted and duplicates are removed but it does not have the header expected by your program, so i used the -om to force the script to run. A file for each chromosome was generated: e.g. myprefix.1.txt (chr1) 3) I downloaded your source code from sourceforge (with svn) and compiled your pindel from scratch. It seems to work. 4) I run the following comand /home/Pindel_source_v0.2.2/pindel -f /home/hg19.fa -i /s_4_QC_sort_pind.bam_chr1.txt -o ./s4 -c chr1 empty but whichever chromosome i try, i always get "There are no reads for this chromosome": BreakDancer events: 0 Processing chromosome: chr10 Skipping chromosome: chr10 ... Processing chromosome: chr1 Chromosome Size: 249250621 26926 10000 Looking at chromosome chr1 bases 0 to 10000000. BinBorder 0 10000000 There are no reads for this bin. Looking at chromosome chr1 bases 10000000 to 20000000. BinBorder 10000000 20000000 There are no reads for this bin. .... Loading genome sequences and reads: 0 seconds. Mining, Sorting and output results: 0 seconds. What I am doing wrong? How did you solve jtjli's problem? Last edited by chariko; 05-10-2011 at 12:36 AM. Reason: Incomplete |
![]() |
![]() |
![]() |
Tags |
pindel |
Thread Tools | |
|
|