Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • vcf2beagle.jar error "cannot find int file"

    I am using vcf2beagle.jar to convert vcf files to beagle format. In the documentation it says:
    Three files will be created with extensions:".markers", ".bgl.gz", and ".int".

    However the programme crashes saying that it cannot find the int file. Other programmes manage to write to the folder so I assume it is not a permissions problem. I created an empty version of the missing file using touch and it gives the same error message. My full command and error message is below.

    I would be most grateful for any suggestions

    Harry

    zcat /pub7/harry/tgen/vcf/Chr18.batch1+2+3.beagle.phased.vcf.gz | java -jar vcf2beagle.jar missing=? prefix=/pub7/harry/tgen/vcf/Chr18.batch1+2+3.beagle.phased
    java.io.FileNotFoundException: prefix=/pub7/harry/tgen/vcf/Chr18.batch1+2+3.beagle.phased.int (No such file or directory)
    at java.io.FileOutputStream.open(Native Method)
    at java.io.FileOutputStream.<init>(FileOutputStream.java:221)
    at java.io.FileWriter.<init>(FileWriter.java:107)
    at blbutil.FileUtil.printWriter(FileUtil.java:341)
    at blbutil.FileUtil.printWriter(FileUtil.java:320)
    at blbutil.FileUtil.printWriter(FileUtil.java:269)
    at vcf.Vcf2Beagle.main(Vcf2Beagle.java:49)
    Error opening prefix=/pub7/harry/tgen/vcf/Chr18.batch1+2+3.beagle.phased.int
    terminating program.

  • #2
    Pure speculation - I wonder if the program does not like the "+" signs in your directory path? Is it creating the other two files?

    Comment


    • #3
      Thanks GenoMax, it was an interesting thought but replacing the "+" with "." made no difference.
      It has not created the other files either. I suspect that .int is just the first one that it got to.

      Comment


      • #4
        prefix= should point to the directory where you want the result files to go. Can you try prefix=/pub7/harry/tgen/vcf/ (or some other directory) ?

        Comment


        • #5
          I thought for a minute that you had cracked it, but:
          zcat /pub7/harry/tgen/vcf/Chr18.batch1.2.3.beagle.phased.vcf.gz | java -jar vcf2beagle.jar missing=? prefix=/pub7/harry/tgen/vcf/
          java.io.FileNotFoundException: prefix=/pub7/harry/tgen/vcf/.int (No such file or directory)
          at java.io.FileOutputStream.open(Native Method)
          at java.io.FileOutputStream.<init>(FileOutputStream.java:221)
          at java.io.FileWriter.<init>(FileWriter.java:107)
          at blbutil.FileUtil.printWriter(FileUtil.java:341)
          at blbutil.FileUtil.printWriter(FileUtil.java:320)
          at blbutil.FileUtil.printWriter(FileUtil.java:269)
          at vcf.Vcf2Beagle.main(Vcf2Beagle.java:49)
          Error opening prefix=/pub7/harry/tgen/vcf/.int
          terminating program.

          Comment


          • #6
            How about this: prefix=/pub7/harry/tgen/vcf/test

            Comment


            • #7
              Thanks for your perseverance but no luck with that either:
              zcat /pub7/harry/tgen/vcf/Chr18.batch1.2.3.beagle.phased.vcf.gz | java -jar vcf2beagle.jar missing=? prefix=/pub7/harry/tgen/vcf/test
              java.io.FileNotFoundException: prefix=/pub7/harry/tgen/vcf/test.int (No such file or directory)
              at java.io.FileOutputStream.open(Native Method)
              at java.io.FileOutputStream.<init>(FileOutputStream.java:221)

              Comment


              • #8
                That is odd indeed. I am looking at the help for vcf2beagle and that should have worked.

                Since we are grasping at straws now .. can you ungzip the file and feed it to vcf2beagle using cat (instead of zcat)?

                Is there an example file I can grab from the net to try this?

                Comment


                • #9
                  Thanks for trying so hard with this.
                  Unzipping with bgzip made no difference.
                  Last edited by cowman; 01-27-2016, 07:30 AM.

                  Comment


                  • #10
                    @cowman: It appears that program does not like ? (or *) as value for missing. If I do this

                    Code:
                    $ zcat Chr18.batch1.2.3.beagle.phased.vcf.gz | java -jar vcf2beagle.jar ANY_CHAR ./test
                    the program works and makes the three files.

                    Comment


                    • #11
                      Thanks a lot GenoMax.
                      It is working when I omit the parameter names and the question mark.
                      I thought the question mark was the beagle default for missing values. I am sure that I have used it before. I wonder if the script has been modified?

                      Thanks again for your help

                      Comment


                      • #12
                        Great. Remember to remove the file (and the link) from post #9.

                        Comment


                        • #13
                          Thanks. Links and files gone.
                          I have just fiddled a bit more and find that I can still use a ? as a missing value. It was the parameter names that was causing the trouble. I will write to Beagle HQ to mention this problem. At least it was not only me.

                          Comment

                          Latest Articles

                          Collapse

                          • seqadmin
                            Essential Discoveries and Tools in Epitranscriptomics
                            by seqadmin




                            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                            04-22-2024, 07:01 AM
                          • seqadmin
                            Current Approaches to Protein Sequencing
                            by seqadmin


                            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                            04-04-2024, 04:25 PM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by seqadmin, Yesterday, 11:49 AM
                          0 responses
                          15 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 04-24-2024, 08:47 AM
                          0 responses
                          16 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 04-11-2024, 12:08 PM
                          0 responses
                          62 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 04-10-2024, 10:19 PM
                          0 responses
                          60 views
                          0 likes
                          Last Post seqadmin  
                          Working...
                          X