Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • CNVnator histogram run for too long

    Hello!

    It's my first time using CNVnator. After creating the .root file I did the following:
    ./cnvnator -root file.root -chrom groupI -his 100
    to generate a histogram. The program seems to be running fine at the beginning, and give out such output:
    Warning in <TUnixSystem::SetDisplay>: DISPLAY not set, setting it to minint-612e653.************
    Allocating memory ...
    Done.
    Calculating histograms with bin size of 100 for 'groupI' ...
    Making directory bin_100 ...
    Making GC histogram for 'groupI' ...

    and then it has been running more than 10 hours without any further respond...

    Could someone please tell me where the problem could happen?
    Thank you very much!!

  • #2
    Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc


    This was my experience with a similar issue.

    Comment


    • #3
      Originally posted by vivek_ View Post
      http://seqanswers.com/forums/showthread.php?t=21888

      This was my experience with a similar issue.
      Thank you very much! I didn't know there has to be linebreaks in the fasta.

      Comment


      • #4
        Yeah its kind of trivial but that's how the software reads the fasta files. If you use the reference from the UCSC download directly, it works perfectly well.

        When you get the results please let me know if the MAPQ0 value in the output varies with values other than 1 and -1. I haven't been able to figure this out yet.

        Comment


        • #5
          Originally posted by vivek_ View Post
          Yeah its kind of trivial but that's how the software reads the fasta files. If you use the reference from the UCSC download directly, it works perfectly well.

          When you get the results please let me know if the MAPQ0 value in the output varies with values other than 1 and -1. I haven't been able to figure this out yet.
          I'll let you know.
          And now I'm also trying to use another package for CNV calling, it's a R package called "cn.mops", which seem to be much easier to use. I managed to get some results in one day.

          Comment


          • #6
            Originally posted by vivek_ View Post
            Yeah its kind of trivial but that's how the software reads the fasta files. If you use the reference from the UCSC download directly, it works perfectly well.

            When you get the results please let me know if the MAPQ0 value in the output varies with values other than 1 and -1. I haven't been able to figure this out yet.
            It's always 1 for me. I don't even have a -1...

            Comment


            • #7
              Yeah that was my issue as well with the software. The source code is a little hard to read since it uses quite a bit of root modules as well.

              They describe the results in the CNVnator paper by doing a filtering on the Mapq0 value and this is kind of important for de novo CNV detection, which is my interest.

              Comment


              • #8
                so how did you go about creating line breaks in fast (what does mean anyway)? I'm getting the same error ...its stops from "Making histogram.."

                cnvnator -root PT0113-C.7009.45.bam.root -d . -his 100
                Allocating memory ...
                Done.
                Calculating histograms with bin size of 100 for 'Pf3D7_01_v3' ...
                Making GC histogram for 'Pf3D7_01_v3' ...

                Comment


                • #9
                  Line breaks simply means each line in the fasta file is limited to 60 bases. You could just have Bio-Perl module read and print the chromosomes in your fasta file in the proper format.

                  Comment


                  • #10
                    Thanks, I corrected that. But now I get a Maximum buffer size exceeded! Any idea how to correct it?
                    Calculating histograms with bin size of 1000 for 'Pf3D7_05_v3' ...
                    Making GC histogram for 'Pf3D7_05_v3' ...
                    Maximum buffer size exceeded.

                    Comment


                    • #11
                      Originally posted by elisold View Post
                      Thanks, I corrected that. But now I get a Maximum buffer size exceeded! Any idea how to correct it?
                      Calculating histograms with bin size of 1000 for 'Pf3D7_05_v3' ...
                      Making GC histogram for 'Pf3D7_05_v3' ...
                      Maximum buffer size exceeded.
                      Hello, I also encounter this problem, so did you solve it, and I found this error would influence our -genome step.

                      So if you solved, pelease let me know, thanks a lot!

                      Comment


                      • #12
                        Originally posted by vivek_ View Post
                        Line breaks simply means each line in the fasta file is limited to 60 bases. You could just have Bio-Perl module read and print the chromosomes in your fasta file in the proper format.
                        Hi, I've met this problem,too. It does not work after doing line-breaking by fold command in bash. Can you help me with that? Thanks in advance!

                        Comment


                        • #13
                          Originally posted by elisold View Post
                          Thanks, I corrected that. But now I get a Maximum buffer size exceeded! Any idea how to correct it?
                          Calculating histograms with bin size of 1000 for 'Pf3D7_05_v3' ...
                          Making GC histogram for 'Pf3D7_05_v3' ...
                          Maximum buffer size exceeded.
                          Hi! did you ever resolve this issue?
                          I only just started using cnvnator and am coming across the exact same message. Any advice would be much appreciated!

                          Comment


                          • #14
                            Originally posted by Alicia B View Post
                            Hi! did you ever resolve this issue?
                            I only just started using cnvnator and am coming across the exact same message. Any advice would be much appreciated!
                            Still getting this error with the latest version? I haven't seen this one in particular but it could be that you don't have enough memory for the job. Can you run the commands with a smaller test data set?

                            Comment


                            • #15
                              Originally posted by SES View Post
                              Still getting this error with the latest version? I haven't seen this one in particular but it could be that you don't have enough memory for the job. Can you run the commands with a smaller test data set?
                              Hi SES,

                              Thanks for your advice. You were right I wasn't allocating enough memory to run the job, it's now running smoothly.

                              Thanks for your help!

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM
                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              17 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              22 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 09:21 AM
                              0 responses
                              16 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-04-2024, 09:00 AM
                              0 responses
                              46 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X