  • how to do reference based de novo assembly using velvet

    As above.

    I currently need to do a de novo assembly, but I don't know how to supply a reference to velveth. : |


    Thanks for your reply!
    Last edited by arkilis; 12-01-2013, 11:01 PM.

  • #2
    I found some tips after googling.

    velveth run 31 -reference PHAC_sample_2.txt -shortPaired -fastq.gz 1_001.fastq.gz 2_001.fastq.gz &
    But I always get an error like:

    Incomplete Sequences file (computeHSPScores)
    Does anyone have any idea about this? Thanks a lot!



    • #3
      I haven't seen that error message before,

      but usually when running velveth either you interleave the files containing R1 and R2, or you specify -separate if the R1 and R2 reads are in separate files.
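The interleaving option mentioned above can be sketched in plain shell. This is a minimal illustration, not a Velvet tool: the function name `interleave_fastq` is hypothetical, and it assumes uncompressed, well-formed FASTQ files with four lines per record and mates in the same order in both files.

```shell
#!/bin/sh
# Interleave two uncompressed FASTQ files record by record (4 lines each).
# Usage: interleave_fastq reads_1.fastq reads_2.fastq > interleaved.fastq
# For .gz input, decompress first (for example with gzip -dc).
interleave_fastq() {
    awk -v mate_file="$2" '
        { print }                       # copy the current R1 line
        NR % 4 == 0 {                   # after each complete R1 record...
            for (i = 0; i < 4; i++)     # ...emit the matching R2 record
                if ((getline line < mate_file) > 0) print line
        }
    ' "$1"
}

# Alternative: keep the two files apart and pass -separate, for example
# (illustrative only, reusing the file names from post #2):
#   velveth run 31 -shortPaired -fastq.gz -separate 1_001.fastq.gz 2_001.fastq.gz
```

Either route should give velveth correctly paired reads; which one is more convenient depends on whether you want to keep an interleaved copy on disk.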



      • #4
        Google for "Velvet Columbus", which is the reference-guided de novo assembly pipeline.

        (Though on the off-chance you get weird results)



        • #5
          Originally posted by arkilis
          Currently need to do a de novo assembly, but don't know how to use a reference in the velveth.
          Then you should probably read the Columbus_manual.pdf which is included in the velvet distribution tar file.

          Basically you're doing it wrong. To do reference-guided assembly in velvet, you first align your reads to the reference with something like bowtie or bwa, and then input the aligned read file (SAM) along with the reference file (FASTA) to velveth. You cannot go directly from reads to reference-guided assembly with velvet; prealignment is required.
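The prealign-then-assemble order described above could look roughly like the sketch below. Everything here is an assumption for illustration: the file names, the k-mer value, the output directory, the choice of bowtie2 as aligner, samtools 1.x syntax, and coordinate sorting of the SAM file (check the Columbus manual for the sort order it actually requires). By default the script only prints the commands; set `DRY_RUN=0` to execute them.

```shell
#!/bin/sh
# Hypothetical placeholders: adjust to your own data.
REF=${REF:-reference.fa}          # reference genome (FASTA)
R1=${R1:-reads_1.fastq.gz}        # forward reads
R2=${R2:-reads_2.fastq.gz}        # reverse reads
K=${K:-31}                        # k-mer (hash) length
OUT=${OUT:-columbus_out}          # velvet working directory
DRY_RUN=${DRY_RUN:-1}             # 1 = only print the commands

# Print a command and, unless DRY_RUN=1, execute it.
run() {
    printf '+ %s\n' "$*"
    [ "$DRY_RUN" = 1 ] || "$@"
}

# Steps, in order:
#   1. index the reference and align the reads to it
#   2. sort the alignments (assumed here: by coordinate)
#   3. give velveth the reference FASTA plus the SAM file, not the FASTQ reads
#   4. run velvetg on the resulting directory
assemble() {
    run bowtie2-build "$REF" ref_index &&
    run bowtie2 -x ref_index -1 "$R1" -2 "$R2" -S aln.sam &&
    run samtools sort -O sam -o aln.sorted.sam aln.sam &&
    run velveth "$OUT" "$K" -reference "$REF" -shortPaired -sam aln.sorted.sam &&
    run velvetg "$OUT"
}

assemble
```

The point of the sketch is the data flow: the FASTQ files go to the aligner, and velveth only ever sees the reference and the alignment file.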



          • #6
            Originally posted by ctseto
            Google for "Velvet Columbus", which is the reference-guided de novo assembly pipeline.

            (Though on the off-chance you get weird results)

            http://bioweb2.pasteur.fr/docs/velve...bus_manual.pdf
            Thanks for your advice.



            • #7
              Originally posted by kmcarr
              Then you should probably read the Columbus_manual.pdf which is included in the velvet distribution tar file.

              Basically you're doing it wrong. To do reference-guided assembly in velvet, you first align your reads to the reference with something like bowtie or bwa, and then input the aligned read file (SAM) along with the reference file (FASTA) to velveth. You cannot go directly from reads to reference-guided assembly with velvet; prealignment is required.
              Hi, I found that if I use the -separate option, it works fine and the error no longer appears. But the N50 is pretty poor (only 100, with k-mer 103). I read the http://bioweb2.pasteur.fr/docs/velve...bus_manual.pdf you mentioned, but I found nothing about doing an alignment before the velvet assembly. Why do you suggest that? Thanks.



              • #8
                Originally posted by arkilis
                I read the http://bioweb2.pasteur.fr/docs/velve...bus_manual.pdf you mentioned, but I found nothing about doing an alignment before the velvet assembly. Why do you suggest that? Thanks.
                On page 2, section 3 "Overview of the process", step 1 says
                Map the reads against a set of target sequences (typically, an entire reference genome, made up of chromosomal sequences).
                Step 4 then says
                Provide this FASTA file along with the SAM/BAM alignment file
                ("this FASTA file" means the reference file)

                Pretty clear indication that alignment is the first step and you input the reference and alignment, not the read fastq files.



                • #9
                  Originally posted by kmcarr
                  On page 2, section 3 "Overview of the process", step 1 says

                  Step 4 then says ("this FASTA file" means the reference file)

                  Pretty clear indication that alignment is the first step and you input the reference and alignment, not the read fastq files.

                  Genius, I did not read the manual carefully. But when I run velveth with Columbus, I get:

                  "WARNING: None of your read mappings recognized the reference sequence!"

                  Does that mean I have the wrong reference file, or that the reference is not recognized at all?
                  Last edited by arkilis; 12-04-2013, 09:55 PM.
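The thread does not resolve this warning. One thing that may be worth checking (an assumption, not something confirmed here) is whether the reference names in the SAM file match the sequence names in the FASTA file. The sketch below lists names that appear in the alignments but not in the reference; the function name `unmatched_refs` is hypothetical, and it assumes names are compared up to the first whitespace in each FASTA header.

```shell
#!/bin/sh
# List reference names used in a SAM file that have no matching FASTA header.
# Usage: unmatched_refs reference.fa alignments.sam
unmatched_refs() {
    fasta_names=$(mktemp) || return 1
    sam_names=$(mktemp) || return 1
    # FASTA names: header text after '>' up to the first whitespace.
    grep '^>' "$1" | sed 's/^>//; s/[[:space:]].*$//' | sort -u > "$fasta_names"
    # SAM names: column 3 (RNAME) of alignment lines, skipping '*' (unmapped).
    grep -v '^@' "$2" | cut -f 3 | grep -v '^\*$' | sort -u > "$sam_names"
    # Print names present in the SAM file but absent from the FASTA file.
    comm -13 "$fasta_names" "$sam_names"
    rm -f "$fasta_names" "$sam_names"
}
```

If this prints every name in the SAM file, the reads were probably aligned against a different file, or against differently named sequences, than the one passed to velveth with -reference.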
