  • sfh838t
    Member
    • Apr 2014
    • 29

    velvet columbus reference errors

    I have seen some posts about using velvet with a reference, but they have not helped me.
    I am trying to assemble a plant genome using a related species as the reference. I am trying to use velvet columbus, but I can't seem to get the command line right.
    Following the manual, I am using:
    velveth dir_name 31 -reference ref.fasta -sam illumina_align.sam
    With this I get: -sam file cannot contain reference sequence.
    If I add -short or -long in front of the -sam switch, velveth seems to ignore the -reference switch and complains that my "read1" (which is supposed to be the reference) is too long.
    I have removed from the sam file anything that might have something to do with the reference, but I still get the same error.
    I am not good with the Linux command line, and I get the impression I am forgetting some little space, comma, slash or something.
    I used bowtie2 to align the single-end Illumina reads. Does anyone know whether bowtie2 puts the reference sequence anywhere in its sam output? Or, since all the posts about velvet columbus I have been able to find deal with PE input, can velvet with a reference only be used with PE reads?
  • ctseto
    Member
    • Oct 2013
    • 44

    #2
    By its nature, SAM has the alignment in it (SAM = Sequence Alignment/Map).

    First: Did you sort your SAM file? (will need samtools for this).

    Checking the manual again, it looks like it should be:

    Code:
    velveth $FOLDERNAME $KMER -reference ref.fasta -shortPaired -sam illumina_align.sam
    Can you post the first few lines of your sam file by using

    Code:
    head illumina_align.sam
    Since you already have to do the alignment of your reads to the reference, have you tried looking at the alignments? Are there dips in coverage or re-arrangements, things like that?
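    On the question of what the aligner writes into the SAM file: per the SAM format, the header declares each reference only by name and length (@SQ lines with SN: and LN: tags), not by its sequence. A rough sketch for checking that those names agree with the FASTA given to -reference is below. It uses only awk, sort and comm; the function names are made up for illustration, and it assumes the aligner names each reference by the FASTA header text up to the first whitespace.

```shell
# Print the reference names declared in the SAM header (@SQ lines, SN: tag).
sam_ref_names() {
    awk -F '\t' '/^@SQ/ { for (i = 2; i <= NF; i++) if ($i ~ /^SN:/) print substr($i, 4) }' "$1" | sort
}

# Print the sequence names in a FASTA file (text after '>' up to the first whitespace).
fasta_names() {
    awk '/^>/ { sub(/^>/, "", $1); print $1 }' "$1" | sort
}

# List names found in only one of the two files; no output means the names agree.
# Usage: compare_ref_names alignments.sam reference.fasta
compare_ref_names() {
    sam_tmp=$(mktemp)
    fa_tmp=$(mktemp)
    sam_ref_names "$1" > "$sam_tmp"
    fasta_names "$2" > "$fa_tmp"
    comm -3 "$sam_tmp" "$fa_tmp"
    rm -f "$sam_tmp" "$fa_tmp"
}
```

    Running compare_ref_names illumina_align.sam ref.fasta and getting no output would at least rule out a naming mismatch. It does not by itself explain the velveth error.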


    • mastal
      Senior Member
      • Mar 2009
      • 666

      #3
      Originally posted by sfh838t
      velveth dir_name 31 -reference ref.fasta -sam illumina_align.sam
      If you have single end reads, you should also have '-short' before the '-sam' switch.
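      Spelled out, that suggestion would give the command line below. The small helper only prints the command so it can be checked for stray spaces or typos before running it; the function name is hypothetical, and the file names are the ones from the quoted post.

```shell
# Print (do not run) the velveth command for single-end reads with a reference.
# Arguments: output directory, k-mer length, reference FASTA, SAM file.
columbus_velveth_cmd() {
    printf 'velveth %s %s -reference %s -short -sam %s\n' "$1" "$2" "$3" "$4"
}

columbus_velveth_cmd dir_name 31 ref.fasta illumina_align.sam
# prints: velveth dir_name 31 -reference ref.fasta -short -sam illumina_align.sam
```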

      Have a look at the Columbus manual.


      I would leave the header lines in the sam file.
      The sam file should be sorted by read name. The default with samtools is to sort the files by chromosome and alignment position.
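      With samtools, name sorting is done with samtools sort -n (the exact arguments differ between samtools versions). If samtools is not at hand, a minimal sketch with standard tools is below. It keeps the header lines at the top and sorts the alignment lines on the read name in column 1. The function name is made up, and it assumes that having reads ordered by name in plain byte order is all that is needed.

```shell
# Sort a SAM file by read name (column 1) while keeping the '@' header lines first.
# Note: the @HD SO: tag, if present, is copied unchanged.
# Usage: sort_sam_by_name in.sam out.sam
sort_sam_by_name() {
    in_sam="$1"
    out_sam="$2"
    tab=$(printf '\t')
    # Header lines start with '@'; alignment lines never do.
    awk '/^@/' "$in_sam" > "$out_sam"
    # Stable sort on the first tab-separated field only.
    awk '!/^@/' "$in_sam" | LC_ALL=C sort -s -t "$tab" -k1,1 >> "$out_sam"
}
```

      For example, sort_sam_by_name illumina_align.sam illumina_align.byname.sam writes a name-sorted copy and leaves the original file untouched.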

      Also check the requirements for the reference.fasta file.
      Last edited by mastal; 05-09-2014, 12:15 PM.


      • sfh838t
        Member
        • Apr 2014
        • 29

        #4
        Thanks for any and all replies.
        mastal: if I put -short before the -sam, velvet seems to ignore the -reference switch. It then reads both files, puts them together into one file and, like I said, I get the "read 1 is too long" error.

        ctseto: I used both sorted and unsorted files. I used unsorted at first, then noticed that was wrong, and then used the very same file(s) that someone else tried for me (and that ran perfectly fine for them), and I ALWAYS get the very same error. Since I want to do an assembly, I used the same entire reference seq for velvet that I used for bwa.

        I have looked at the alignments (IGV) and yes, there are areas with lots of coverage and some with none. I am working with a plant, and I know there will be lots and lots of repeat elements. But honestly, I do not see how that would stop velvet from even reading my sam file, because the error shows up as soon as the sam file is opened. It takes about 3 secs flat.

        Any further suggestions anyone?
