Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • tdoniger
    Member
    • Nov 2010
    • 13

    Vector Removal Software

    I am trying to perform de novo assembly on 454 data using Newbler v2.5.

    As a first stage, I need to remove the vector sequence. I have the vector sequence.

    1. Using newbler, I have included the vector file both as a trimming database and as a screening database. Nonetheless, vector sequence is found in the assembly.

    2. I have tried using Lucy. It does remove some of the vector sequences, but then using BLAST I still find quite a bit that was not removed.

    2. I tried using SeqClean -which actually manages to remove all the vector but does not provide a quality file. I could write some program that would produce a modified .qual file, but I was wondering if such a tool already exists.

    What are others experience in vector removal?

    Much Thanks,
    Tirza Doniger
    --
    Tirza Doniger, Ph.D.
    Bioinformatics Unit
    The Mina and Everard Faculty of Life Sciences
    Bar Ilan University
  • ulz_peter
    Senior Member
    • Feb 2010
    • 219

    #2
    In case you've got a fasta and a qual file you could merge them to a fastq file and then clip the vector sequences with the FastX package using FastX clipper:



    That worked quite well for me for removing Tranposon DNA elements from Library prep.

    Comment

    • kmcarr
      Senior Member
      • May 2008
      • 1181

      #3
      Originally posted by tdoniger View Post
      2. I tried using SeqClean -which actually manages to remove all the vector but does not provide a quality file. I could write some program that would produce a modified .qual file, but I was wondering if such a tool already exists.
      Much Thanks,
      Tirza Doniger
      Tirza,

      SeqClean includes a utility to create a new qual file which corresponds to your cleaned reads. The program is called 'cln2qual' and it is in the main seqclean directory. It takes as input the cleaning report generated by SeqClean (the .cln file) and your original .qual file. It outputs a new .qual file with appropriately trimmed (or excluded) qual scores.
      Last edited by kmcarr; 02-08-2011, 08:13 AM. Reason: Removed comments about read ordering in new qual file; mistaken about this.

      Comment

      • tdoniger
        Member
        • Nov 2010
        • 13

        #4
        Wow! Thank you! Just what I was looking for! The 'cln2qual' tool works great. I didn't notice it in the SeqClean directory. Newbler accepts the input without any problems.

        Thanks again,
        Tirza
        --
        Tirza Doniger, Ph.D.
        Bioinformatics Unit
        The Mina and Everard Faculty of Life Sciences
        Bar Ilan University

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        33 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        97 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        117 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-04-2026, 08:59 AM
        0 responses
        111 views
        0 reactions
        Last Post SEQadmin2  
        Working...