Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sparks
    Senior Member
    • Mar 2008
    • 126

    ABI Paired End Reads

    Hi,
    Does anyone have information on the new ABI Paired end reads, especially file formats and orientations of the reads.

    We are developing a colour space version of Novoalign that's performing pretty well but we would like to support paired end as well as mate pair for first release and having some sample data would help a lo, let us know if you have some reads to share, we don't need many.

    If anyone would like to try beta of NovoalignCS just ask.

    Colin
  • epigen
    Senior Member
    • May 2010
    • 101

    #2
    NovoalignCS sounds great. I'm currently comparing the performance of BWA and BioScope on SOLiD transcriptome data (single end, unfortunately). Having Novoalign as a third option would be nice, especially since other people in my group are happy using it on their non-colorspace reads. I always stumble across all kinds of bugs anyway, so volunteering as a beta tester seems obvious.

    Comment

    • sparks
      Senior Member
      • Mar 2008
      • 126

      #3
      hi epigen,

      That would be great if you could help. Just email me at colin at novocraft dot com I'll send you necessary info.

      Colin

      Comment

      • nilshomer
        Nils Homer
        • Nov 2008
        • 1283

        #4
        I have found novoalignCS very sensitive and specific when the alignments are evaluated after variant calling (SAMtools). I would definitely recommend testing this alignment tool, although it may be a little slow (like the regular novoalign) if you have 10+ sequencers (talking to you Broad/Baylor/WashU/etc.). Otherwise the multi-threaded version is the way to go, with MPI if you have it installed.

        One issue is that the # of reads from a SOLiD slide is in the hundreds of millions. If you don't have MPI installed, and you still want to align the many reads from a SOLiD run in a parallel fashion, I have created a script to convert and split the reads after using the "solid2fastq" utility in BFAST. It's a perl script, but is renamed with a .txt extension to fool the HTTP server:

        You can use this in a piped fashion:
        Code:
        solid2fastq [options] <csfastas> <quals> | bfast2novo.pl - <out.prefix> <split num>
        Hopefully this will help those with idle machines fill them up with novoalignCS.
        Last edited by nilshomer; 06-16-2010, 11:51 AM. Reason: Must fool the HTTP server

        Comment

        • Rao
          Member
          • Oct 2008
          • 36

          #5
          data set is available on solidsoftwarecommunity.com

          Comment

          • KevinLam
            Senior Member
            • Nov 2009
            • 204

            #6
            Hmmm why not approach ABI for the sample data sets?
            with SOLiD4 being the new standard you might get different results if u used old datasets
            http://kevin-gattaca.blogspot.com/

            Comment

            • sparks
              Senior Member
              • Mar 2008
              • 126

              #7
              Thanks for help and suggestions. e managed to get a few files of Colour space paired end reads and latest versions of NovoalignCS now fully support paired end and mate pair reads.

              Colin

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              36 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              99 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              120 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-04-2026, 08:59 AM
              0 responses
              113 views
              0 reactions
              Last Post SEQadmin2  
              Working...