Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • ABI Paired End Reads

    Hi,
    Does anyone have information on the new ABI Paired end reads, especially file formats and orientations of the reads.

    We are developing a colour space version of Novoalign that's performing pretty well but we would like to support paired end as well as mate pair for first release and having some sample data would help a lo, let us know if you have some reads to share, we don't need many.

    If anyone would like to try beta of NovoalignCS just ask.

    Colin

  • #2
    NovoalignCS sounds great. I'm currently comparing the performance of BWA and BioScope on SOLiD transcriptome data (single end, unfortunately). Having Novoalign as a third option would be nice, especially since other people in my group are happy using it on their non-colorspace reads. I always stumble across all kinds of bugs anyway, so volunteering as a beta tester seems obvious.

    Comment


    • #3
      hi epigen,

      That would be great if you could help. Just email me at colin at novocraft dot com I'll send you necessary info.

      Colin

      Comment


      • #4
        I have found novoalignCS very sensitive and specific when the alignments are evaluated after variant calling (SAMtools). I would definitely recommend testing this alignment tool, although it may be a little slow (like the regular novoalign) if you have 10+ sequencers (talking to you Broad/Baylor/WashU/etc.). Otherwise the multi-threaded version is the way to go, with MPI if you have it installed.

        One issue is that the # of reads from a SOLiD slide is in the hundreds of millions. If you don't have MPI installed, and you still want to align the many reads from a SOLiD run in a parallel fashion, I have created a script to convert and split the reads after using the "solid2fastq" utility in BFAST. It's a perl script, but is renamed with a .txt extension to fool the HTTP server:

        You can use this in a piped fashion:
        Code:
        solid2fastq [options] <csfastas> <quals> | bfast2novo.pl - <out.prefix> <split num>
        Hopefully this will help those with idle machines fill them up with novoalignCS.
        Last edited by nilshomer; 06-16-2010, 11:51 AM. Reason: Must fool the HTTP server

        Comment


        • #5
          data set is available on solidsoftwarecommunity.com

          Comment


          • #6
            Hmmm why not approach ABI for the sample data sets?
            with SOLiD4 being the new standard you might get different results if u used old datasets
            http://kevin-gattaca.blogspot.com/

            Comment


            • #7
              Thanks for help and suggestions. e managed to get a few files of Colour space paired end reads and latest versions of NovoalignCS now fully support paired end and mate pair reads.

              Colin

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              53 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              50 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              44 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X