Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Patidar
    Member
    • Feb 2012
    • 10

    SOLid2fastq

    Hi,
    I ran solid2fastq from bfast+bwa and from bfast.
    the stats are different for a sample i.e. entirely different:

    /usr/local/bfast+bwa/bin/solid2fastq -o khan03_20101013_PE_NB1001WGA_E /home/rajehh/mapping/data/DW/khan03_20101013_PE/khan03_20101013_PE_NB1001WGA_E_F*.csfasta /home/rajehh/mapping/data/DW/khan03_20101013_PE/khan03_20101013_PE_NB1001WGA_E_F*.qual
    Outputting, currently on:
    105443471
    Found
    number_of_ends number_of_reads
    1 105301151
    2 142320


    [rajehh@biowulf test]$ /usr/local/bfast/bin/solid2fastq -o NB1001WGA_E /home/rajehh/mapping/data/DW/khan03_20101013_PE/khan03_20101013_PE_NB1001WGA_E_F*.csfasta /home/rajehh/mapping/data/DW/khan03_20101013_PE/khan03_20101013_PE_NB1001WGA_E_F*.qual
    Outputting, currently on:
    105443471
    Found
    number_of_ends number_of_reads
    1 0
    2 105443471

    Any Idea why the two converters do not give the same result ?

    Thanks.
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
    Very interesting. Could you share the data (or reduce it to manageable size)?

    Comment

    • Patidar
      Member
      • Feb 2012
      • 10

      #3
      Hi Nils,
      I got it to 5000 Reads/file and
      /usr/local/bfast+bwa/bin/solid2fastq F3 F5 F3.q F5.q
      is
      Found
      number_of_ends number_of_reads
      1 4999
      2 0
      /usr/local/bfast/bin/solid2fastq F3 F5 F3.q F5.q
      is

      Found
      number_of_ends number_of_reads
      1 0
      2 4999


      But the file size it over 1MB and I cant attache it here.

      If you would like to see them I can send in a mail.

      Thanks for help

      Comment

      • nilshomer
        Nils Homer
        • Nov 2008
        • 1283

        #4
        Please send it to [email protected]

        Comment

        • Patidar
          Member
          • Feb 2012
          • 10

          #5
          I sent it to you using secure mail.

          Thanks for the help.

          Comment

          • Patidar
            Member
            • Feb 2012
            • 10

            #6
            Hi Nils,

            If I am correct you received my mail. Did you got any chance to look into this matter?

            Thanks.

            Comment

            • Patidar
              Member
              • Feb 2012
              • 10

              #7
              Hi,

              For another sample I noticed:

              Found
              number_of_ends number_of_reads
              1 0
              2 173685356

              Collecting paired end statistics...
              Collected statistics for 0 paired end distances.
              The paired end distance range was from 2147483647 to -2147483648.
              The paired end distance mean and standard deviation were 200.00 and 150.00. (Specified.)
              Reads processed: 10000000
              ************************************************************
              Alignment complete.
              ************************************************************
              Found 1416266 reads with no ends mapped.
              Found 8583734 reads with 1 end mapped.
              Found 8583734 reads with at least one end mapping.


              Am I doing something wrong ?

              My commands are
              SOLID2FASTQ
              /usr/local/bfast/bin/solid2fastq -n 10000000 -o sample F3 F5 F3.qual F5.qual

              MATCH
              $bfast match -f $ref -A 1 -n 16 -t -T $workdir/tmp/ -r $workdir/$_ >$workdir/$_.bmf

              LOCALALIGN
              $bfast localalign -f $ref -A 1 -n 16 -t -m $workdir/$_.bmf >$workdir/$_.baf

              POSTPROCESS

              $bfast postprocess -f $ref -A 1 -n 16 -a 3 -Y 0 -t -r $workdir/rg_info_$SAMPLE -v 200 -s 150 -i $workdir/$_.baf >$workdir/$_.1.sam

              I am aligning it to the hg19.


              Thanks,

              Comment

              • nilshomer
                Nils Homer
                • Nov 2008
                • 1283

                #8
                Originally posted by Patidar View Post
                Hi Nils,

                If I am correct you received my mail. Did you got any chance to look into this matter?

                Thanks.
                No, I am doing this on my own time, so I may not be able to look at it in the short term. I would be happy to accept a patch if you solve it before I do.

                Comment

                • Patidar
                  Member
                  • Feb 2012
                  • 10

                  #9
                  Hi Nils,

                  Thanks for looking into it.

                  Any Idea on the 9th post in this thread ?
                  "
                  The paired end distance range was from 2147483647 to -2147483648.
                  The paired end distance mean and standard deviation were 200.00 and 150.00. (Specified.)
                  "

                  is "2147483647 to -2147483648." is the default range ?
                  I looked into the Sam files and they looks good to me.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                    Here are nine questions we think about, in roughly the order they matter, before...
                    Yesterday, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  20 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  38 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  45 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-04-2026, 08:59 AM
                  0 responses
                  49 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...