Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • paolo.kunder
    Member
    • Aug 2011
    • 93

    XSQ converter

    I have a PE run, 6 lane, 12 barcoded samples.
    ICS SOLiD software generated 6 XSQ file ( one per lane).

    Now I will use XSQ_Tools to split each XSQ file in 12 indexed XSQ.

    I will end up with 72 XSQ indexed files (6 lanes x 12 samples).

    Is there a way to combine the 6 XSQ files prior to split them in order to generate at the end only 12 XSQ indexed files?

    May I use merge Linux command to combine the 6 XSQ files or I may lose information?

    Thanks,
    Paolo
    Last edited by paolo.kunder; 12-29-2011, 06:22 AM.
  • genehunter
    Junior Member
    • Aug 2009
    • 4

    #2
    Xsq

    No, you cannot combine them using linux commands. The files are binary.
    You may want to use LifeScope or HDF (format) APIs.

    Comment

    • colindaven
      Senior Member
      • Oct 2008
      • 417

      #3
      We have been doing alignments of all and recombining - merging - the SAM files with Picards MergeSamFiles.

      This is admittedly not a great solution !

      Let us know if you have any better ideas.

      Comment

      • paolo.kunder
        Member
        • Aug 2011
        • 93

        #4
        Finally I converted my indexed XSQ files in csfasta (and QV.qual) files with XSQconverter and merged each individual csfasta (and QV.qual) with cat function,
        information seems to be maintained,
        paolo

        Comment

        • colindaven
          Senior Member
          • Oct 2008
          • 417

          #5
          Yeah, did that too. Bioscope and Lifescope wouldn't use all the reads for alignment, but just the first set.
          Not sure why that was.

          NovoalignCS doesn't seem to pick up on the different read sets and uses all reads.

          Colin

          Comment

          • kevleb
            Member
            • Jun 2009
            • 10

            #6
            Originally posted by paolo.kunder View Post
            Finally I converted my indexed XSQ files in csfasta (and QV.qual) files with XSQconverter and merged each individual csfasta (and QV.qual) with cat function,
            information seems to be maintained,
            paolo
            When you cat files, you should obtain a final file with 6 times the same bead_id for different sequence comming from each of your 6 lanes. I'm not sure it won't be a problem then in the future bam file.
            For external pipelines using .csfasta, instead of cat file, i open each .csfasta (and QV.qual) and update the panel number:
            lane 1 : 1 to 708
            lane 2 : 709 to 1416
            lane 3 ...

            In case of you use lifescope you should not do anything, import .xsq in lifescope and create reads set directly in lifescope. You can merge the .bam files from each lane with samtools after mapping. There you will have the bead_id issues in your final bam file. It seems not to be problem in case of only visualization in genome browser but it certainly depends on what you plan to do with the merge .bam file.

            kevin.
            Last edited by kevleb; 01-05-2012, 03:00 AM.

            Comment

            Latest Articles

            Collapse

            • GATTACAT
              Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by GATTACAT
              Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
              Yesterday, 11:43 AM
            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Today, 11:08 AM
            0 responses
            6 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-30-2026, 05:37 AM
            0 responses
            11 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-26-2026, 11:10 AM
            0 responses
            19 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            53 views
            0 reactions
            Last Post SEQadmin2  
            Working...