Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Conversion from bcl format to fastq files

    Hi,

    I have sequencing data in raw format, the plan is to convert the .bcl files into fastq and demultiplexing using CSAVA. In order to do that I need to provide a sample sheet file. Does anyone know if I need to create this file and what files to use in order to do that.

    Thanks for your help

  • #2
    Yes, you need to create it. A csv file with the following header:

    FCID,Lane,SampleID,SampleRef,Index,Description,Control,Recipe,Operator


    Sorry, I got it confused with demultiplexing. For bcl conversion, you don't need any sample sheet file. Just run this python script (setupBclToQseq.py) provided by Illumina.

    Clariet
    Last edited by clariet; 09-13-2011, 10:41 AM.

    Comment


    • #3
      Kjaja,

      I find it very surprising that you have been given raw data without conversion (and possibly internal QC) by your sequence provider. Do you have access to CASAVA software? Processing with CASAVA does have a learning curve. It may be more work than necessary if you are planning to do this for just one data set.

      Comment


      • #4
        Originally posted by clariet View Post
        Sorry, I got it confused with demultiplexing. For bcl conversion, you don't need any sample sheet file. Just run this python script (setupBclToQseq.py) provided by Illumina.
        That depends on the version of Casava you're using. For 1.8 you need a sample sheet even if you're not demultiplexing since it uses the sample and project names to construct the output file and directory names.

        I agree with other comments though - if this is just a one off I'd go back to your sequencing provider and ask them to do the conversion for you. You don't want to have to get your head around Casava just for one sample.

        Comment


        • #5
          thank you all for your help, but what does the "Recipe" in the sample sheet stands for, I guess how do I get this information. also, can I tell from my files what the sample ID's are?

          Comment


          • #6
            To just run the extraction the only fields you actually need to fill in are, lane, sample name, contol and project name (and barcode index if you need to demultiplex). You can leave all of the rest blank and it won't make any difference to the output. All of the other fields are only there to keep a record of what happened in your sample, the pipeline doesn't actually use them in any way.

            The sample IDs are just arbitary names you've give to your samples. If you don't care about these you can just use Sample_1 Sample_2 etc.

            The recipe is the set of instructions used to control the sequencer and normally just corresponds to the type of run you've done.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Yesterday, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            52 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            45 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Working...
            X