Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • noIndex in SampleSheet casava1.8

    I simply want to convert .bcl files to .fastQ.
    I am having trouble with SampleSheet.csv for casava1.8. My samples do not have index field and I kept the "index" field of the SampleSheet empty (as suggested in the manual). But it keeps throwing an error.
    "ERROR: Conflicting sample sheet definitions for lane lane 1. Sample sheet line: 5. Existing:NoIndex New: NoIndex."

    Can someone please help.

  • #2
    Well it looks like you have two entries for lane #1; if you aren't using indexes you can't do this. But if you really want help you need to upload/paste a copy of the samplesheet.csv file so we can look at it.

    Comment


    • #3
      Is this a MiSeq or a HiSeq dataset?

      It is perfectly ok to have mixed samples (multiplex and non-m) on a flowcell. Attach a copy (or pm me) of the samplesheet file (redact sample names if they mean something).

      Comment


      • #4
        thanks all. Here is the SampleSheet
        FCID,Lane,SampleID,SampleRef,Index,Description,Control,Recipe,Operator,SampleProject
        HHPDCAZX,1,H-1,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
        HHPDCAZX,1,H-2,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
        HHPDCAZX,1,H-3,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
        HHPDCAZX,1,H-4,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
        HHPDCAZX,1,H-5,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu

        Comment


        • #5
          Actually my samples do have Indexes which are not Illumina. I want to convert the files to fastq and then use barsplitter ot something.

          Comment


          • #6
            Originally posted by biofreak View Post
            thanks all. Here is the SampleSheet
            FCID,Lane,SampleID,SampleRef,Index,Description,Control,Recipe,Operator,SampleProject
            HHPDCAZX,1,H-1,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
            HHPDCAZX,1,H-2,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
            HHPDCAZX,1,H-3,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
            HHPDCAZX,1,H-4,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
            HHPDCAZX,1,H-5,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
            biofreak,

            If you do not list indexes to distinguish the various samples in lane 1 then CASAVA will throw and error; after all, how could CASAVA know which sample (H-1, H-2, etc.) a read was from without an index.

            Since you don't want to perform demultiplexing with CASAVA what you really need it to do is output a single FASTQ file containing all of your samples mixed together. To do this your SampleSheet.csv file should include a SINGLE line for lane #1, e.g.:

            Code:
            HHPDCAZX,1,H-x,Rat,,Hu,Control,HiSeq_R_90,Rob,Hu
            Feed this file to whatever demultiplexer you like and configure that software as needed to identify your samples.

            Comment


            • #7
              Originally posted by biofreak View Post
              Actually my samples do have Indexes which are not Illumina. I want to convert the files to fastq and then use barsplitter ot something.
              Use a single entry for that lane (with some name in sampleID field) in the SampleSheet file as kmcarr suggested to avoid the error you are getting. You can then parse the resulting sequence files to sort the reads based on your tags.

              Comment


              • #8
                Oh thanks you so much. I'll make changes to my csv file for all the lanes.
                I tried using casava to demultiplex the barcodes that I have (nugen) using --use-bases-mask field. It does not complain and creates fastq files. However, after the alignment, the number of reads in the SAM files are very less ( in the range of 1000 to something).. So something is wrong. DO you know if casava 1.8 can handle non Illumina indexes?
                Once again thanks a lot.

                Comment


                • #9
                  Originally posted by biofreak View Post
                  Oh thanks you so much. I'll make changes to my csv file for all the lanes.
                  I tried using casava to demultiplex the barcodes that I have (nugen) using --use-bases-mask field. It does not complain and creates fastq files. However, after the alignment, the number of reads in the SAM files are very less ( in the range of 1000 to something).. So something is wrong. DO you know if casava 1.8 can handle non Illumina indexes?
                  Once again thanks a lot.
                  Yes, CASAVA can handle non-Illumina barcodes just fine. If you are using a dual indexing protocol then you need to use the most recent version of CASAVA (I don't recall if dual index support was added in 1.8.1 or 1.8.2 and I'm too lazy to check). You just have to make sure that the format provided with "--use-bases-mask" properly reflects the library construction. I'm not familiar with the NuGen library prep protocol so I can't give you any specifics.

                  Curious as to why you think the demultiplexing led to problems with alignment. What fraction of your reads were demultiplexed by CASAVA and was the distribution into pools as expected? Is there some adapter remnant resulting from the NuGen prep left over that needs to be removed (it may be possible to deal with that using the "--use-bases-mask" as well).

                  [ETA Dual index support was added in v1.8.2 of CASAVA]
                  Last edited by kmcarr; 04-11-2012, 06:38 AM. Reason: Update version info for dual indexing.

                  Comment


                  • #10
                    I am sorry for my lack of knowledge but I did change the csv and it runs and does not output any fastq files.
                    The sample sheet now looks like.

                    HHPDCAZX,1,H-X,Rat,,NoIndex1,Control,HiSeq_R_90,Rob,NoIndex1
                    HHPDCAZX,2,N-X,Rat,,NoIndex2,Control,HiSeq_R_90,Rob,NoIndex2
                    HHPDCAZX,3,Y-X,Rat,,NoIndex3,Control,HiSeq_R_90,Rob,NoIndex3
                    HHPDCAZX,4,M-X,Rat,,NoIndex4,Control,HiSeq_R_90,Rob,NoIndex4
                    HHPDCAZX,5,P-X,Rat,,NoIndex5,Control,HiSeq_R_90,Rob,NoIndex5
                    HHPDCAZX,6,P-X,Rat,,NoIndex6,Control,HiSeq_R_90,Rob,NoIndex6

                    thanks a lot for helping

                    Comment


                    • #11
                      Originally posted by biofreak View Post
                      I am sorry for my lack of knowledge but I did change the csv and it runs and does not output any fastq files.
                      The sample sheet now looks like.

                      HHPDCAZX,1,H-X,Rat,,NoIndex1,Control,HiSeq_R_90,Rob,NoIndex1
                      HHPDCAZX,2,N-X,Rat,,NoIndex2,Control,HiSeq_R_90,Rob,NoIndex2
                      HHPDCAZX,3,Y-X,Rat,,NoIndex3,Control,HiSeq_R_90,Rob,NoIndex3
                      HHPDCAZX,4,M-X,Rat,,NoIndex4,Control,HiSeq_R_90,Rob,NoIndex4
                      HHPDCAZX,5,P-X,Rat,,NoIndex5,Control,HiSeq_R_90,Rob,NoIndex5
                      HHPDCAZX,6,P-X,Rat,,NoIndex6,Control,HiSeq_R_90,Rob,NoIndex6

                      thanks a lot for helping
                      Could you please provide some more information on where the procedure is failing. Does configureBclToFastq.pl run successfully with your new SampleSheet.csv? Does it create a working directory (default is a directory named Unaligned in the main run directory)? Does it fail when you try to run make in that directory?

                      Comment


                      • #12
                        I have specified separate output dir which shows all the 6 folders for the 6 lanes. Each one contains just the csv file for the respective sample but no zipped fastq files. My command is:

                        nohup ~/home/CASAVA_v1.8.0/bin/configureBclToFastq.pl --force --input-dir \
                        ~/home/HU_02_HPDCAZX/Data/Intensities/BaseCalls \
                        --output-dir /home/HU_02_HPDCAZX/Data/NoIndexUnaligned --sample-sheet ~/home/HU_02_HPDCAZX/Data/Intensities/BaseCalls/SampleSheet_NoIndex.csv > nohup_first.out

                        Comment


                        • #13
                          Originally posted by biofreak View Post
                          I have specified separate output dir which shows all the 6 folders for the 6 lanes. Each one contains just the csv file for the respective sample but no zipped fastq files. My command is:

                          nohup ~/home/CASAVA_v1.8.0/bin/configureBclToFastq.pl --force --input-dir \
                          ~/home/HU_02_HPDCAZX/Data/Intensities/BaseCalls \
                          --output-dir /home/HU_02_HPDCAZX/Data/NoIndexUnaligned --sample-sheet ~/home/HU_02_HPDCAZX/Data/Intensities/BaseCalls/SampleSheet_NoIndex.csv > nohup_first.out
                          You have only completed the first step of the BCL->FASTQ conversion, namely running configureBclToFastq.pl. This script merely checks that all of the input files are present, creates the necessary output locations and configures demultiplexing (if indexes are being used, not in your case obviously). [As an aside you do not need to run nohup for this command as it runs very quickly and you want to see the output in your terminal to see if any errors are reported.]

                          Once you have successfully run configureBclToFastq.pl you 'cd' to the output directory it created (/home/HU_02_HPDCAZX/Data/NoIndexUnaligned in your example) and run 'make'. This is where the BCL to FASTQ conversion (and demultiplexing if configured) happens. [This is also where you would use nohup.] So assuming you are now in the /home/HU_02_HPDCAZX/Data/NoIndexUnaligned directory

                          Code:
                          #> /usr/bin/nohup make -j n &
                          where 'n' is the number of threads you would like to run make with (hint, don't use more than the number of cpu cores in your computer).

                          Comment


                          • #14
                            oops! My bad
                            thanks

                            Comment


                            • #15
                              Originally posted by biofreak View Post
                              oops! My bad
                              thanks
                              And, did it work?

                              Btw, 'Control' in your samplesheet should be either 'Y' or 'N'.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM
                              • seqadmin
                                Techniques and Challenges in Conservation Genomics
                                by seqadmin



                                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                Avian Conservation
                                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                03-08-2024, 10:41 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 06:37 PM
                              0 responses
                              10 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, Yesterday, 06:07 PM
                              0 responses
                              9 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-22-2024, 10:03 AM
                              0 responses
                              49 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-21-2024, 07:32 AM
                              0 responses
                              67 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X