Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Basespace update

    Illumina replaced MiSeq reporter as the demultiplexor for miseq data on basespace with bcl2fastq a couple of weeks ago. Since then I've run into a number of instances where MSR and bcl2fastq are different which had meant many demultiplexing failures. To save people time I thought I'd start the list of issues I've hit.

    1. MSR allowed "." in sample names and sample ID, bcl2fastq does not


    2. MSR treats "N" as wildcard, bcl2fastq treats it as exact. I run a mix of dual 8bp and TruSeq lt single index 6bp. MSR had allowed me to just put NNNNNNNN as the i5 and NN at the end of the i7, this no longer works. you have to use the actual sequences (AT at the end of i7 and TCTTTCCC for i5)


    3. bcl2fastq or basespace is much much slower at demultiplexing. It used to take <30min to rerun a sample sheet, it's taking >4hours now.


    4. bcl2fastq doesn't allow you to set the indexing mismatch (at least tech support that I talked to didn't know how to globally set). It tries to allow 1 mismatch and only drops to exact match if the hamming distance is <3
    Last edited by thermophile; 09-20-2016, 07:06 AM.
    Microbial ecologist, running a sequencing core. I have lots of strong opinions on how to survey communities, pretty sure some are even correct.

  • #2
    In addition the BaseSpace apps are no longer free. You now must be subscribed to a Professional account to access the apps and also pay each time you run them.

    Comment


    • #3
      Interesting about having to pay to use all of the apps. We're going to have some unhappy customers!

      Comment


      • #4
        Well that sucks, I just talked a few users into trying BaseSpace based on the NCBI_SRA app
        Microbial ecologist, running a sequencing core. I have lots of strong opinions on how to survey communities, pretty sure some are even correct.

        Comment


        • #5
          Originally posted by thermophile View Post
          3. bcl2fastq or basespace is much much slower at demultiplexing. It used to take <30min to rerun a sample sheet, it's taking >4hours now.
          Perhaps this is related to the number of cores you allow bcl2fastq to use?

          Originally posted by thermophile View Post
          4. bcl2fastq doesn't allow you to set the indexing mismatch (at least tech support that I talked to didn't know how to globally set). It tries to allow 1 mismatch and only drops to exact match if the hamming distance is <3
          from the bcl2fastq --help text:
          Code:
            --barcode-mismatches arg (=1)
          number of allowed mismatches per index
          multiple entries, comma delimited entries, allowed; 
          each entry is applied to the corresponding index;
          last entry applies to all remaining indices
          there is also this, which I have no idea what it does:
          Code:
            --adapter-stringency arg (=0.9)                 adapter stringency

          Comment


          • #6
            Originally posted by microgirl123 View Post
            Interesting about having to pay to use all of the apps. We're going to have some unhappy customers!
            I would imagine some of the third party developers aren't too happy either since their apps are now stuck behind a paywall.

            Comment


            • #7
              $5k to upgrade to professional which gives you the privilege of paying for apps
              Microbial ecologist, running a sequencing core. I have lots of strong opinions on how to survey communities, pretty sure some are even correct.

              Comment


              • #8
                Originally posted by fanli View Post
                Perhaps this is related to the number of cores you allow bcl2fastq to use?


                from the bcl2fastq --help text:
                Code:
                  --barcode-mismatches arg (=1)
                number of allowed mismatches per index
                multiple entries, comma delimited entries, allowed; 
                each entry is applied to the corresponding index;
                last entry applies to all remaining indices
                there is also this, which I have no idea what it does:
                Code:
                  --adapter-stringency arg (=0.9)                 adapter stringency
                Thanks! I'll have to see if i can change something in the sample sheet to set this
                Microbial ecologist, running a sequencing core. I have lots of strong opinions on how to survey communities, pretty sure some are even correct.

                Comment


                • #9
                  Originally posted by thermophile View Post
                  Thanks! I'll have to see if i can change something in the sample sheet to set this
                  Or switch to using bcl2fastq locally instead of BaseSpace

                  Comment


                  • #10
                    Originally posted by GenoMax View Post
                    Or switch to using bcl2fastq locally instead of BaseSpace
                    I may have to do that, but that means I'll have to build a server for distributing the data to clients.
                    Microbial ecologist, running a sequencing core. I have lots of strong opinions on how to survey communities, pretty sure some are even correct.

                    Comment


                    • #11
                      Originally posted by thermophile View Post
                      I may have to do that, but that means I'll have to build a server for distributing the data to clients.
                      If you are part of an academic institution then look into tapping common central compute resource. That way you won't need to become a sys admin in addition to other hats you wear (and not have to worry about security etc). If your users use that central compute resource then they would appreciate getting their data directly delivered to them.

                      Comment


                      • #12
                        Originally posted by GenoMax View Post
                        If you are part of an academic institution then look into tapping common central compute resource. That way you won't need to become a sys admin in addition to other hats you wear (and not have to worry about security etc). If your users use that central compute resource then they would appreciate getting their data directly delivered to them.
                        My experience has been that you always still need a bit of sysadmin experience to configure things exactly the way you like. For example, how do you add a new user/client for data access? Sometimes it's just easier to htpasswd it yourself

                        Comment


                        • #13
                          the only charge for the apps is the cost for the compute on AWS unless it is a 3rd party app that costs to run. Also there are still Free accounts that come with some credits so you can trial Basespace. If your clients plan on using it a lot for analysis then they will need to upgrade otherwise they can still receive the data on a free account i think.

                          Comment


                          • #14
                            Originally posted by elutheria View Post
                            the only charge for the apps is the cost for the compute on AWS unless it is a 3rd party app that costs to run. Also there are still Free accounts that come with some credits so you can trial Basespace. If your clients plan on using it a lot for analysis then they will need to upgrade otherwise they can still receive the data on a free account i think.
                            Or buy an Illumina sequencer and negotiate free use of BaseSpace for some time period as part of the deal.

                            Comment


                            • #15
                              A couple of things:

                              - BaseSpace will still do basecalling for free from instrument runs.
                              - The newest version of bcl2fastq2 will now treat N bases properly (as 'wildcards' so-to-speak)

                              Also, surely this doesn't come as a surprise... as far as I'm aware, it was pretty well communicated a long time ago that it was going to become a pay-per-use service.

                              Cheers,

                              Scott.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM
                              • seqadmin
                                Techniques and Challenges in Conservation Genomics
                                by seqadmin



                                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                Avian Conservation
                                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                03-08-2024, 10:41 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 06:37 PM
                              0 responses
                              10 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, Yesterday, 06:07 PM
                              0 responses
                              9 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-22-2024, 10:03 AM
                              0 responses
                              49 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-21-2024, 07:32 AM
                              0 responses
                              67 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X