Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • vsnae
    Junior Member
    • Nov 2015
    • 4

    Complete assemblies with raw data

    I'm looking for a few public data sets of genomic and transcriptomic assemblies (preferably complete) where the source reads for the assemblies are available for download. The more the better.

    Been trying to navigate my way through ncbi and ebi's websites with little success. It's always either or where I look.

    Would appreciate any nudge in the correct direction!
    Thanks
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Your best bet is to find the papers you like and then backtrack and find the datasets at SRA using the SRA/GEO accession numbers.

    Comment

    • vsnae
      Junior Member
      • Nov 2015
      • 4

      #3
      Was afraid of that, never hurt anyone to comb for quality papers though. Appreciate the answer.

      Having said that, from a general point of view, wouldn't it make sense to link these in the databases?

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Look at the first three datasets here: http://www.ncbi.nlm.nih.gov/sra/?term=platinum

        These are the "platinum" genomes that illumina made available for coriell samples.

        Are you looking for a specific genome otherwise a query like this brings up many datasets: http://www.ncbi.nlm.nih.gov/sra/?ter...ptome+assembly

        Comment

        • vsnae
          Junior Member
          • Nov 2015
          • 4

          #5
          Thanks. While those read files look solid I was hoping to find such files along with de-novo assemblies made from them (am I not seeing them?)

          Why? Looking to assess the effects of raw data quality and characteristics to assembly results. Reproducing and comparing assemblies given different preprocessing and assembly methods to assess the overall quality and differences. While that can be done without looking at previous assemblies I'd find it more reassuring to do so, especially since they often contain manual gap filling etc.

          Comment

          • Brian Bushnell
            Super Moderator
            • Jan 2014
            • 2709

            #6
            It's much simpler to study these things in the context of lower organisms, such as bacteria. Or, for the more aggressive... unicellular haploid eukaryotes. Then diploids such as small plants and animals.

            You can get a lot of raw data at JGI's mycocosm, phytozome, and other places on the website. Unfortunately we generally don't study animals, but there should be a lot of raw C.elegans and drosophila data floating around.

            To clarify, studying the effects of data quality and so forth on assembly is easiest in the context of low-repeat haploids, which means bacteria. You can also do it for low-het-rate diploids. The smaller the genome, the better.
            Last edited by Brian Bushnell; 11-14-2015, 10:57 PM.

            Comment

            • vsnae
              Junior Member
              • Nov 2015
              • 4

              #7
              Thanks, and right, fungal and bacterial haploids would be more than enough.

              Not sure how to navigate JGI's website, returning 404's when I try to access the data for e.g. Amaranthus hypochondriacus.

              Afraid that for the model organisms, any assembly made would have benefited from earlier ones and I'd prefer not retracing the complexity involved in mapping assemblies. On the other hand those reads would suit well for treatment without comparisons to previous (direct) assemblies.

              Comment

              • GenoMax
                Senior Member
                • Feb 2008
                • 7142

                #8
                It is probably going to be difficult to find both the raw data and the assemblies in public databases. Some people may submit both but most probably only submit the raw data since that is all the journals require.

                Another option could be to find the raw data/published papers that go with it and then ask the authors directly if they can share the assembly, if you can't find it in a public resource.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Pathogen Surveillance with Advanced Genomic Tools
                  by seqadmin




                  The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                  03-24-2025, 11:48 AM
                • seqadmin
                  New Genomics Tools and Methods Shared at AGBT 2025
                  by seqadmin


                  This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                  The Headliner
                  The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                  03-03-2025, 01:39 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 03-20-2025, 05:03 AM
                0 responses
                49 views
                0 reactions
                Last Post seqadmin  
                Started by seqadmin, 03-19-2025, 07:27 AM
                0 responses
                57 views
                0 reactions
                Last Post seqadmin  
                Started by seqadmin, 03-18-2025, 12:50 PM
                0 responses
                50 views
                0 reactions
                Last Post seqadmin  
                Started by seqadmin, 03-03-2025, 01:15 PM
                0 responses
                201 views
                0 reactions
                Last Post seqadmin  
                Working...