Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Did you check this? http://www.ncbi.nlm.nih.gov/bioproje...rm=PRJNA169002

    I felt there's something missing, but didn't really know what WGS is. Aren't the raw reads all I need?

    Comment


    • #17
      Originally posted by antifolate View Post
      Did you check this? http://www.ncbi.nlm.nih.gov/bioproje...rm=PRJNA169002

      I felt there's something missing, but didn't really know what WGS is. Aren't the raw reads all I need?
      That link is just the main "project" page (it also appears to be for S. cerevisiae N85 strain). If the "whole genome sequencing" (WGS) data for YHJ7 was submitted under that project it should have been linked there.

      See related examples here: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA282006 or http://www.ncbi.nlm.nih.gov/bioproject/PRJNA188959

      Comment


      • #18
        If you click Taxonomy in the upper right corner, then Nucleotide on the next page, then WGS CBYJ010000001-CBYJ010000371 on the next one, you'll get here http://www.ncbi.nlm.nih.gov/Traces/w...CBYJ01#contigs where you can download the assembled contigs.

        Any tips on hybrid de novo/ reference guided assembly without CLC?

        Comment


        • #19
          You can probably get similar results to a reference-guided assembly by assembling your reads with a normal assembler, then scaffolding with simulated long-mate-pairs from the reference. SSPace is one program for scaffolding assembled contigs. You can generate a fake long-mate-pair library using RandomReads (distributed with BBMap).

          Hard to say which assembler you should be using, but Velvet is pretty straightforward and fast, so it's good to try at first.

          Comment


          • #20
            I'm confused so please bear with me. I have three files now: lab_S_cervaise.fastq, ricewine_S_cervaise.fastq, and the WGS contigs file (.fsa_nt). What is the WGS file for? Which on of these should I use to assemble the genome?

            Comment


            • #21
              @antifolate: I have not read the paper you linked but the WGS sequence you downloaded from the taxonomy link is for strain N85. How it relates to YHJ7 is something you will need to consider.

              Comment


              • #22
                Originally posted by antifolate View Post
                I'm confused so please bear with me. I have three files now: lab_S_cervaise.fastq, ricewine_S_cervaise.fastq, and the WGS contigs file (.fsa_nt). What is the WGS file for? Which on of these should I use to assemble the genome?
                Just a guess. lab = S288c, ricewine = N85 (or YHJ7?).

                What is in the WGS contigs file? Just fasta sequence?

                Comment


                • #23
                  I contacted one of the authors and it turns out they didn't upload the raw reads for the genome sequencing, only the RNA-Seq ones. Thanks all.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Advancing Precision Medicine for Rare Diseases in Children
                    by seqadmin




                    Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                    12-16-2024, 07:57 AM
                  • seqadmin
                    Recent Advances in Sequencing Technologies
                    by seqadmin



                    Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                    Long-Read Sequencing
                    Long-read sequencing has seen remarkable advancements,...
                    12-02-2024, 01:49 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 12-17-2024, 10:28 AM
                  0 responses
                  26 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 12-13-2024, 08:24 AM
                  0 responses
                  42 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 12-12-2024, 07:41 AM
                  0 responses
                  28 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 12-11-2024, 07:45 AM
                  0 responses
                  42 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X