  • Hello and a Question: 50 or 100 bp reads?

    Greetings all,

    I'm a 'senior' grad student at UCB working on a maize genetics/epigenetics project. I've prepared a couple libraries that we are planning to have sequenced here on one of our campus facility's nice new HiSeq 2000 machines! Validating them right now by small scale cloning, but from the size of most of the inserts, it looks like they are exactly what we expected, so all systems are go.

    I'm quite new to this whole deep sequencing technique, but I'm very excited to start the learning process of how to analyze these data sets! On advice from this excellent post (http://seqanswers.com/forums/showthr...good+computers), which explains my situation exactly, I am slowly but surely working through the Unix and Perl for Biologists primer (http://korflab.ucdavis.edu/Unix_and_Perl/). Hopefully I'll have at least a novice understanding of programming by the time we get our reads.

    But more importantly, a question: Should I get 50 or 100 bp reads for these libraries?

    Here are some details and issues that we are dealing with:

    The libraries were prepared using the small RNA adapters, so they will have to be done with single reads. Our main goal is to compare the two libraries, which represent two biological samples (WT vs. mutant), quantitatively, so getting fairly deep coverage is important to our analysis. However, we are working with the highly repetitive maize genome, so we also want to maximize the number of reads we can unambiguously map to the genome. In fact, reads that contain repetitive sequence AND unique sequence (e.g., the insertion site of a transposon or other repeat into a unique genomic region) may be of particular interest, so capturing as many of these sites as possible would be super. I'm guessing that longer reads would help in this respect.

    From the Bioanalyzer traces for the libraries, it looks like the most *abundant* inserts are ~75 and ~56 bp, i.e., that's where the peaks are. The insert size range is ~30-230 bp though (I cut out between 100-300 bp on the gel). Does the range really matter here? What percentage of 75 and 56 bp-sized inserts can we expect out of all of the reads we get? And from the larger sized inserts that we capture, can we expect to get decent enough coverage to be able to compare the two libraries at a particular region?
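
    To make the read-length trade-off concrete, here is a rough sketch of the arithmetic, assuming single reads that start at the first base of the insert. The function name is hypothetical, and the insert sizes are just the approximate peak and range values quoted above, not measured read data. An insert shorter than the read length is covered completely and the read then runs into the 3' adapter; a longer insert is only partly sequenced.

```python
def summarize_read_coverage(insert_sizes, read_length):
    """For single-end reads, report how much of each insert gets sequenced.

    Assumes every read starts at the first base of the insert (hypothetical
    simplification; real libraries also vary in quality along the read).
    """
    summary = []
    for insert in insert_sizes:
        covered = min(insert, read_length)
        summary.append({
            "insert": insert,
            "bases_covered": covered,
            "fraction_covered": covered / insert,
            # True when the read is longer than the insert, so the remaining
            # cycles sequence the 3' adapter and need trimming.
            "reads_into_adapter": insert < read_length,
        })
    return summary


if __name__ == "__main__":
    # Approximate sizes from the Bioanalyzer description: range ends and peaks.
    sizes = [30, 56, 75, 230]
    for read_length in (50, 100):
        print(f"read length {read_length} bp")
        for row in summarize_read_coverage(sizes, read_length):
            print("  ", row)
```

    With these assumptions, 50 bp reads stop short of the end of the 56 and 75 bp inserts, while 100 bp reads span both completely and continue into the adapter.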

    I guess I would just automatically go with 100 bp reads, but I am wondering: from what people have seen, is coverage significantly reduced with an increase in read length?

    It looks like there are many programs out there which recognize and trim adapter sequences from Illumina reads, for the reads that sequence INTO the 3' adapters. So it seems like that wouldn't be TOO big of a problem.
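
    For illustration, the core idea behind 3' adapter trimming can be sketched in a few lines. This is a minimal exact-match version, not how any particular trimming program works; real tools also tolerate sequencing errors and use base qualities. The function name, the `min_overlap` parameter, and the adapter string below are all hypothetical placeholders, not the actual small RNA adapter sequence.

```python
def trim_3prime_adapter(read, adapter, min_overlap=5):
    """Remove a 3' adapter from a read using exact matching only.

    Hypothetical sketch: handles a full adapter anywhere in the read, or a
    partial adapter (at least `min_overlap` bases) at the very end of the read.
    """
    # Case 1: the whole adapter is present; keep everything before it.
    idx = read.find(adapter)
    if idx != -1:
        return read[:idx]

    # Case 2: the read ends partway through the adapter. Try the longest
    # possible adapter prefix first and shrink down to min_overlap.
    max_len = min(len(adapter) - 1, len(read))
    for overlap in range(max_len, min_overlap - 1, -1):
        if read.endswith(adapter[:overlap]):
            return read[:-overlap]

    # No adapter detected; return the read unchanged.
    return read


if __name__ == "__main__":
    adapter = "ACGTTGCAAGGC"   # placeholder sequence, NOT a real Illumina adapter
    insert = "TTTTGGGGCC"      # toy insert
    print(trim_3prime_adapter(insert + adapter + "AAAA", adapter))
    print(trim_3prime_adapter(insert + adapter[:7], adapter))
    print(trim_3prime_adapter(insert, adapter))
```

    The `min_overlap` threshold is a design choice: a short overlap of a few bases will match by chance fairly often, so requiring a longer match trades some untrimmed adapter remnants for fewer wrongly shortened reads.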

    Any advice/help on this would be much appreciated!
