Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to examine paired reads

    Hello, a beginner question, any help appreciated

    We sequenced an organism (Illumina Hiseq) and would like now to examine how the reads are getting paired, just as a quality control look at how far the paired end are from each other, how many pairs there are.

    Unfortunately, this organism does not have a reference genome, so initially our approach was to get a single gene, such as the rRNA gene fasta sequence for this organism, align as many of our sequenced reads as possible to that gene, and examine the distance between the pairs.

    Does this approach make sense? What program can we use to align our reads to only a single gene.

    Any other suggestions appreciated!

    Thanks in advance,
    Ramiro

  • #2
    I presume you are talking about an transcriptome project instead of a whole genome project thus avoiding the whole issue of intron/exons.

    I wouldn't use rRNA -- too many duplicates -- but if that is all you have then, sure, use it. And it would be useful to have more than one gene to look at just for good statistics. In general you approach is valid for a first pass. As for the program to use, there are many. I'd try one of the short read aligners such as BWA, Bowtie2, SNAP, etc.

    Also I suggest running FastQC to get a feel for the quality of the data.

    Comment


    • #3
      You are correct, it is a transcriptome project.

      Thank you very much for your feedback. The reason I was asking about what program to use, is that BWA, Bowtie2, etc seem to be meant for long reference sequences (such as genomes, chromosomes) and not for a small gene of just a few hundred nucleotides in length. However, I will try.

      I ran FastQC and the reads are of high quality.

      The situation, on the bigger picture, is that the reads are not assembling, and one possibility is that the "pairing" has shifted for some reason (that what are being called "pairs" are not really pairs). I don't think this is true (for various reasons) but would like to verify it somehow before we rule it out and have that as a QC step for the future.

      Comment


      • #4
        Certainly BWA, Bowtie2, SNAP are good for long references but they can also be used on short references. I use said programs to scan reads for matches to the rRNA database that we use. Said database has hundreds of short rRNA references but it could just as easily have one short reference. I also use the programs to scan my phiX database which has a single fairly short reference in it.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM
        • seqadmin
          Techniques and Challenges in Conservation Genomics
          by seqadmin



          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

          Avian Conservation
          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
          03-08-2024, 10:41 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 06:37 PM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 06:07 PM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-22-2024, 10:03 AM
        0 responses
        49 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-21-2024, 07:32 AM
        0 responses
        67 views
        0 likes
        Last Post seqadmin  
        Working...
        X