  • six huge read count outliers on neighboring transcripts on one lane - theories?

    This is a purely academic question, since these transcripts are not of particular interest to us (and they're only bad on one out of 71 lanes), but I'm curious what the theory may be for what happened to these reads.

    The setup is that we've run 71 Arabidopsis samples, one per lane, at Otogenetics, and they sent the data over to DNANexus for read mapping against TAIR9. I've downloaded the mapped read counts and used them in DESeq, edgeR, etc. The data is really good, quite consistent across the lanes for almost all transcripts.

    The exception is one lane, lane #30, which has six huge outliers in neighboring transcripts on chromosome 4:

    at4g12470 799 bp from Chr4:7,401,109..7,401,907
    at4g12480 833 bp from Chr4:7,406,105..7,406,937
    at4g12490 786 bp from Chr4:7,409,621..7,410,406
    at4g12500 778 bp from Chr4:7,414,150..7,414,927
    at4g12510 568 bp from Chr4:7,417,236..7,417,803
    at4g12520 683 bp from Chr4:7,421,056..7,421,738

    I've attached a plot below (SVG) that shows these six plus the neighboring two transcripts that look perfectly normal. If this were a microarray assay I'd say there was a scratch on the plate; but with RNA-Seq, any simple explanation?
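
    For readers who want to screen their own count tables for this kind of lane-specific spike, here is a minimal sketch, not the method used for the data in this thread. It assumes one list of mapped read counts per transcript (one value per lane) and flags lanes whose log2 count sits far from the transcript's median, measured in robust (median/MAD) units. The function name `flag_outliers`, the threshold of 5, and the demo counts are all made up for illustration.

```python
import math
from statistics import median

def flag_outliers(counts, threshold=5.0):
    """Return (transcript, lane_index, robust_z) for lane-specific outliers.

    counts: dict mapping transcript ID -> list of read counts, one per lane.
    threshold: hypothetical cutoff on the absolute robust z-score.
    """
    flagged = []
    for transcript, lane_counts in counts.items():
        # Work on log2(count + 1) so large counts do not dominate.
        logs = [math.log2(c + 1) for c in lane_counts]
        med = median(logs)
        # Median absolute deviation: a spread estimate that a single
        # extreme lane cannot inflate.
        mad = median(abs(x - med) for x in logs)
        if mad == 0:
            continue  # no spread across lanes; z-score is undefined
        for lane, x in enumerate(logs):
            z = 0.6745 * (x - med) / mad  # 0.6745 rescales MAD to ~1 SD
            if abs(z) > threshold:
                flagged.append((transcript, lane, z))
    return flagged

# Made-up counts for two hypothetical transcripts across 10 lanes.
demo = {
    "geneA": [100, 110, 95, 105, 98, 102, 5000, 101, 99, 104],
    "geneB": [50, 52, 48, 51, 49, 50, 53, 47, 50, 51],
}

for transcript, lane, z in flag_outliers(demo):
    print(f"{transcript}: lane index {lane} looks like an outlier (z = {z:.1f})")
```

    Lanes are reported by zero-based index. If several neighboring transcripts are flagged on the same lane, as in the case described here, that points to something local to that lane and region rather than to ordinary biological variation.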

    Last edited by samhokin; 11-10-2013, 08:33 AM.
    Sam Hokin
    Computational Scientist, Carnegie and NCGR

  • #2
    Data outlier

    This is Natalie from Otogenetics. I would like to discuss your data with you to see if there is anything on our end that could help. Unfortunately I cannot tell which sample has your outlying data. Could you contact us at [email protected] or give us a call? I would be happy to discuss the sample offline.
