Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Fastqc report

    Hi,

    I'm having a problem with one of my sample!
    I analysed it with fastqc and it shown the Quality score between 30 and 40 (I attached the pic), but when i looked the kmer (attacehd pics) it looked awful! I don't understand what it is and why it shows a good quality if my sequences are not good at all!! The analysis is terribile: I'm not able to identify SNP/indel and the coverage is low!! Do you know what it's going on with this sample?? And do you know if there is a way to clean or to restore it??

    Thanks!!

    M
    Attached Files

  • #2
    The run was performed with Illumina Hi-Seq 2000, with a multiplexed assay: I run twelve samples in one lane. The data from the other samples are fine, the problem is only for this one! I forgot to mention that it's paired end and the pics attached are only for the read number 1, but the read number 2 looks the same. How can I trim it? Do you think in galaxy can I find a tool? DO you think is it contamination or what?

    Thanks so much!!

    Comment


    • #3
      Did you rule out that possibility that your sample is not from the organism that you think it should be from? You ask, "DO you think is it contamination" ... that is the most plausible explanation. Take about 100 reads and blast them against 'nr' to see you you get.

      Comment


      • #4
        This looks like the random hexamer problem which comes up a lot on RNA-Seq data. Basically the random primers used to create RNA-Seq libraries aren't actually random and give you a Kmer bias at the front end of your reads. We see this in most RNA-Seq libraries to a greater or lesser extent. Yours is maybe a little worse than normal but if it maps OK then it's probably alright to keep using it. If it is this then there's no point trying to remove this bias, since the sequence you've got is probably correct but from a slightly biased subset of positions within your data.

        More details can be found in this paper.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 08:47 AM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        57 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        53 views
        0 likes
        Last Post seqadmin  
        Working...
        X