  • GA pipeline chastity/purity filters

    Does anyone have experience changing the default filtering parameters that the GA pipeline uses to remove low-quality reads?

    From the manual "The default filter is equivalent to:
    --smt-filter failed-chastity --smt-relation le --smt-threshold 1 --chastity-threshold 0.6 --pure-bases 25 "

    We have somewhat more data per sample than we need. We want to do SNP calling, and I'd like to have the highest quality data only as the SNP calling is sensitive to error rate.

    regards

    David

  • #2
    Where can I find the GA pipeline? Is it free to use? I want to remove low-quality reads from SRA data.


    • #3
      I would hesitate to change the chastity threshold from 0.6 unless I had taken a good look at the normalized intensity values of each channel. Illumina claims that 0.6 is quite a conservative cutoff. Changing the number of cycles over which chastity is evaluated should be relatively safe, though. How long are your reads? You could easily bump the value from 25 to maybe 35, perhaps even higher. Try running the pipeline with a few different settings and see how many reads you are left with and what the corresponding quality scores look like, until you are happy with the result.
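
To make the effect of the cycle count concrete, here is a rough Python sketch of what the default filter appears to do. It is not the pipeline's own code. It assumes chastity is the brightest channel intensity divided by the sum of the brightest and second-brightest, and that a read passes when at most `--smt-threshold` cycles within the first `--pure-bases` cycles fall below `--chastity-threshold`. The function names and the toy intensity values are made up for illustration.

```python
def chastity(intensities):
    """Assumed definition: brightest / (brightest + second brightest)."""
    top, second = sorted(intensities, reverse=True)[:2]
    total = top + second
    # Guard against zero or negative sums, which can occur after normalization.
    return top / total if total > 0 else 0.0


def passes_filter(read, chastity_threshold=0.6, pure_bases=25, max_failures=1):
    """Mimic the quoted defaults: at most `max_failures` cycles with
    chastity below the threshold within the first `pure_bases` cycles."""
    window = read[:pure_bases]
    failures = sum(1 for cycle in window if chastity(cycle) < chastity_threshold)
    return failures <= max_failures


def sweep_pure_bases(reads, windows=(25, 30, 35), **kwargs):
    """Count how many reads survive for each candidate window length."""
    return {
        w: sum(passes_filter(r, pure_bases=w, **kwargs) for r in reads)
        for w in windows
    }


# Toy per-cycle intensities for the four channels (hypothetical values).
CLEAN = (100.0, 5.0, 3.0, 2.0)   # chastity about 0.95
MIXED = (50.0, 45.0, 3.0, 2.0)   # chastity about 0.53, below 0.6

read_a = [CLEAN] * 40                               # clean throughout
read_b = [CLEAN] * 28 + [MIXED] * 2 + [CLEAN] * 10  # two mixed cycles at 29 and 30
read_c = [MIXED] + [CLEAN] * 39                     # one mixed cycle at the start

survivors = sweep_pure_bases([read_a, read_b, read_c])
print(survivors)
```

In this toy set, `read_b` passes with a 25-cycle window but is dropped once the window reaches 30, because its two mixed cycles then fall inside the evaluated range. On real data the same kind of sweep would be done by rerunning the pipeline with different `--pure-bases` values and comparing read counts and quality scores.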
