Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • pipeline to construct alt contigs from reference contigs?

    Dear all,

    I am working with a genome with very low heterozygosity. I suspect that this monoploid genome (which only includes reference contigs in its fasta file) is actually tetraploid (multiple independent analyses of SNPs confirm this suspicion).

    As a first step towards testing whether different alleles of the same gene have different expression levels, I would like to map my RNA-seq reads to the reference contigs, and also to a separate set of "alt" contigs that are identical to the reference contigs, but which include the SNP variants. The idea is to separate, into two separate Bam files, the RNA reads exactly matching the reference genome from the RNA reads exactly matching the alternative SNPs.

    Is there a straightforward way to take one's fasta file for the reference contigs, along with a BAM file (made from mapping Illumina DNA reads back to the reference contigs), to call the SNPs with some stringent filtering, and then to generate a new fasta file for "alt contigs" (the contigs with SNP variants)? Although this would give no idea as to the phasing, I think it would be okay in the case of my genome for purposes of looking at differential RNA-seq, because the SNPs are pretty far apart, and generally there are only two alleles per gene.

    Thank you for any suggestions you may have.

    Best regards,

    TylerDodgeball

  • #2
    I have found this utility, which seems to have the same functionality, without the need to make the alt contigs:

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 08:47 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    59 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    54 views
    0 likes
    Last Post seqadmin  
    Working...
    X