Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Differential Expression Analysis

    I am working with novel RNAseq data from a type of grass whose genome has not yet been completely sequenced or annotated. I have a number of FASTQ files with RNAseq data from different parts of the plant and am trying to conduct a differential expression analysis of these files. I was planning to use the DEGseq package in R to conduct the analysis, but from what I understand, this requires me to map the reads to an index to ultimately convert them to the .bed format and I would also need a reference genome file in the ucsc refFlat format. Since this plant genome has not even been sequenced, these files are unavailable, so I thought to map the reads to the genome of brachypodium distachyon, which is a model organism for grasses. I was able to create an index through bowtie using the genome from phytozome, but I have not been able to find a reference file for brachypodium in the reFlat or GTF format. Is there any way to convert to or create a reference file in the GTF or refFlat format, and am I even on the right track to conduct differential expression analysis on these files?
    I also have access to the original RNA assembly data which came from an illumina HiSeq. I'm not sure if this would be helpful.

  • #2
    I don't know much about plants, but it sounds like you might want to try to build de novo transcriptome assembly - https://www.sciencedirect.com/scienc...14662817301032

    Comment


    • #3
      Use supertranscripts as your reference genome:

      Building SuperTranscripts: A linear representation of transcriptome data - Oshlack/Lace


      Trinity RNA-Seq de novo transcriptome assembly. Contribute to trinityrnaseq/trinityrnaseq development by creating an account on GitHub.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM
      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      22 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      17 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      49 views
      0 likes
      Last Post seqadmin  
      Working...
      X