Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • A little help with cufflinks

    Hi!
    I hope these questions won't be to stupid as I'm pretty new in the Bioinformatics field. And if these questions have been answered please point me in the right direction as I haven't found it.
    So the project I'm working on needs expression values for transcripts, genes or proteins and I am suppose to look in to high throughput sequencing data. Mainly RNA-Seq in this case.

    So I have sett up bowtie->tophat->cufflinks to extract expression values for transcripts and i put in a fastq file in one end and get a couple of output files in the other.

    I use the basic commands
    Code:
    $tophat d_mel_genome_fb5_22/d_melanogaster_fb5_22 SRR034813.fastq
    and
    Code:
    $cufflinks ../tophat_out/accepted_hits.bam
    1) Is this a good enough approach? Is there some flags that are common to use to get better(more accurate) results? Is there something I generally should think about?

    2) When i get my output (genes.expr, transcripts.expr, transcripts.gtf) files. How do I map cufflinks internal gene ID to a real ID found at for example ensemble or other more general sources.

    3) When i find for example an already processed .bam file at the GEO-Database and try to run it through cufflinks it spits out mainly errors. I assume there is more then one type of .bam formats. How do i convert (if possible) to something cufflinks accepts.

    4) Does cufflinks accept other types of processed files like wiggle files or .bed files and convert to expression values for transcirpts? Is there any other programs that does or do these files not contain that type of information. The main thing here is if there is a way to cheat a couple of steps and lower the risk of making errors in the procedure.

    Any help is appreciated and I hope my neewbiness doesn't shine through all that much. Thanks

  • #2
    If you give the cufflinks command a gtf file, it will calculate FPKMs for genes and transcripts with the names from the gtf file. If you don't, it tries to define the transcriptome itself. You can also do cuffcompare with an annotation gtf file and the transcripts.gtf file generated by cufflinks to see how different they are.

    Comment


    • #3
      Originally posted by mgogol View Post
      If you give the cufflinks command a gtf file, it will calculate FPKMs for genes and transcripts with the names from the gtf file. If you don't, it tries to define the transcriptome itself. You can also do cuffcompare with an annotation gtf file and the transcripts.gtf file generated by cufflinks to see how different they are.
      Thanks for the info!

      On my 4th question... Is there a pipeline to do what I want there? I haven't been able to find it. But may be that someone here has also been interested in expression values for specific genes that has.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 08:47 AM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      60 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      57 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      53 views
      0 likes
      Last Post seqadmin  
      Working...
      X