  • Understanding the "left" and "right" columns in NCBI GEO data

    I am trying to understand what the columns mean in a sequencing dataset that I downloaded from NCBI. The link to the page is http://www.ncbi.nlm.nih.gov/geo/quer...i?acc=GSE49372. I downloaded the "GSE49372_RAW.tar" file; after uncompressing it, each sample is summarized in a .txt file. I extracted a subset of the first sample, shown below:

    gene_id bundle_id chr left right FPKM FPKM_conf_lo FPKM_conf_hi status
    YAL069W 12733 I 334 649 0 0 0 OK
    YAL068W-A 12733 I 537 792 0 0 0 OK
    YAL068C 12734 I 1806 2169 2.22061 0 5.20096 OK
    YAL067W-A 12735 I 2479 2707 0 0 0 OK
    YAL067C 12736 I 7234 9016 44.7682 31.3864 58.15 OK
    YAL066W 12737 I 10090 10399 0 0 0 OK

    My reading is that the "left" and "right" columns correspond to read counts (is that correct?). I want to obtain a read count matrix with genes as rows and samples as columns (the kind of input we analyze in edgeR or DESeq), but I don't know how to summarize the "left" and "right" columns for this sample. I would appreciate any suggestions.

  • #2
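    I suspect that those are coordinates, which would normally be labeled start/end or something like that. Note that genes with an FPKM of 0 still have nonzero "left" and "right" values, which would not make sense for read counts. The file only gives FPKM values, so it's likely that you'll have to do the actual counting yourself.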
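
A quick way to sanity-check the coordinate interpretation is to parse the excerpt from the first post and look at how "left" and "right" behave. The sketch below is only an illustration: the names `SAMPLE` and `parse_rows` are made up for this example, and it assumes the columns are whitespace-separated (the real file may be tab-delimited, which `str.split()` also handles). It checks that left is always smaller than right, that left increases along the chromosome, and that zero-FPKM genes still have nonzero left/right values, all of which fit coordinates better than counts.

```python
# Excerpt copied from the first post; column layout assumed whitespace-separated.
SAMPLE = """\
gene_id bundle_id chr left right FPKM FPKM_conf_lo FPKM_conf_hi status
YAL069W 12733 I 334 649 0 0 0 OK
YAL068W-A 12733 I 537 792 0 0 0 OK
YAL068C 12734 I 1806 2169 2.22061 0 5.20096 OK
YAL067W-A 12735 I 2479 2707 0 0 0 OK
YAL067C 12736 I 7234 9016 44.7682 31.3864 58.15 OK
YAL066W 12737 I 10090 10399 0 0 0 OK
"""


def parse_rows(text):
    """Parse the table into a list of dicts and add a 'span' field (right - left)."""
    lines = text.strip().splitlines()
    header = lines[0].split()
    rows = []
    for line in lines[1:]:
        rec = dict(zip(header, line.split()))
        rec["left"] = int(rec["left"])
        rec["right"] = int(rec["right"])
        rec["FPKM"] = float(rec["FPKM"])
        # If left/right are coordinates, this is the length of the interval in bp.
        rec["span"] = rec["right"] - rec["left"]
        rows.append(rec)
    return rows


if __name__ == "__main__":
    rows = parse_rows(SAMPLE)

    # Coordinates: left < right for every gene.
    print("left < right everywhere:", all(r["left"] < r["right"] for r in rows))

    # Coordinates: positions increase along the chromosome in this excerpt.
    lefts = [r["left"] for r in rows]
    print("left sorted ascending:", lefts == sorted(lefts))

    # Counts would be 0 for unexpressed genes; these values are not.
    zero_fpkm = [r["gene_id"] for r in rows if r["FPKM"] == 0 and r["left"] > 0]
    print("FPKM == 0 but left > 0:", zero_fpkm)

    for r in rows:
        print(r["gene_id"], r["chr"], r["left"], r["right"], "span:", r["span"])
```

If the checks hold on the full file as well, the table carries only positions and FPKM estimates, so a raw count matrix for edgeR or DESeq would have to come from counting reads against an annotation, as suggested above.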
