Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Multiple files with same barcode in TCGA

    Does anyone know what differentiates the bam files from samples with the same barcode? in TCGA For example, this sample (TCGA-D7-8575-01A-11D-2340-08) has files *.5.bam and *.1.bam. What is the difference?


    C440.TCGA-D7-8575-01A-11D-2340-08.5.bam
    C1646.TCGA-D7-8575-01A-11D-2340-08.1.bam

  • #2
    It's some kind of version number; they appear to be exclusively associated with "BI", broad institute.
    It's not clear whether newer ones are replacing the previous version or if they are a separate run and are therefore additional information.

    Comment


    • #3
      Not an answer for your question but see this link: https://browser.cghub.ucsc.edu/searc...1A-11D-2340-08 If you scroll over to the right in the table you can see that there are 2 live files. If you click on the "i" symbol you can get additional information (choose "show details in new window"). These may be two separate runs for the sample looking at the submission date.

      Comment


      • #4
        The same aliquot may have been sequenced multiple times and/or processed multiple times:

        Some aliquots have been sequenced for VALIDATION, WGS, and WXS
        Some aliquots have been captured with different probe sets.

        Some data may have been aligned to NCBI36, GRCh37, and GRCh38.

        Some data have had additional reads added or removed after the version used in a publication was generated so both versions are live.

        You can query CGHub for these details:

        library_type
        assembly
        published
        uploaded
        modified
        state
        reason
        reagent_vendor
        reagent_name

        or pull them from columns 1,8,13,23-26,31-33 of the MANIFEST at

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin


          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
          Yesterday, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        55 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        51 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        45 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        55 views
        0 likes
        Last Post seqadmin  
        Working...
        X