Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • mirauta
    Junior Member
    • Jul 2010
    • 2

    RNA-seq data sample

    Hello,

    I need some RNA seq data & the coresponding genome annotation.
    I'm new to the business and am trying to develop a segmentation method.

    I searched the web but did not find untill this moment any good data.
    I don't need full genome but just >10MB sample.

    Thank you,

    Bogdan
  • Dethecor
    Member
    • May 2010
    • 24

    #2
    Short Read Archive

    Hi,

    you could try the Short Read Archive (http://www.ncbi.nlm.nih.gov/sra), although their datasets are normally in files of at least 500 Mb . . . you could just download some and then extract a reasonable subset of reads from the files (you might have to familiarize yourself a little bit with the formats beforehand).

    I don't exactly know what you are trying to do, but if your method is meant to work on real-life data you should not restrict yourself too much in the size (like the max. 10 Mb that you mentioned), because you might end up developing something that works well on toy examples but does not scale well for the amounts of data a real-life RNA-Seq experiment might produce.

    Cheers

    "You are only young once, but you can stay immature indefinitely."

    Comment

    • mirauta
      Junior Member
      • Jul 2010
      • 2

      #3
      Thanks, I'll try to get the data from the ncbi site.
      And thanks again for the advice you gave me. I want to use a small smaple just to allow my machine to work. For the moment it can't handle big data.

      Bogdan

      Comment

      • malachig
        Senior Member
        • Aug 2010
        • 117

        #4
        The point made by Dethecor is a good one. There is nothing particularly unique about the substance of next-gen sequencing data itself. What is unique and remarkable about it is the sheer volume of it. What is needed most by the field is the ability to rapidly process these huge data sets without excessive memory usage. Start working with large data sets right away. In addition to SRA you can also find RNA-Seq data in GEO. They have a better interface for searching their data archive (IMHO).

        For example ... if you go to the following link, select 'GEO DataSets' from the drop down menu, and use 'RNA-Seq' as your search term you currently get 78 RNA-Seq data sets deposited in GEO.
        PubMed® comprises more than 40 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full text content from PubMed Central and publisher web sites.


        All kinds of interesting test data sets to play around with in there...

        Comment

        Latest Articles

        Collapse

        • GATTACAT
          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by GATTACAT
          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
          07-01-2026, 11:43 AM
        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 07-02-2026, 11:08 AM
        0 responses
        12 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-30-2026, 05:37 AM
        0 responses
        14 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        20 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        54 views
        0 reactions
        Last Post SEQadmin2  
        Working...