Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • neetu
    Member
    • Oct 2011
    • 12

    RNA-seq analysis raw data format

    Hello,

    I am relatively new to this sequencing field. can anybody guide me with the basics and tell me what is the RNA-seq analysis raw data format. Is it .bed files or fastq files. How to find out what all files in GEO or array express is having RNA-seq data?
  • NicoBxl
    not just another member
    • Aug 2010
    • 264

    #2
    in general it's fastq format

    Comment

    • kopi-o
      Senior Member
      • Feb 2008
      • 319

      #3
      For GEO, you can find RNA-seq experiments by filtering on "series type":

      Gene Expression Omnibus (GEO) is a database repository of high throughput gene expression data and hybridization arrays, chips, microarrays.


      On ArrayExpress, go to



      and select "RNA assay" and "high-throughput sequencing" from two of the menus.

      Comment

      • neetu
        Member
        • Oct 2011
        • 12

        #4
        Thanks a lot for your reply..i will try searching..

        Comment

        • neetu
          Member
          • Oct 2011
          • 12

          #5
          i am actually having a problem, i opened this page from GEO
          NCBI's Gene Expression Omnibus (GEO) is a public archive and resource for gene expression data.

          this is an RNA-seq analysis for sure, if i need the data and i scroll down to the files attached to this page, i get .txt files and also files which say are for SRA study. now how should i take the data from this page? can the data be in .txt format also. also want to know where can we use the data from SRA study.

          Comment

          • ulz_peter
            Senior Member
            • Feb 2010
            • 219

            #6
            For introduction to RNA-seq see: http://seqanswers.com/wiki/How-to/RNASeq_analysis

            and: sometimes fastq files got the ending .txt as windows users won't recognize text files as such if they do not have this ending. They may be fastq files even with the .txt ending. (I haven't looked into these files, they may be something else as well)

            Comment

            • dpryan
              Devon Ryan
              • Jul 2011
              • 3478

              #7
              If you click on one of the samples (e.g. going to here) and look in the "Data processing" section, they mention the file type and where to find the actual specification for it. The SRA files could be converted to fastq format with the SRA toolkit. I should note, whether you actually want to redo the alignment yourself (i.e. downloading the SRA files, converting them to fastq, alignment with tophat or whatever) or directly use the prealigned files depends a bit on what your goals are.

              BTW, if you need the reads aligned to hg19 instead of hg18, you can google for the very useful "liftOver" tool.

              Comment

              • neetu
                Member
                • Oct 2011
                • 12

                #8
                thanks a ton for your reply dpryan and peter. your links are really helpful. one more doubt now arises is that is there a way by which we can get prealigned files. i till now presumed that we get only RAW data from GEO and AE, and we have to compulsorily align it to process further. Can we get SAM and BAM files also ?

                Comment

                • dpryan
                  Devon Ryan
                  • Jul 2011
                  • 3478

                  #9
                  Unfortunately I don't think there's a single answer to that question that applies to all datasets. I've used a number of datasets that provided prealigned BED or similar files, but that's certainly not the case with all of them.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                    Here are nine questions we think about, in roughly the order they matter, before...
                    06-18-2026, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, Today, 11:10 AM
                  0 responses
                  5 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  41 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  102 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  123 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...