Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • zslee
    Member
    • May 2009
    • 29

    about UCSC full length cDNA

    in one paper, they use downloaded full length cDNA sequences(from ucsc) to detect A-to-I rna editing by aligning to reference hg16, but when i want to repeat their work by downloading human mRNA sequences(i think this is the data they used), i find there's no difference between the mRNA sequences and hg16, why? did i use the wrong data?

    can anyone give me a hint
    thanks in advance
  • jiaco
    Member
    • May 2010
    • 35

    #2
    Is this really the first post in a thread or is my browser messed up? I hope that my reply here can actually be considered constructive, as intended.

    Originally posted by zslee View Post
    in one paper,
    what paper?

    Originally posted by zslee View Post
    by downloading human mRNA sequences
    from where? which file(s)

    Originally posted by zslee View Post
    (i think this is the data they used)
    This appears to be the crux of your problem, you do not know what data they used and therefore cannot download it, and cannot repeat their work.

    But again, what paper?

    Originally posted by zslee View Post
    why? did i use the wrong data?
    I honestly have no idea based on the info in your post.

    Comment

    • zslee
      Member
      • May 2009
      • 29

      #3
      yes, i should make it clear ~

      the paper i consider is
      "Widespread RNA editing of embedded alu elements in the human transcriptome"
      in the method part, they mentioned "We obtained all human and mouse full-length cDNA sequences from the UCSC Genome Browser database and aligned them against their reference genome sequences using BLAT"

      my problem is i don't know where to download the full length cDNA sequences from UCSC database

      thanks

      Comment

      • jiaco
        Member
        • May 2010
        • 35

        #4
        From http://genome.ucsc.edu/
        click Tables
        then select the following form entries

        Mammal : Human : Mar2006 (or the assembly you want)
        Genes and Gene Prediction Tracks : UCSC Genes
        knownGene
        region: genome

        outputformat: sequence
        set an output file name
        use gzip compression

        get output

        on the next page, you can select mRNA which will splice out introns and start the download of all ucsc known gene mRNA sequences in fasta format.

        (but this is really a question for UCSC genome browser, no?)

        Comment

        • steven
          Senior Member
          • Aug 2009
          • 269

          #5
          Originally posted by zslee View Post
          in one paper, they use downloaded full length cDNA sequences(from ucsc) to detect A-to-I rna editing by aligning to reference hg16, but when i want to repeat their work by downloading human mRNA sequences(i think this is the data they used), i find there's no difference between the mRNA sequences and hg16, why? did i use the wrong data?

          can anyone give me a hint
          thanks in advance
          Yep indeed, looks like if you go to "Tables" then select group=mRNAs and ESTs, track=human mRNAs, then table=all_mrna, the output=sequence sends you to "Human mRNAs *Genomic* Sequence", which i understand is a genomic extraction based on the positions from the alignments, which is obviously not what you want.
          To get the original transcript sequences, try with table=RefSeq Genes instead, you will be given the choice between genomic, protein and mRNA.
          I bet the later works for you.
          cheers,
          s.

          Comment

          • zslee
            Member
            • May 2009
            • 29

            #6
            Originally posted by jiaco View Post
            From http://genome.ucsc.edu/
            click Tables
            then select the following form entries

            Mammal : Human : Mar2006 (or the assembly you want)
            Genes and Gene Prediction Tracks : UCSC Genes
            knownGene
            region: genome

            outputformat: sequence
            set an output file name
            use gzip compression

            get output

            on the next page, you can select mRNA which will splice out introns and start the download of all ucsc known gene mRNA sequences in fasta format.

            (but this is really a question for UCSC genome browser, no?)
            yes, i've tried, it works, thanks to jiaco !
            and i'm not familiar with UCSC

            Comment

            • zslee
              Member
              • May 2009
              • 29

              #7
              Originally posted by steven View Post
              Yep indeed, looks like if you go to "Tables" then select group=mRNAs and ESTs, track=human mRNAs, then table=all_mrna, the output=sequence sends you to "Human mRNAs *Genomic* Sequence", which i understand is a genomic extraction based on the positions from the alignments, which is obviously not what you want.
              To get the original transcript sequences, try with table=RefSeq Genes instead, you will be given the choice between genomic, protein and mRNA.
              I bet the later works for you.
              cheers,
              s.
              yes, just as you mentioned, i need mRNA not genomic sequences,
              and in the paper i considered, they download data same as jiaco's suggestion
              thanks to steven !

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Pathogen Surveillance with Advanced Genomic Tools
                by seqadmin




                The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                03-24-2025, 11:48 AM
              • seqadmin
                New Genomics Tools and Methods Shared at AGBT 2025
                by seqadmin


                This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                The Headliner
                The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                03-03-2025, 01:39 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 03-20-2025, 05:03 AM
              0 responses
              49 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-19-2025, 07:27 AM
              0 responses
              57 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-18-2025, 12:50 PM
              0 responses
              50 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-03-2025, 01:15 PM
              0 responses
              201 views
              0 reactions
              Last Post seqadmin  
              Working...