Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • question about SRMA

    i am a freshman in this field, i am trying to use SRMA to realign my data, but i don't know how to generate a ranges file for SRMA, it is a txt file, the content is like:
    chr1 1889866 1890066
    chr1 12561395 12561595
    chr1 34999494 34999694
    chr1 43681831 43682031
    chr1 74810345 74810545
    chr1 74810352 74810552
    chr1 89245929 89246129
    chr1 143585219 143585419
    chr1 144037256 144037456
    chr1 150462252 150462452
    chr1 156418029 156418229
    chr1 169823406 169823606
    chr1 227839825 227840025

    any body any suggest?i will appreciate your help!

  • #2
    Originally posted by rlh View Post
    i am a freshman in this field, i am trying to use SRMA to realign my data, but i don't know how to generate a ranges file for SRMA, it is a txt file, the content is like:
    chr1 1889866 1890066
    chr1 12561395 12561595
    chr1 34999494 34999694
    chr1 43681831 43682031
    chr1 74810345 74810545
    chr1 74810352 74810552
    chr1 89245929 89246129
    chr1 143585219 143585419
    chr1 144037256 144037456
    chr1 150462252 150462452
    chr1 156418029 156418229
    chr1 169823406 169823606
    chr1 227839825 227840025

    any body any suggest?i will appreciate your help!
    That looks fine, what was the error?

    Comment


    • #3
      Originally posted by nilshomer View Post
      That looks fine, what was the error?
      perhaps i didn't say clearly, i just wanna know how to generate that RANGES file. can you offer some advices?

      Comment


      • #4
        Oh, well if you have regions that you are interested in, use a ranges file. Otherwise, you can run it without the ranges file, which will re-align your entire dataset.

        Comment


        • #5
          Originally posted by nilshomer View Post
          Oh, well if you have regions that you are interested in, use a ranges file. Otherwise, you can run it without the ranges file, which will re-align your entire dataset.
          thankyou for your reply! my database is 56G, if i run re-align in the entire database, the memory it needs is extreme, so i wanna run it with RANGES file.
          is there any tools can help me generate RANGES file from my database, if i write it by myself will cost such a long time.

          Comment


          • #6
            Originally posted by rlh View Post
            thankyou for your reply! my database is 56G, if i run re-align in the entire database, the memory it needs is extreme, so i wanna run it with RANGES file.
            is there any tools can help me generate RANGES file from my database, if i write it by myself will cost such a long time.
            It doesn't load the entire reference and reads into memory, it loads them in as needed. I assume by database, you mean reference.

            Comment


            • #7
              Originally posted by nilshomer View Post
              It doesn't load the entire reference and reads into memory, it loads them in as needed. I assume by database, you mean reference.
              i run a 5G testing data, 64* coverage, -Xmx8G, it print "Error: java heap space", and the records processing stop at about 11000000, someone suggest me to do it with RANGES file.

              Comment


              • #8
                A Nils (who wrote the program, so he might understand it :-) said, if you want to speed things up you can use the ranges file to specify specific regions of interest. You, and only you, can decide what you find interesting.

                Probably you want to pick a bunch of genes of interest & then extract the columns for them from the annotation file off UCSC for the genome you are interested in. That's pretty trivial with Perl/Python/Java/whathaveyou (or even with grep & cut in UNIX) & the time you invest now to learn how to do it will pay off many times over.

                Comment


                • #9
                  Originally posted by krobison View Post
                  A Nils (who wrote the program, so he might understand it :-) said, if you want to speed things up you can use the ranges file to specify specific regions of interest. You, and only you, can decide what you find interesting.

                  Probably you want to pick a bunch of genes of interest & then extract the columns for them from the annotation file off UCSC for the genome you are interested in. That's pretty trivial with Perl/Python/Java/whathaveyou (or even with grep & cut in UNIX) & the time you invest now to learn how to do it will pay off many times over.
                  thank you! i understand it now, i am really appreciate!

                  Comment


                  • #10
                    i understand what you mean now. thanks very much!

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Current Approaches to Protein Sequencing
                      by seqadmin


                      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                      04-04-2024, 04:25 PM
                    • seqadmin
                      Strategies for Sequencing Challenging Samples
                      by seqadmin


                      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                      03-22-2024, 06:39 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 04-11-2024, 12:08 PM
                    0 responses
                    30 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 10:19 PM
                    0 responses
                    32 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 09:21 AM
                    0 responses
                    28 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-04-2024, 09:00 AM
                    0 responses
                    52 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X