Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Zimbobo
    Member
    • Mar 2010
    • 25

    RepeatMasker & RepeatScout

    Hello there,

    I was wondering whether anybody on this list could knows how to run RepeatScout (1.0.5) and RepeatMasker (3.2.8).

    Basically I have a new genome, and want to use RepeatScout to make a
    library for RepeatMasker.

    Here is what I do:

    build_lmer_table -sequence genome.fa -freq genome.fq
    RepeatScout -sequence genome.fa -output repeats.fa -freq genome.fq
    filter-stage-1.prl repeats.fa &> repeats.fa.filter_1
    RepeatMasker genome.fa -e abblast -lib repeats.fa.filter_1

    Now:
    Do I use the correct file for -lib?
    RepeatMasker is still complaining about not finding Libraries/RepeatMasker.lib
    and Libraries/RepeatmaskerLib.embl.

    Thanks a lot in advance for any help.
  • darked89
    Member
    • Jun 2009
    • 38

    #2
    Here is a recipe how to install and run RepeatScout:



    Hope it helps,

    Darek

    Comment

    • Zimbobo
      Member
      • Mar 2010
      • 25

      #3
      I actually followed those instructions.

      RepeatMasker is complaining still about missing libraries (ie Libraries/RepeatMasker.lib etc) and advises to get something from www.girinst.org.

      The whole point of running RepeatScout for me is to build my own library. Is there a flag to teach RepeatMasker not to look for those libraries or is there a reason RepeatMasker must have those libraries?

      Comment

      • edge
        Senior Member
        • Sep 2009
        • 199

        #4
        Hi Zimbobo,

        Can I ask you how you edit the perl script, filter-stage-1.prl to allow it point to the TRF path that we install?
        Which line of filter-stage-1.prl that we need to edit the path of TRF?
        My server keep on shown the below message:
        "No such file or directory at ./filter-stage-1.prl line 110"
        Thanks a lot for your sharing and guiding.

        Comment

        • hkuntal
          Junior Member
          • Oct 2011
          • 1

          #5
          repeatscout

          hello everyone
          i m try to work with repeatscout but every time when i m runninf filter-stage-1.prl, the filtered library generated is created empty( no data)..... any solution???

          Comment

          • sameoldmike
            Junior Member
            • Oct 2011
            • 1

            #6
            same thing happened to me -- the filtered output file is empty after running for a very long time. it was run on a repeat-rich genome.

            could it be that i don't have nseg and TRF properly installed? there is no output about those two programs that i can see...

            Comment

            • WhatsOEver
              Senior Member
              • Apr 2012
              • 215

              #7
              I don't know if its still an actual problem, but I had it too and was able to solve it on my system (ubuntu11, 64bit).
              The libs RepeatMasker is looking for are not the downloaded ones, but the blast dbs that should have been created by rmblast. rmblast itself is looking for a libpcre.so.0 file which it could not find on my system. The file is known to cause problems with some progs as symlinks are not made correctly during updates.
              Therefore I just created symlinks manually in my /lib/ and /lib32/ folder to the actual file (so just type "sudo ln -s /lib/libpcre.so.3 /lib/libpcre.so.0" and "sudo ln -s /lib32/libpcre.so.3 /lib32/libpcre.so.0") and afterwards everything worked fine for me

              @edge: you don't need to change anything in the .prl file, but you need to rename the trf404-linux64 (or else) executable to simply to trf.
              Last edited by WhatsOEver; 04-20-2012, 01:27 AM.

              Comment

              • gprakhar
                Member
                • Aug 2010
                • 78

                #8
                Hello,

                This is not a direct answer to your question, but there is a tool from the Repeat Masker group.
                Its called Repeat Modeler, this tool integrates Repeat Scout, RECON and TRF.
                It creates a de-novo repeat library and then annotates the sequences.
                Repeat Modeler

                --
                pg

                Comment

                • WhatsOEver
                  Senior Member
                  • Apr 2012
                  • 215

                  #9
                  Thats true and it works fine, but RepeatModeler also uses RepeatMasker and eventually the rmblast package, so you might have to face the same problems as described before.

                  Comment

                  • rahularjun86
                    Member
                    • Jan 2011
                    • 58

                    #10
                    Dear All,
                    I ran Repeatscout successfully, Commands I used:
                    Code:
                    1225 ##RepeatSout Run
                    1226 #step1
                    1227 build_lmer_table -l 14 -sequence Final_assembly.fasta -freq Final_assembly.freq
                    1228 #step2
                    1229 RepeatScout -sequence Final_assembly.fasta -output Final_assembly_repeats.fasta -freq Final_assembly.freq -l 14
                    1230 #step3
                    1231 cat Final_assemblyf_repeats.fasta | filter-stage-1.prl > Final_assembly_repeats_filtered_stg1.fasta
                    1232 #step4
                    1233 RepeatMasker -pa 20 -s -lib Final_assembly_repeats_filtered_stg1.fasta Final_assembly.fasta &
                    1234 #step5
                    1235 cat Final_assembly_repeats_filtered_stg1.fasta | filter-stage-2.prl --cat=Final_assembly.fasta.out --thresh=3 > Final_assembly_repeats_filtered_stg2_thresh3.fasta
                    1236 #step6
                    1237 RepeatMasker -pa 20 -s -lib Final_assembly_repeats_filtered_stg2_thresh3.fasta Final_assembly.fasta &
                    Rahul Sharma,
                    Ph.D
                    Frankfurt am Main, Germany

                    Comment

                    • tnguyen
                      Junior Member
                      • Sep 2011
                      • 6

                      #11
                      Hi Rahul,

                      How large was your genome? How much memory was needed for your run? I received this error message at the start of Step 2:

                      "Could not allocate space for sequence"
                      Last edited by tnguyen; 09-22-2012, 07:01 AM.

                      Comment

                      • tnguyen
                        Junior Member
                        • Sep 2011
                        • 6

                        #12
                        Sorry the full error message was:

                        "Could not allocate space for sequence"
                        Last edited by tnguyen; 09-22-2012, 07:02 AM.

                        Comment

                        • rahularjun86
                          Member
                          • Jan 2011
                          • 58

                          #13
                          Hi tnguyen,
                          sorry for replying late. Genome was of ~20Mb and other one was in Gb's. Actually I ran on the cluster and I did'nt check the memory it used.
                          Best wishes,
                          Rahul
                          Rahul Sharma,
                          Ph.D
                          Frankfurt am Main, Germany

                          Comment

                          • tnguyen
                            Junior Member
                            • Sep 2011
                            • 6

                            #14
                            Thank you Rahul,
                            My genome size is ~1.7Gb, any idea how to make RepeatScout to work for large genome?
                            TN

                            Comment

                            • mike.t
                              Member
                              • Mar 2010
                              • 36

                              #15
                              You probably don't need to use the whole genome for RepeatScout. Just use a few chromosomes or supercontigs. If repeats are distributed across all the chromosomes in the genome, scanning just a few of them with RepeatScout should be enough to find then and create consensus sequences that you can input to RepeatMasker. Then, mask the whole genome with RepeatMasker.

                              Comment

                              Latest Articles

                              Collapse

                              • SEQadmin2
                                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                                by SEQadmin2


                                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                                Here are nine questions we think about, in roughly the order they matter, before...
                                Yesterday, 07:11 AM
                              • SEQadmin2
                                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                by SEQadmin2


                                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                ...
                                06-02-2026, 10:05 AM
                              • SEQadmin2
                                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                by SEQadmin2


                                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                Introduction

                                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                05-22-2026, 06:42 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by SEQadmin2, 06-17-2026, 06:09 AM
                              0 responses
                              16 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-09-2026, 11:58 AM
                              0 responses
                              37 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-05-2026, 10:09 AM
                              0 responses
                              43 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-04-2026, 08:59 AM
                              0 responses
                              49 views
                              0 reactions
                              Last Post SEQadmin2  
                              Working...