
  • Computing time evaluate

    Hi all,

    I am doing RNA-seq alignment with STAR. My workstation is dual core (E5-2620v3), 12 threads in total, with 64 GB RAM. I am running 9 mouse RNA-seq samples (1 GB/sample) in parallel. Two days have gone by and there is still no result. There is no error; the log just says that mapping has started. Can anyone with similar experience tell me how I can make it faster, or whether something is wrong?

    Below is my command:
    /STAR-STAR_2.4.2a/bin/Linux_x86_64/STAR --outFilterIntronMotifs RemoveNoncanonicalUnannotated --runThreadN 2 --outTmpDir /pathdir/ --outSAMtype BAM SortedByCoordinate --genomeDir /pathfordatabaseformus/ --readFilesIn XXX.clean.fq.gz --readFilesCommand zcat

    Thanks to everyone who offers help!

  • #2
    Hi @cindylanzao

    You can check Log.progress.out for the current mapping speed and progress.
    Are you trying to map all 9 samples at the same time with 2 threads for each job? If so, you need to use the shared memory option, e.g. --genomeLoad LoadAndKeep. Without shared memory, each job loads its own copy of the genome into RAM, which requires ~9*25 GB (about 225 GB, far more than your 64 GB).
    A simpler option would be to map one sample at a time with 12 threads.

    Cheers
    Alex
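
The "one sample at a time with 12 threads" suggestion can be sketched as a small bash loop. This is only an illustration under assumptions: STAR_BIN, GENOME_DIR, the `.clean.fq.gz` naming, and the use of --outFileNamePrefix to separate outputs are placeholders that are not part of the original command, and --outTmpDir is left out so that each run falls back to its default temporary directory. By default the script only prints the commands (DRY_RUN=1).

```shell
#!/usr/bin/env bash
# Sketch: map samples one after another, giving each STAR run all 12 threads.
# STAR_BIN, GENOME_DIR and the file naming are placeholders, not from the thread.
STAR_BIN="${STAR_BIN:-STAR}"
GENOME_DIR="${GENOME_DIR:-/pathfordatabaseformus/}"
THREADS="${THREADS:-12}"

# Build the STAR command for one gzipped FASTQ file, print it, and run it
# unless DRY_RUN is 1 (the default here, so nothing runs by accident).
map_one() {
    local fq="$1"
    local sample
    sample="$(basename "$fq" .clean.fq.gz)"
    # The per-sample prefix keeps each sample's output files separate.
    local cmd=(
        "$STAR_BIN"
        --runThreadN "$THREADS"
        --genomeDir "$GENOME_DIR"
        --readFilesIn "$fq"
        --readFilesCommand zcat
        --outFilterIntronMotifs RemoveNoncanonicalUnannotated
        --outSAMtype BAM SortedByCoordinate
        --outFileNamePrefix "${sample}_"
    )
    echo "${cmd[*]}"
    if [ "${DRY_RUN:-1}" != "1" ]; then
        "${cmd[@]}"
    fi
}

# Process every file given on the command line, stopping at the first failure.
for fq in "$@"; do
    map_one "$fq" || exit 1
done
```

Saved under a hypothetical name such as map_samples.sh, it could be previewed with `bash map_samples.sh *.clean.fq.gz` and actually run with `DRY_RUN=0 bash map_samples.sh *.clean.fq.gz`. Because only one job runs at a time, only one copy of the genome is in RAM at any moment.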



    • #3
      Thanks for your reply

      Thanks for your kind reply, Alex. I have checked Log.progress.out: after running a single sample with 12 threads for a whole night, it contains nothing but the header line. Do you have any other suggestions? Thanks!



      • #4
        Hi @cindylanzao,

        Please post or email me the Log.out file from the failed single-sample, 12-thread run.

        Cheers
        Alex



        • #5
          Thanks for your help, but the project is urgent, so I have decided to switch to TopHat as the aligner and have deleted all the related STAR output files.
