Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bfast localalign issues

    Hi, anyone has an idea what causes this error?
    Please, help!

    Thanks in advance.



    bfast localalign -f mature_converted.fa -m bfast.matches.s_1_1_sequence_clean.bmf > bfast.aligned.s_1_1_sequence_clean.baf
    ************************************************************
    Checking input parameters supplied by the user ...
    Validating fastaFileName mature_converted.fa.
    Validating matchFileNamebfast.matches.s_1_1_sequence_clean.bmf.
    **** Input arguments look good! *****
    ************************************************************
    ************************************************************
    Printing Program Parameters:
    programMode: [ExecuteProgram]
    fastaFileName: mature_converted.fa
    matchFileName: bfast.matches.s_1_1_sequence_clean.bmf
    matchFileNameOne: [Not Using]
    matchFileNameTwo: [Not Using]
    scoringMatrixFileName: [Not Using]
    ungapped: [Not Using]
    unconstrained: [Not Using]
    space: [NT Space]
    startReadNum: 1
    endReadNum: 2147483647
    offsetLength: 20
    maxNumMatches: 384
    avgMismatchQuality: 10
    numThreads: 1
    queueLength: 10000
    timing: [Not Using]
    ************************************************************
    ************************************************************
    Reading in reference genome from mature_converted.fa.nt.brg.
    In total read 17341 contigs for a total of 377003 bases
    ************************************************************
    ************************************************************
    Reading match file from bfast.matches.s_1_1_sequence_clean.bmf.
    ************************************************************
    Performing alignment...
    Reads processed: 0bfast: RGMatches.c:91: RGMatchesRead: Assertion `m->readNameLength < 4028' failed.
    Aborted

  • #2
    Looks like you may have some file corruption as the read name check should have been caught at the match step if it was real. Re-run the match process, and if that does not work, post to [email protected]

    Comment


    • #3
      Thanks, Nils!
      Matches were re-run, and everything went through smoothly for running localalign.

      -Yanming

      Comment


      • #4
        Originally posted by nilshomer View Post
        Looks like you may have some file corruption as the read name check should have been caught at the match step if it was real. Re-run the match process, and if that does not work, post to [email protected]
        I met the same issue. After rerun it, It still threw this exception. I tested the data with Bowtie, it worked. So, what is the read name check? and why it failed in this step? Thanks.

        Comment


        • #5
          Originally posted by xinwu View Post
          I met the same issue. After rerun it, It still threw this exception. I tested the data with Bowtie, it worked. So, what is the read name check? and why it failed in this step? Thanks.
          The read name cannot be longer than 4028 characters.

          Comment


          • #6
            Originally posted by nilshomer View Post
            The read name cannot be longer than 4028 characters.
            Thanks for the quick reply, Nils. But I am sure the name should not exceed
            4028 which is really a large number for a name. There must be something wrong somewhere.

            Comment


            • #7
              There definitely is. Could you try aligning subsets of your reads until you can identify the problem read? That is how I would begin to debug this problem.

              Comment


              • #8
                Originally posted by nilshomer View Post
                There definitely is. Could you try aligning subsets of your reads until you can identify the problem read? That is how I would begin to debug this problem.
                I've checked all read names, the max length of them is 37. I've tried a small set with 1000 reads, it worked fine. The original reads file is about 7G with ~130M reads (100bp). Does its size matter?

                Comment


                • #9
                  Originally posted by xinwu View Post
                  I've checked all read names, the max length of them is 37. I've tried a small set with 1000 reads, it worked fine. The original reads file is about 7G with ~130M reads (100bp). Does its size matter?
                  It shouldn't matter no. Try breaking it up into chunks of 1M reads, then processing them separately.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Essential Discoveries and Tools in Epitranscriptomics
                    by seqadmin




                    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                    04-22-2024, 07:01 AM
                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Yesterday, 08:47 AM
                  0 responses
                  13 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  60 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  60 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  54 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X