Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Originally posted by ZhigangLi View Post
    I have a similar problem. I want to combine contigs/scaffolds assembled with different dataset, e.g. sanger,454 and solexa. I wanted to combine them based on Mummer alignments. However, it's so hand for me. The organism is 40M and the largest scaffold is 2M. Can I use these software to finish my job?
    Cap3 and Mummer have issues in scaling, besides cap3 is for ESTs anyway.

    Comment


    • #17
      Hi.

      I also have come across this issue. I have illumina PE data from an archaea and I now have two datasets were one is done with 100bp read-length and the other one with 150bp.

      Doing an assembly on a merged dataset doesn't seem as a good approach, since you can't set multiple k-mer lengths in velvet.

      Comment


      • #18
        What do you guys want to do exactly? Do you want to make a consensus of the assemblies, or do you want to extend one of the assemblies by other assemblies?

        You should be aware that you can merge repeated regions if they are at the boundaries of the contigs, and thus concatenate distant regions because of the repeat.

        Anyway, what you can do is break the assemblies into smaller pieces and do a new denovo. I have a perl script which breaks all assemblies in user-defined k-mers and tries to do a new de novo assembly based on the users 'coverage'. Say you have four assemblies with different k-mers, and you only want to extend a contig by a k-mer if it is supported by e.g. three assemblies.

        If you would like to have it, please contact me at [email protected]

        Regards,
        Boetsie

        Comment


        • #19
          Hi Seth,

          I got few question regarding CAP3 might need your advice.
          I'm currently facing the following problem when trying to form a single set of non-redundant unigenes by CAP3
          I have total of 8 *.fasta right now (RNA-seq scaffold sequence that extracted from same tissue but treated the sample with different condition for sequencing).
          I would like to use CAP3 to assemble all the unigenes from different samples (but same tissue just treated the sample with different condition for sequencing) to form a single set of non-redundant unigenes.

          Can I know what is the proper command I should apply when running CAP3 in order to form a single set of non-redundant unigenes of my RNA-seq data?
          All my 8 sample scaffold in fasta format which is assembled by third party assembler program, Illumina pair-end read, 2X50bp, insert size 200.

          This is all the info I have right now.
          Many thanks for any advice.

          Comment


          • #20
            Originally posted by kbushley View Post
            Hello Mike,

            I'm also trying to do this. I think CAP3 might be the best tool but am still exploring this...there is a guy in our department who's written a program using CAP3 to merge velvet and abyss assemblies. It might be of some use. Let me know if you've found any other solutions. You also are in the great state of Oregon...where are you located?
            Hi kbushley,

            I am also trying to merge six different k-mer assemblies from Abyss. Were you able to merge yours? Can you share the program the guy from your department wrote? Please let me know. I would really appreciate your help.

            Thank you,

            -Milo

            Comment


            • #21
              To merge multiple assemblies with different kmer lengths, I recommend using Dedupe.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 11:49 AM
              0 responses
              15 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-24-2024, 08:47 AM
              0 responses
              16 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              61 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              60 views
              0 likes
              Last Post seqadmin  
              Working...
              X