Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • nearly identical sequence in Celera contigs

    Hi all,
    I have used Celera assembler to create a contig set for the liver fluke genome.
    Then I did a self-align on the contigs and found some very long,nearly identical sequence. Part of Blast m8 sorted result is below.
    Our group thought these sequences maybe artificial duplicates but no report about this case.
    I wonder how these sequences come about and how to deal with them?

    Best wishes!

    ctg120293203258 ctg120293209927 99.98 11912 2 0 3255 15166 56050 44139 0.0 2.360e+04
    ctg120293209927 ctg120293203258 99.98 11912 2 0 44139 56050 15166 3255 0.0 2.360e+04
    ctg120293194963 ctg120293210163 99.99 10009 1 0 57736 67744 16614 6606 0.0 1.983e+04
    ctg120293210163 ctg120293194963 99.99 10009 1 0 6606 16614 67744 57736 0.0 1.983e+04
    ctg120293192725 ctg120293199964 99.99 9405 1 0 1 9405 17264 26668 0.0 1.864e+04
    ctg120293199964 ctg120293192725 99.99 9405 1 0 17264 26668 1 9405 0.0 1.864e+04
    ctg120293192725 ctg120293192737 99.99 9404 1 0 2 9405 9404 1 0.0 1.863e+04
    ctg120293192737 ctg120293199964 100.00 9404 0 0 1 9404 26668 17265 0.0 1.864e+04
    ctg120293192737 ctg120293192725 99.99 9404 1 0 1 9404 9405 2 0.0 1.863e+04
    ctg120293199964 ctg120293192737 100.00 9404 0 0 17265 26668 9404 1 0.0 1.864e+04
    ctg120293192725 ctg120293199964 99.99 9403 1 0 10337 19739 26666 17264 0.0 1.863e+04
    ctg120293199964 ctg120293192725 99.99 9403 1 0 17264 26666 19739 10337 0.0 1.863e+04
    ctg120293192725 ctg120293192737 99.99 9402 1 0 10337 19738 3 9404 0.0 1.863e+04
    ctg120293192737 ctg120293192725 99.99 9402 1 0 3 9404 10337 19738 0.0 1.863e+04

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin


    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
    Yesterday, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
39 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
41 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
35 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
55 views
0 likes
Last Post seqadmin  
Working...
X