
  • get mate pair info from xml file

    Hello, I'm trying to assemble a genome that was sequenced entirely with Sanger reads, using Celera Assembler.

    I have the fasta, fasta.qual, anc, clip and xml files from NCBI Trace.

    The xml files contain the mate pair info, but when I run the assembler, it skips this info.

    From the final .qc file:

    [Mates]
    ReadsWithNoMate=691348(100.00%)
    ReadsWithGoodMate=0(0.00%)
    ReadsWithBadShortMate=0(0.00%)
    ReadsWithBadLongMate=0(0.00%)
    ReadsWithSameOrientMate=0(0.00%)
    ReadsWithOuttieMate=0(0.00%)
    ReadsWithBothChaffMate=0(0.00%)
    ReadsWithChaffMate=0(0.00%)
    ReadsWithBothDegenMate=0(0.00%)
    ReadsWithDegenMate=0(0.00%)
    ReadsWithBothSurrMate=0(0.00%)
    ReadsWithSurrogateMate=0(0.00%)
    ReadsWithDiffScafMate=0(0.00%)
    ReadsWithUnassignedMate=0(0.00%)
    TotalScaffoldLinks=0
    MeanScaffoldLinkWeight=0.00



    The xml file looks like this:


    <trace>
    <CENTER_NAME>JGI</CENTER_NAME>
    <CLIP_LEFT>0</CLIP_LEFT>
    <CLIP_RIGHT>690</CLIP_RIGHT>
    <SOURCE_TYPE>GENOMIC</SOURCE_TYPE>
    <SPECIES_CODE>XX</SPECIES_CODE>
    <SUBMISSION_TYPE>NEW</SUBMISSION_TYPE>
    <TEMPLATE_ID>PACH2306</TEMPLATE_ID>
    <TI>451441523</TI>
    <TRACE_DIRECTION>FORWARD</TRACE_DIRECTION>
    <TRACE_FORMAT>SCF</TRACE_FORMAT>
    <TRACE_NAME>PACH2306.x1</TRACE_NAME>
    <TRACE_TYPE_CODE>WGS</TRACE_TYPE_CODE>
    </trace>
    <trace>
    <CENTER_NAME>JGI</CENTER_NAME>
    <CLIP_LEFT>0</CLIP_LEFT>
    <CLIP_RIGHT>933</CLIP_RIGHT>
    <SOURCE_TYPE>GENOMIC</SOURCE_TYPE>
    <SPECIES_CODE>XX</SPECIES_CODE>
    <SUBMISSION_TYPE>NEW</SUBMISSION_TYPE>
    <TEMPLATE_ID>PACH2306</TEMPLATE_ID>
    <TI>451443045</TI>
    <TRACE_DIRECTION>REVERSE</TRACE_DIRECTION>
    <TRACE_FORMAT>SCF</TRACE_FORMAT>
    <TRACE_NAME>PACH2306.y1</TRACE_NAME>
    <TRACE_TYPE_CODE>WGS</TRACE_TYPE_CODE>
    </trace>
    <trace>
    <CENTER_NAME>JGI</CENTER_NAME>
    <CLIP_LEFT>0</CLIP_LEFT>
    <CLIP_RIGHT>888</CLIP_RIGHT>
    <SOURCE_TYPE>GENOMIC</SOURCE_TYPE>
    <SPECIES_CODE>XX</SPECIES_CODE>
    <SUBMISSION_TYPE>NEW</SUBMISSION_TYPE>
    <TEMPLATE_ID>ACBG121468</TEMPLATE_ID>
    <TI>452858124</TI>
    <TRACE_DIRECTION>FORWARD</TRACE_DIRECTION>
    <TRACE_FORMAT>SCF</TRACE_FORMAT>
    <TRACE_NAME>ACBG121468.b2</TRACE_NAME>
    <TRACE_TYPE_CODE>WGS</TRACE_TYPE_CODE>
    </trace>
    <trace>
    <CENTER_NAME>JGI</CENTER_NAME>
    <CLIP_LEFT>0</CLIP_LEFT>
    <CLIP_RIGHT>872</CLIP_RIGHT>
    <SOURCE_TYPE>GENOMIC</SOURCE_TYPE>
    <SPECIES_CODE>XX</SPECIES_CODE>
    <SUBMISSION_TYPE>NEW</SUBMISSION_TYPE>
    <TEMPLATE_ID>ACBG121468</TEMPLATE_ID>
    <TI>452859852</TI>
    <TRACE_DIRECTION>REVERSE</TRACE_DIRECTION>
    <TRACE_FORMAT>SCF</TRACE_FORMAT>
    <TRACE_NAME>ACBG121468.g2</TRACE_NAME>
    <TRACE_TYPE_CODE>WGS</TRACE_TYPE_CODE>
    </trace>
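
    In the records above, the two reads of a pair appear to share a TEMPLATE_ID and differ in TRACE_DIRECTION (for example PACH2306.x1 and PACH2306.y1). A minimal Python sketch of pulling those pairs out of the xml is shown here. It is an illustration only: the function name `mate_pairs` and the dummy `<traces>` wrapper are made up for this example, it assumes the text has no `<?xml ...?>` declaration, and it only lists the pairs; it does not write any assembler input.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def mate_pairs(xml_fragment):
    """Return (template_id, forward_read, reverse_read) tuples.

    Assumption: reads sharing a TEMPLATE_ID with opposite
    TRACE_DIRECTION values are mates.
    """
    # Wrap in a dummy root so a bare run of <trace> elements parses
    # as one document (assumes no XML declaration in the text).
    root = ET.fromstring("<traces>" + xml_fragment + "</traces>")

    by_template = defaultdict(dict)
    for trace in root.iter("trace"):
        template = trace.findtext("TEMPLATE_ID")
        direction = trace.findtext("TRACE_DIRECTION")
        name = trace.findtext("TRACE_NAME")
        if template and direction and name:
            # Keep the first read seen for each direction of a template.
            by_template[template].setdefault(direction, name)

    pairs = []
    for template, reads in by_template.items():
        if "FORWARD" in reads and "REVERSE" in reads:
            pairs.append((template, reads["FORWARD"], reads["REVERSE"]))
    return pairs


# Two records cut down from the listing above.
example = """
<trace>
<TEMPLATE_ID>PACH2306</TEMPLATE_ID>
<TRACE_DIRECTION>FORWARD</TRACE_DIRECTION>
<TRACE_NAME>PACH2306.x1</TRACE_NAME>
</trace>
<trace>
<TEMPLATE_ID>PACH2306</TEMPLATE_ID>
<TRACE_DIRECTION>REVERSE</TRACE_DIRECTION>
<TRACE_NAME>PACH2306.y1</TRACE_NAME>
</trace>
"""

print(mate_pairs(example))
# [('PACH2306', 'PACH2306.x1', 'PACH2306.y1')]
```

    Templates with only one read, or with two reads in the same direction, are left out of the result by this sketch.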



    Do I need to extract this info in some way and pass it to Celera Assembler?

    Or do I need a special flag so that Celera Assembler uses it?

    Thanks in advance,

    Cristian
