Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Multi-Genome Alignment for QC...

    In a previous post on our HiSeq I mentioned that we were running a multi-genome alignment (MGA) as a QC tool. Comments made me think it would be an interesting topic to post in the Bioinformatics section, not one I usually post in!

    The work for this was done by Matt Edlridge, our head of bioinformatics. Big thanks to him for doing it!
    1. The MGA takes a sample of sequence reads from a lane and aligns the first 36bp using Bowtie. The sampling allows the MGA to run fast and this is part of our normal data pipeline, we get to see the report in our LIMs alongside the Gerald report (which I think we will soon be ditching entirely).
    2. Of course reads can align to multiple genomes (conserved regions). If this happens we assign the read to the genome with most reads. This approach should show up cases of genome contamination and maximise the difference between first and second genomes in the list.
    3. We also use Exonerate to identify sequences containing Illumina adapters.


    Currently we run against: Human, Mouse, Rat, Xenopus, Arabidopsis, C.elegans, Yeast, Bacteria and Viruses (the last two being amalgamations of >1500 genomes each). There are other genomes as well which are specific to the work for projects in our lab, I guess at some level it would be possible to run against all genomes?

    The output is a descending list of genomes with the highest number of aligned reads expressed as a percentage. Hopefully the genome the user was expecting! We did have a case about three years ago where one user accidentally sequenced a genome to 80x coverage of an organism that was also growing in his lab. It took a little time to work out what was wrong with his experiment and I believe the data was handed over to that community. Serendipity at its best!
    There are often un-aligned reads and the assumption initially was that these were junk low quality reads. Running this kind of aligner might allow us to see if that assumption is true but we have not looked at this at this time.

    The reason I wanted this MGA in our pipeline was to see what amount of PhiX was in lanes where we had not actually put it. The assumption was that any sloppy practices in a lab where all flowcells are set up would be obvious in this instance. It was immediately clear that the level of PhiX ‘contamination’ from lane to lane was very low. We identified two or three flowcells where there was a potential issue but this was out of over many hundred. We were also able to get run reports and data from anther large centre nearby and they had similar results. All in all I was very happy with the low contamination from lane to lane and am very happy that the protocols are reasonably robust.

    PhiX must be being breathed in as aerosols in labs the word over, might we get some Cronenberg style PhiX-Human hybrid. Let me know if you see one...

    Let me know what you think.

Latest Articles

Collapse

  • seqadmin
    Strategies for Sequencing Challenging Samples
    by seqadmin


    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
    03-22-2024, 06:39 AM
  • seqadmin
    Techniques and Challenges in Conservation Genomics
    by seqadmin



    The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

    Avian Conservation
    Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
    03-08-2024, 10:41 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 06:37 PM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, Yesterday, 06:07 PM
0 responses
9 views
0 likes
Last Post seqadmin  
Started by seqadmin, 03-22-2024, 10:03 AM
0 responses
51 views
0 likes
Last Post seqadmin  
Started by seqadmin, 03-21-2024, 07:32 AM
0 responses
67 views
0 likes
Last Post seqadmin  
Working...
X