  • How to see adapter contamination in Illumina reads

    I have an Illumina read file of bacterial DNA sequence. I assembled it with Geneious; during assembly I found vector contamination, which the software removed because I had enabled the trimming option, and I got 1,610 contigs.

    Now I am performing the same assembly with Velvet. According to my FastQC report, the sequence duplication level is flagged as bad, and overrepresented sequences and k-mer content show warnings (I have attached these three files). Based on the sequences listed under overrepresented sequences, I concluded that I have adapter contamination. I recognize GATCGGAAGAGC as adapter sequence because it appears in the adapter files that Illumina provides to customers.

    The problem is that my PI asked me to find that adapter sequence in my reads, and I was not able to. He then asked me why I can't find it. I am new to de novo assembly, I don't know what I am supposed to answer, and he gave me 1 hour to find it. Please help!

  • #2
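    Try:
    Code:
    $ grep -c 'GATCGGAAGAGC' reads.fastq
    $ wc -l < reads.fastq | awk '{print $1/4}'
    The first command counts the lines containing the adapter and the second gives the total number of reads; dividing the first number by the second gives an estimate of the contaminant ratio.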
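    If you want both steps in one go, here is a minimal sketch, assuming an uncompressed FASTQ with exactly four lines per record. The function name adapter_ratio is made up for illustration and is not part of any tool. It checks only the sequence line of each record, so a chance match in a header or quality line is not counted.

```shell
# adapter_ratio FILE [ADAPTER]  (hypothetical helper, not an existing command)
# Assumes a 4-line-per-record FASTQ; only the sequence line (line 2) is searched.
adapter_ratio() {
    awk -v adapter="${2:-GATCGGAAGAGC}" '
        NR % 4 == 2 { total++; if (index($0, adapter) > 0) hits++ }
        END {
            if (total > 0)
                printf "%d of %d reads (%.2f%%)\n", hits, total, 100 * hits / total
        }
    ' "$1"
}
```

    For example, adapter_ratio reads.fastq prints the number of reads with the adapter, the total number of reads and the percentage.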

    For adapter trimming, I suggest using skewer. In your case you don't need to specify the adapter sequence, since it is the same as the default TruSeq3 adapter sequence.

    Good luck!



    • #3
      Originally posted by relipmoc
      Try:
      Code:
      $ grep -c 'GATCGGAAGAGC' reads.fastq
      $ wc -l < reads.fastq | awk '{print $1/4}'
      Dividing the first number by the second gives an estimate of the contaminant ratio.

      For adapter trimming, I suggest using skewer. In your case you don't need to specify the adapter sequence, since it is the same as the default TruSeq3 adapter sequence.

      Good luck!
      Thanks for the quick reply! I typed $ grep -c 'GATCGGAAGAGC' reads.fastq
      and got 28875. What does this mean?
      Also, I am doing an SE assembly, while skewer is for PE...
      Last edited by paa6; 03-09-2014, 10:42 PM.



      • #4
        You can type grep --help for a brief description of OPTIONS for grep.

        -c, --count only print a count of matching lines per FILE
        The result you got was 28875, which means that 28875 lines of the file (in practice, 28875 reads) contain the substring 'GATCGGAAGAGC', most likely from adapter contamination.
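        To actually see where the adapter sits in each read, something like the sketch below may help. It assumes a four-line-per-record FASTQ, and adapter_positions is a hypothetical helper name, not an existing command. For each matching read it prints the header line, the 1-based start position of the adapter and the read length, separated by tabs.

```shell
# adapter_positions FILE [ADAPTER]  (hypothetical helper, not an existing command)
# Assumes a 4-line-per-record FASTQ: line 1 is the header, line 2 the sequence.
adapter_positions() {
    awk -v adapter="${2:-GATCGGAAGAGC}" '
        NR % 4 == 1 { name = $0 }
        NR % 4 == 2 {
            pos = index($0, adapter)      # 0 when the adapter is absent
            if (pos > 0) print name "\t" pos "\t" length($0)
        }
    ' "$1"
}
```

        Running adapter_positions reads.fastq | head shows the first few hits. With adapter read-through you would typically expect the adapter to start partway into the read and run toward the 3' end.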



        • #5
          Originally posted by yueluo
          You can type grep --help for a brief description of OPTIONS for grep.

          The result you got was 28875, suggesting that 28875 reads contained the substring of 'GATCGGAAGAGC' - which is most likely adapter contamination.
          Oh, OK, thanks!
