Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how to cut the adapter without knowing the seq?

    i'm a green hand in RNA-seq data analysis. i have a raw data that need to cut the adapters. i'm using the cutadapter software to do the job. but i don't know the adapter seq. what can i do to solve the problem.
    thank you very much!

  • #2
    You could run fastqc and see if there are any overrepressented adapter sequences. It will identify which adapter or primer sequences are in there, if it finds too many of them.

    Comment


    • #3
      If you're using Illumina sequences then you can trim with the common Illumina prefix which is present at the start of pretty much all of their adapters. If you use our trim_galore trimmer (which is a wrapper around cutadapt) then it will do this for you automatically.

      Comment


      • #4
        I find the kmer plots of fastqc useful for this. If there are adapters at the end or something, you'll see the corresponding kmers coming up at the end. You can usually use this plus the contaminants list from fastqc to figure out what to trim.

        I'm more familiar with trimming for small RNA data though. I don't usually trim anything from RNA-seq.

        Comment


        • #5
          Sangenix

          Sangenix can perform adapter trimming , from fastqc report. so no need to provide adapter seq for trimming. and it is all automated.

          Try Sangenix: http://www.sangenix.com/

          Comment


          • #6
            Originally posted by vineet jha View Post
            Sangenix can perform adapter trimming , from fastqc report. so no need to provide adapter seq for trimming. and it is all automated.

            Try Sangenix: http://www.sangenix.com/
            How does it decide what to trim based on the FastQC report? I'm not sure I could reliably figure that out manually from just the information FastQC provides.

            Comment


            • #7
              Sangenix Trim Adapter from fastQC report

              Sangenix Reads the fastQC report , and create a list of over represented sequences . It uses these Sequence as adapter trimming from corresponding fastQ file . it trims the bases based upon user set parameter also. the attached figure show , before and after images of adapter trimming after QC and filtering step done by sangenix

              Attached Files
              Last edited by vineet jha; 12-06-2013, 03:32 AM.

              Comment


              • #8
                Originally posted by simonandrews View Post
                If you're using Illumina sequences then you can trim with the common Illumina prefix which is present at the start of pretty much all of their adapters. If you use our trim_galore trimmer (which is a wrapper around cutadapt) then it will do this for you automatically.
                thx very much! i have tried it

                Comment


                • #9
                  Originally posted by vineet jha View Post
                  Sangenix can perform adapter trimming , from fastqc report. so no need to provide adapter seq for trimming. and it is all automated.

                  Try Sangenix: http://www.sangenix.com/
                  Well, .. and were can the software be downloaded? I could not find any download link!?

                  Comment


                  • #10
                    Originally posted by vineet jha View Post
                    Sangenix Reads the fastQC report , and create a list of over represented sequences . It uses these Sequence as adapter trimming from corresponding fastQ file . it trims the bases based upon user set parameter also. the attached figure show , before and after images of adapter trimming after QC and filtering step done by sangenix
                    I'm not sure that's a great idea. The overrepresented sequences identification will really only spot if you have adapter dimers or other concatamers. If you have more normal read-through adapter contamination then these won't show up as the starting sequences will be more diverse. Also if you do get dimers they don't always show the full length of the adapter sequence to trim. If nothing else you'll normally find they're missing the initial 'A' at the start of the adapter (from the overhang ligation) so you'd need to add that, but we've also seen other truncated forms turn up. You're also potentially open to hitting other sequences which are overrepresented in your library. Doing this on a smallRNA library for example would cause a real mess.

                    Comment


                    • #11
                      Yea using overrepresented sequences as the adapter sounds like a horrible idea. If you tried it with an amplicon or mate pair library all of your data is going to get wiped away.

                      Comment


                      • #12
                        Well, the idea would be to only use fastqc to find which adapters were used, then trim those. If fastqc is finding overrepresented sequences other than known adapters, you ignore it and don’t trim it.

                        Comment


                        • #13
                          But even then it'll have to show up in the overrepresented sequences and is fully dependent on whether or not the contaminant list is up to date. If I recall correctly the default list doesn't have Nextera nor the new targeted RNA adapters.

                          Comment


                          • #14
                            Originally posted by kcchan View Post
                            But even then it'll have to show up in the overrepresented sequences and is fully dependent on whether or not the contaminant list is up to date. If I recall correctly the default list doesn't have Nextera nor the new targeted RNA adapters.
                            Further contributions gratefully received

                            You can also add in more sequences yourself to the contaminants.txt file.

                            Comment


                            • #15
                              Originally posted by Noah Pieta View Post
                              i'm a green hand in RNA-seq data analysis. i have a raw data that need to cut the adapters. i'm using the cutadapter software to do the job. but i don't know the adapter seq. what can i do to solve the problem.
                              thank you very much!
                              Why can't you ask from the person who gave you the data?
                              savetherhino.org

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Essential Discoveries and Tools in Epitranscriptomics
                                by seqadmin


                                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                                Today, 07:01 AM
                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              37 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              41 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 09:21 AM
                              0 responses
                              35 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-04-2024, 09:00 AM
                              0 responses
                              54 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X