Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how to install cutadapt in ubuntu 12.04

    I have tried everything.......from installing python2.7 to python 3.3 and then installing cython as well.....but I get error everytime while installing cutadapt...I am working on root as well..

    .Error compiling Cython file:
    ------------------------------------------------------------
    ...
    # kate: syntax Python;
    # cython: profile=False
    from __future__ import print_function, division, absolute_import
    ^
    ------------------------------------------------------------

    cutadapt/_seqio.pyx:3:49: future feature absolute_import is not defined
    cutadapt/_seqio.c:1:2: error: #error Do not use this file, it is the result of a failed Cython compilation.
    error: Setup script exited with error: command 'gcc' failed with exit status 1
    root@ubuntu:~/.local/share#

    I don't understand why it shows that error compiling cython file...
    I am trying to install cutadapt because I want to remove adapter sequence from my illumina reads...
    friends please suggest something

  • #2
    I suggest BBDuk.

    Just download and unzip. It requires Java.

    For paired files, the command would be:

    bbduk.sh -Xmx1g in1=reads1.fq in2=reads2.fq out1=clean1.fq out2=clean2.fq ref=adapters.fa ktrim=r k=28 mink=12

    "ktrim=r" will locate adapters and trim them, removing the kmer and all bases to its right. "k=28 mink=12" will use 28-mers for the bulk of the read, and drop as low as 12-mers for adapters on the far right end of the read (which may have fewer than k bases in the read).

    You can optionally specify "hdist=X" (where X is a number 1-3) to allow that many substitution errors in the kmer.

    Comment


    • #3
      thanks for quick reply , I will try this one.. but I am using single end Illumina reads....and my adapters are default ones from trueseq. which means I have 13 bases AGATCGGAAGAGC.
      please tell me codes for single end..
      thanks in advance..

      Comment


      • #4
        For single-ended data:

        bbduk.sh in=reads.fq out=clean.fq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20

        Truseq adapters are actually much longer than 13 bases. I don't have them on my current computer but if you want increased specificity you should use a file with the whole adapter; I'll post it tomorrow, unless someone else does first. Though 13 bases should give you something like a 1/67 million false positive rate, so it's probably fine. "minlen=20" is optional; it tells it to discard reads that are shorter than 20bp after trimming.

        Comment


        • #5
          I know what u meant by trueseq. adapters and I know they r pretty long. Actually in my fastqc report I got error in Sequence duplication level. And I got warning in overrepresented sequences....
          I got this sequence in report
          CCTTCAGATCGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGA

          but while analyzing my kmer content report (I got warning here as well) I didnt get any kmer which were matching to overrepresented sequence and they are...
          1. CTTCA
          2.GAAGA
          3.TGCCG
          4.CAGCA
          5.AGCAG
          6.TCAGC
          I got many but writing these ones because they r mentioned in the graph.
          My per base sequence quality and per base GC content report is good so I came on conclusion that there is nothing wrong with my sequence but might be I got adapter contamination. A part of trueseq. adapter those 13 bases I can see in my overrepresented sequence and i.e. why I have decided to trim only these 13 bases.

          Comment


          • #6
            @paa6: Option suggested by Brian is pretty straight forward and will not need any software installation.

            If you are having problems installing cutadapt then give trimmomatic a try: http://www.usadellab.org/cms/?page=trimmomatic. Trimmomatic includes the adapter sequence files you need to trim adapters.

            Comment


            • #7
              Originally posted by GenoMax View Post
              @paa6: Option suggested by Brian is pretty straight forward and will not need any software installation.

              If you are having problems installing cutadapt then give trimmomatic a try: http://www.usadellab.org/cms/?page=trimmomatic. Trimmomatic includes the adapter sequence files you need to trim adapters.
              Actually I have tried brian idea but the problem is nothing is mentioned in site how to install it neither he mentioned it so I am thinking to give try to trimmomatic...

              Comment


              • #8
                Originally posted by paa6 View Post
                Actually I have tried brian idea but the problem is nothing is mentioned in site how to install it neither he mentioned it so I am thinking to give try to trimmomatic...
                There is nothing to install. As long as you have Java available on your system you will be able to run Brian's code. He has included examples of how to use the programs in this post: http://seqanswers.com/forums/showthread.php?t=41057
                Last edited by GenoMax; 03-26-2014, 05:35 AM.

                Comment


                • #9
                  Originally posted by paa6 View Post
                  Actually I have tried brian idea but the problem is nothing is mentioned in site how to install it neither he mentioned it so I am thinking to give try to trimmomatic...
                  Just unzip and untar the file you download, then you can run the shellscripts.

                  Comment


                  • #10
                    [QUOTE=Brian Bushnell;136213]Just unzip and untar the file you download, then you can run the shellscripts.[/QUOTE
                    I did unzip and untar after that I gave code like u said but I am getting this error
                    singhparul@ubuntu:~/trim_ends$ bbduk.sh in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20
                    bbduk.sh: command not found

                    Comment


                    • #11
                      Originally posted by Brian Bushnell View Post
                      Just unzip and untar the file you download, then you can run the shellscripts.
                      I HAVE TRIED THIS WAY TOO..... but still getting error
                      singhparul@ubuntu:~/Desktop$ chmod +x bbduk.sh
                      singhparul@ubuntu:~/Desktop$ sh ./bbduk.sh
                      ./bbduk.sh: 84: ./bbduk.sh: Bad substitution
                      ./bbduk.sh: 115: ./bbduk.sh: [[: not found
                      ./bbduk.sh: 123: ./bbduk.sh: [[: not found
                      ./bbduk.sh: 132: ./bbduk.sh: Illegal number: unlimited

                      Comment


                      • #12
                        Hmm... you can bypass the shellscript like this:

                        java -ea -Xmx1g -cp /path/to/current/ jgi.BBDukF in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20

                        Comment


                        • #13
                          Originally posted by Brian Bushnell View Post
                          Hmm... you can bypass the shellscript like this:

                          java -ea -Xmx1g -cp /path/to/current/ jgi.BBDukF in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20
                          am sorry but again I am getting error
                          singhparul@ubuntu:~/trim_ends$ java -ea -Xmx1g -cp /path/to/current/ jgi.BBDukF in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20
                          Error: Could not find or load main class jgi.BBDukF
                          please tell me step by step...I am newbie in linux

                          Comment


                          • #14
                            how to install short read alignment and working principle?

                            hi,how to install short read alignment and working principle?

                            Comment


                            • #15
                              Originally posted by paa6 View Post
                              Originally posted by Brian Bushnell View Post
                              Just unzip and untar the file you download, then you can run the shellscripts.[/QUOTE
                              I did unzip and untar after that I gave code like u said but I am getting this error
                              singhparul@ubuntu:~/trim_ends$ bbduk.sh in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20
                              bbduk.sh: command not found
                              Parul: Can you try the shell script like below? I am assuming that execute bit has been set on the shell script per one of your previous commands.

                              Note: You may have to provide full file system paths if the files are not accessible in the current directory.

                              Code:
                              $ ./bbduk.sh in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20
                              or if you are not in the directory where the bbduk.sh script is then do the following:
                              Code:
                              $ /full_path_to/bbduk.sh in=Illumina_raw_reads.fastq out=clean.fastq ktrim=r k=13 literal=AGATCGGAAGAGC minlen=20

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Essential Discoveries and Tools in Epitranscriptomics
                                by seqadmin




                                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                                Yesterday, 07:01 AM
                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              57 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              53 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 09:21 AM
                              0 responses
                              45 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-04-2024, 09:00 AM
                              0 responses
                              55 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X