Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Picard

    Can anyone please tell me how to actually run Picard? I have downloaded and unzipped all the files, but do not know how to run them. Is there any software that you recommend for running a java based program out of a Windows 7 system? Are there any tutorials that you recommend?

    Please let me know! I am keen to use this program, but don't know where to begin.

  • #2
    Picard is a set of command line tools, so you would need a Linux or Mac system to use a Unix command line. You could try running it on Cygwin.

    Comment


    • #3
      Originally posted by pbluescript View Post
      Picard is a set of command line tools, so you would need a Linux or Mac system to use a Unix command line. You could try running it on Cygwin.
      Picard tools are java programs; so long as you have the appropriate Java VM installed you can run them on the Windows command line.

      @gwilymh, the standard procedure for invoking tools packaged as java jar files is:

      Code:
      C:\> java [options for java] -jar <full path to jar file> [options for jar]
      
      For example:
      
      C:\Documents and Settings\myHome> java -Xmx2g -jar C:\Program Files\picard-tools-1.61\MarkDuplicates.jar I=myInput.bam O=myDupMarked.bam
      See the Picard tools website for detailed documentation of general options and options specific to each of the command line tools.

      Comment


      • #4
        Thank you kmcarr and pblue script. Your help is much appreciated.

        Comment


        • #5
          Hi,

          I am using CalculateHsMetrics in picard-tools to get an idea about the target region sequenced. However, could anybody tell me what is the normalized coverage in the PER_TARGET_COVERAGE file?

          Thanks

          Comment


          • #6
            It looks to be mean_coverage/MEAN_TARGET_COVERAGE (from the metrics file)
            The description of MEAN_TARGET_COVERAGE is "The mean coverage of targets that recieved at least coverage depth = 2 at one base." - http://http://picard.sourceforge.net...html#HsMetrics

            Comment


            • #7
              How exactly are you supposed to interpret normalized coverage? For instance, I have a bunch of samples that got sequenced. For 1 specific target the length of the target is approximately 3000bp. Most of the samples have a low number of reads that map to the target, less than 500 reads. 2 samples have a more than 2000 reads that map to the same target. When I'm comparing the normalized coverage value, some of the samples with very low overall mapped read counts but have a high normalized coverage value for this target. The samples with a large overall mapped read counts have a lower normalized_coverage value for the same target.

              That makes comparing the normalized_coverage value across samples difficult and probably not appropriate. Should I only look at the normalized_coverage values for within the sample comparisons? I'm reverting back to comparing mean coverage per target instead.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              52 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              45 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X