Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • python error in cuffdiff script

    I used this in command line:

    cuffdiff -o diff_out4 -b ../genome/ce10.fa -p 2 -L larval,early -u merged_asm/merged.gtf ../tophat/em/SRR493359_60_61_thout/accepted_hits.bam ../tophat/em/SRR493363_64_65_thout/accepted_hits.bam

    here SRR493359_60_61 is one group and SRR493363_64_65 is the other group. This worked for me.

    Now I want to run this from a python script. So, I gave a call in this way.

    do.call([cfg.tool_cmd("cuffdiff"), "-p", str(cfg.project["analysis"]["threads"]), "-b", str(cfg.project["genome"]["fasta"]), "-u", cfg.project["samples"][0]["files"]["merging_gtf"], "-L", str(cfg.project["phenotype"][0]), str(cfg.project["phenotype"][1]), "-o", output_folder] + [cfg.project["samples"][0]["files"]["bam"] cfg.project["samples"][1]["files"]["bam"]], cfg.project["analysis"]["log_file"])

    here
    str(cfg.project["phenotype"][0]) is larval
    str(cfg.project["phenotype"][1]) is early

    I get an error: invalid syntax. Can anyone please help in this. Thanks in Advance

  • #2
    It looks to me like you're perhaps missing a comma after cfg.project["samples"][0]["files"]["bam"]. As a general debugging strategy, when you have massive blocks of code like that, it's often easier to break it up into component parts, so those could be debugged individually. So something like:

    Code:
    base_options = [cfg.tool_cmd("cuffdiff"),
                            "-p", str(cfg.project["analysis"]),
                            "etc etc"]
    
    files = [
        cfg.project["samples"][0]["files"]["bam"],
        cfg.project["samples"][1]["files"]["bam"],
        cfg.project["analysis"]["log_file"]
    ]
    
    do.call(base_options + files)

    Comment


    • #3
      Originally posted by rflrob View Post
      It looks to me like you're perhaps missing a comma after cfg.project["samples"][0]["files"]["bam"]. As a general debugging strategy, when you have massive blocks of code like that, it's often easier to break it up into component parts, so those could be debugged individually. So something like:

      Code:
      base_options = [cfg.tool_cmd("cuffdiff"),
                              "-p", str(cfg.project["analysis"]),
                              "etc etc"]
      
      files = [
          cfg.project["samples"][0]["files"]["bam"],
          cfg.project["samples"][1]["files"]["bam"],
          cfg.project["analysis"]["log_file"]
      ]
      
      do.call(base_options + files)
      Thanks for the reply. You may be right in one case. But I want to run this for groups.

      cfg.project["samples"][0]["files"]["bam"] is group1
      cfg.project["samples"][1]["files"]["bam"] is group2

      so it should be group1 vs group2. So, the bam files should be separated with space. And If I run this with space its giving an error. Please tell me how to run this with space between the bam files.

      Comment


      • #4
        I'm not familiar with this "do" module you seem to be using, but I'm pretty sure that passing a list is the equivalent of spaces on the command line. For instance, in the original cuffdiff call, it's not "cuffdiff, -o, diff_out4, ...". So you can see that the syntax in python isn't exactly the same as the command that gets generated. Thus, you really do want that comma there.

        More generally, Python doesn't know how to combine the two variables if there's no comma between them. Depending on the type (which the syntax parser doesn't know in advance), you could want to concatenate them (if strings), multiply them (if they're numbers), or do something else entirely. Recall the Zen of Python: "In the face of ambiguity, refuse the temptation to guess." Since the syntax could be ambiguous, it instead throws a syntax error.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM
        • seqadmin
          Techniques and Challenges in Conservation Genomics
          by seqadmin



          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

          Avian Conservation
          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
          03-08-2024, 10:41 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 06:37 PM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 06:07 PM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-22-2024, 10:03 AM
        0 responses
        50 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-21-2024, 07:32 AM
        0 responses
        67 views
        0 likes
        Last Post seqadmin  
        Working...
        X