Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • StevenW
    Member
    • May 2011
    • 15

    #61
    Hi,
    I am responsible for developing FastQ Screen.

    The standard way to remove contamination is:
    Run FastQ Screen (latest version) with --subset (to process the entire dataset) and --nohits. In the config file include the Bowtie1/2 indices of all the potential contaminants (human genome indices should not be included).

    A FastQ file should then be produced containing all the reads that did not map to any of the contaminants.

    I am wondering, why do you need to only remove the hits that are classified as 'one-hit/one-library' AND 'multiple-hits/one-library'? Also is this single-end data?

    Please feel free to contact me directly to discuss this further.

    Kindest regards,
    Steven W

    Comment

    • touchsk
      Junior Member
      • Aug 2015
      • 7

      #62
      Thanks Steven

      I apologize for the delayed reply.

      Yes, I am doing similar to what you have mentioned. I have two config files in place:
      - one with just the contaminants for filtering the fastq file as you mentioned
      - and one with contaminants and mammalian genomes to generate a figure using your tool that depicts the level of contamination in comparison to its real hit in a mammal (similar plot to the example on the fast_screen webpage.

      This is single-end data.

      The question on the single-hit was motivated because we have some contaminant-like sequences (custom to our study), that we want to quantify but not remove if they have homologs in mammalian genomes. But again using a similar approach as above, I am able to tackle that (stepwise).

      Thanks for a very useful tool.

      Comment

      • StevenW
        Member
        • May 2011
        • 15

        #63
        Excellent, so everything is fine?

        PS I'm releasing a new version of the software today.

        Comment

        • turnersd
          Senior Member
          • May 2011
          • 115

          #64
          hey, what ever happened to the --paired option? Is it still possible to screen paired end data?

          Comment

          • StevenW
            Member
            • May 2011
            • 15

            #65
            Hi,
            Thanks for your message; I am part of the team responsible for developing FastQ Screen.
            We removed the --paired option from the script in a recent update as we felt it was unnecessary and was causing confusion. Mapping forward or reverse reads independently should be perfectly adequate to ascertain whether there is contamination, and will also provide the user with additional information if the forward reads are more prone generally to contamination than the reverse reads (or vice versa). Also, some users were reporting that the script was sometimes failing to detect contamination in --paired mode. For example, if the read pair did not constitute a contiguous region of DNA, or if the paired reads were separated by are large distance (such as RNA seq).
            So we now recommend that you screen both read files independently.
            Is there any particular reason you would have to use the –paired mode?

            Comment

            • jaas
              Junior Member
              • May 2013
              • 2

              #66
              FastQ Screen for bisulfite samples

              Hello,

              I'm trying to use fastq screen for bisulfite sequencing samples. I've run the test data and that works fine. However, I get a file handle error when running my bisulfite samples:
              my code:

              PHP Code:
              fastq_screen --bisulfite G3.S22.fastq.gz 
              Output:
              PHP Code:
              Using fastq_screen v0.11.2
              Defaulting to Bowtie 2 
              for --bisulfite mode
              Reading configuration from 
              '/data/Bismark/fastq_screen_v0.11.2/fastq_screen.conf'
              Using '/usr/lib/bowtie2/bin/bowtie2' as Bowtie 2 path
              Using 
              '/data/Bismark/bismark' as Bismark path
              Adding database Daphnia
              Using 8 threads 
              for searches
              Option 
              --subset set to 100000 reads
              Processing G3
              .S22.fastq.gz
              Counting sequences in G3
              .S22.fastq.gz
              Making reduced sequence file with ratio 69
              :1
              Searching G3
              .S22.fastq.gz_temp_subset.fastq against Daphnia
              open
              No such file or directory
              [main_samviewfail to open "/data/Bismark/fastq_screen_v0.11.2/Daphnia.G3.S22.fastq.gz_temp_subset_bismark_bt2.bam" for reading.
              Cannot close filehandle on '/data/Bismark/fastq_screen_v0.11.2/Daphnia.G3.S22.fastq.gz_temp_subset_bismark_bt2.bam' :  at fastq_screen line 1059. 
              I do get the outputfile and mapping report of the subsample against the first database, so it seems that the mapping did work. It happens regardless of the databases I use. However when using my samples in non bisulfite mode, and mapping them against the regular genome indices, this does not happen. So I do not think my sample file is this issue. Also, I know my bismark genome build indices are fine as I used them with bismark as well.

              Any ideas on what is wrong or why this is happening?

              Thanks!

              Comment

              • StevenW
                Member
                • May 2011
                • 15

                #67
                FastQ Screen Bisulfite Problem

                Hi jaas,

                I am one of the developers of FastQ Screen. Hopefully we can get this problem resolved quickly.

                Would you be able to send me the configuration file you used when running FastQ Screen. This will help me resolve the problem.

                Many thanks,

                Steven

                Comment

                • jaas
                  Junior Member
                  • May 2013
                  • 2

                  #68
                  Here's the config file. I can't figure out how to send it to you alone. I have changed the extension to a txt file to be able to upload it.

                  Thanks in advance for your help
                  Attached Files

                  Comment

                  Latest Articles

                  Collapse

                  • GATTACAT
                    Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by GATTACAT
                    Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                    07-01-2026, 11:43 AM
                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                    Here are nine questions we think about, in roughly the order they matter, before...
                    06-18-2026, 07:11 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, 07-02-2026, 11:08 AM
                  0 responses
                  12 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-30-2026, 05:37 AM
                  0 responses
                  15 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-26-2026, 11:10 AM
                  0 responses
                  20 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  54 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...