Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie2 paired end --un-conc

    Hi all,
    As part of a normalization strategy for a ChIP-seq experiment, I am using Bowtie2 to align paired end ChiP-seq reads first to the fly genome and then take the unaligned reads and align to the human genome.

    In order to output the reads that fail to align to fly concordantly I'm using --un-conc like this:

    bowtie2 -p 8 --no-unal --un-conc unaligned.fq -x DMBDGP6.dna.primary_assembly -1 sample1_R1.fastq.gz -2 sample2_R2.fastq.gz -S output.sam

    This produces two fastq files called unaligned.1.fq and unaligned.2.fq (.1 and .2 added to make per-mate filenames). I want to use these files downstream to align to the human as paired-end data. But, the output is different in length (different numbers of reads), which causes Bowtie2 to fail when mapping paired-end with -1 and -2.

    Is there a way to have Bowtie2 output only the paired reads to these two files that fail to align concordantly? Or do I need to align downstream as independent unpaired reads, which completely defeats the purpose of using PE reads in the first place.

    Thanks for the advice!
    T

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin


    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
    Today, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
37 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
41 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
35 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
54 views
0 likes
Last Post seqadmin  
Working...
X