Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bwa alignment failed

    Hello, I am trying to align my Illumina hiseq data using bwa:

    ./bwa index -c -a bwtsw -p hg19 hg19.fa

    ./bwa aln -t 4 -c hg19 quatrim.fq > quatrim.sai

    ./bwa samse -f quatrim.sam hg19 quatrim.sai quatrim.fq

    Then .sam was convered to .bam

    I used samstool to check the alignment:

    ./samtools idxstats quatrim.bam

    chr1 249250621 1 0
    chr2 243199373 0 0
    chr3 198022430 2 0
    chr4 191154276 1 0
    chr5 180915260 1 0
    chr6 171115067 0 0
    chr7 159138663 0 0
    chr8 146364022 1 0
    chr9 141213431 0 0
    chr10 135534747 0 0
    chr11 135006516 1 0
    chr12 133851895 0 0
    chr13 115169878 0 0
    chr14 107349540 1 0
    chr15 102531392 0 0
    chr16 90354753 0 0
    chr17 81195210 0 0
    chr18 78077248 0 0
    chr19 59128983 3 0
    chr20 63025520 1 0
    chr21 48129895 0 0
    chr22 51304566 0 0
    chrX 155270560 0 0
    chrY 59373566 0 0
    chrM 16571 0 0
    * 0 0 3081338

    When I ran the above steps, everything looked fine. The quality scores of the reads are also very good. I blasted some, and they mapped perfectly to the genome. I am wondering which step(s) might have problems. Thanks!

  • #2
    bwa aln -c is for colorspace data (i.e. from SOLiD platforms). As is bwa index -c

    You state you have Illumina data.

    I suggest removing the -c options from both commands.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 08:47 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    59 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    54 views
    0 likes
    Last Post seqadmin  
    Working...
    X