Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Julia_m
    Junior Member
    • Jun 2016
    • 4

    paired end data not aligned in NextSeq 500

    Hello!!
    I'm quite new in NGS and I got these fastq files (read1 and read2) from illumina nextseq 500.
    Now the problem is that in somehow the two file are not aligned, I mean, I checked for the coordinates in read1 and read2 on the same row and it appears that sometimes are not the same!
    Since I'm quite new, this can be possible? or anyway is there any tools or bash script that can help me in the alignment of these files?


    Hope seriously in somebody that can help me!
  • Krish_143
    Member
    • Jan 2012
    • 45

    #2
    Hi Julia_m,

    Is that Genomic data or RNAseq - transcriptomic data. Okay !

    If it was not aligning that could mean.. paired read was not found in one of the other file.

    Probably you might have the trimmed data and considered all the ones. If it was trimmed you should keep the read that present in both the files. (Check any trimming software eg: cutadapt and try mapping)

    Hope it works.
    Krishna

    Comment

    • Julia_m
      Junior Member
      • Jun 2016
      • 4

      #3
      cutadapt is for adaptor cutting isn't it?
      i mean my problem is quite different, I have two fastq files like these:
      READ1.fastq
      @NB501365:8:HF3HLAFXX:1:11101:22082:1033 1:N:0:CGAGTA
      READ2.fastq
      @NB501365:8:HF3HLAFXX:1:11101:22082:1033 2:N:0:CGAGTA

      the coordinates 22082:1033 for read1 and 22082:1033 for read2 shouldn't be the same in paired ends?

      Comment

      • Julia_m
        Junior Member
        • Jun 2016
        • 4

        #4
        sorry big mistake READ2.fast has the following header:
        @NB501365:8:HF3HLAFXX:1:11101:64563:1033 2:N:0:CGAGTA

        Comment

        • dpryan
          Devon Ryan
          • Jul 2011
          • 3478

          #5
          Your data looks as it should. The "1:N:0:CGAGTA" portion, as an example, just says "I'm read 1, I didn't fail quality control filtering on the machine, and my barcode was CGAGTA". Read 2 should look the same (and have the same read name), with the exception that there's a 2 rather than a 1 in the second block of text.

          So go ahead and quality/adapter trim this dataset (e.g., with "Trim Galore!" or trimmomatic) and then use an aligner (bowtie2, bbmap, hisat2, bwa, STAR, etc.).

          Edit: Oh, if the read names are really different (I just now noticed your most recent post) then you'll need to resync the files. There's a tool in BBMap that can do this for you (it has a LOT of convenient tools).

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            As @Devon said your R1/R2 files may be out of sync. You can use repair.sh from BBMap suite to re-sync the files.

            You will find example of that command line and lots of other things BBMap suite can do in this thread.

            Comment

            • sklages
              Senior Member
              • May 2008
              • 628

              #7
              It would still be interesting, what has caused the reads to be out-of-sync.
              Have these reads been pre-processed with some tool before or is this the data that has been sent by sequencing provider?

              Fixing is important, avoiding, or knowing how to avoid it, is more important ;-)

              Just my 2p.

              Comment

              • Julia_m
                Junior Member
                • Jun 2016
                • 4

                #8
                problem solved!! seems that something happened during the unzipping process (I don't know why), if you process everything directly from the gzip file it's fine!.

                Comment

                Latest Articles

                Collapse

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, 06-09-2026, 11:58 AM
                0 responses
                30 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-05-2026, 10:09 AM
                0 responses
                38 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-04-2026, 08:59 AM
                0 responses
                43 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                64 views
                0 reactions
                Last Post SEQadmin2  
                Working...