Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • 7tbear7
    Junior Member
    • Jul 2017
    • 3

    ONT Data Analysis Questions

    Hey Guys,

    I'm collaborating with someone who has the minION. I just got my first data set, and am stoked to start looking into it! But I was expected a fast5 file and was sent a fastq file....

    None of the nanopore tools will take a fastq input for their analysis commands, and I can't seem to use the fastx toolkit for normal filtering... I keep getting a ***stack smashing detected*** error.

    Does anybody have any experience they'd like to share?

    Thanks!
  • VinceM
    Junior Member
    • Mar 2016
    • 7

    #2
    Hi 7tbear7,

    I'm surprised you are having trouble with fastq files. Fast5 files were a nightmare to convert to fastq when I first started using the MinION; once in fastq they were relatively easy to analyse. Now I think fastq and fasta are produced automatically by the MinION software, in addition to fast5 files.

    Which tools are you having problems with? Have you had a look at the fastq file in a text editor, just to see if the data is actually in fastq format?

    Vince

    Comment

    • 7tbear7
      Junior Member
      • Jul 2017
      • 3

      #3
      Thanks for your reply Vince! I have looked at the fastq file and it looks normal to me. I keep getting the segmentation fault or stack smashing -- both having to do with memory limitations? I have a large amount of memory available though. I tried using FastQC to filter the data and that also froze and was unable to process it.

      It's 7,000,000 lines long. I attached a screen shot of the first few lines.

      Originally posted by VinceM View Post
      Hi 7tbear7,

      I'm surprised you are having trouble with fastq files. Fast5 files were a nightmare to convert to fastq when I first started using the MinION; once in fastq they were relatively easy to analyse. Now I think fastq and fasta are produced automatically by the MinION software, in addition to fast5 files.

      Which tools are you having problems with? Have you had a look at the fastq file in a text editor, just to see if the data is actually in fastq format?

      Vince
      Attached Files

      Comment

      • VinceM
        Junior Member
        • Mar 2016
        • 7

        #4
        Certainly looks OK. Have you tried converting the fastq to fasta to see if that would work? If it's a memory issue that might help.

        Not sure I can contribute much more - wet lab person, not bioinformatics. Anyone else like to pitch in?

        Comment

        • VinceM
          Junior Member
          • Mar 2016
          • 7

          #5
          Just one more suggestion. Copy a small part of the fastq file (for example, 50 - 100 sequences) and run that truncated file through the analysis programs.

          Comment

          • 7tbear7
            Junior Member
            • Jul 2017
            • 3

            #6
            Thanks again for your input Vince.

            I copied the first 3 lines (head) into a new file and tried to use the fastx toolkit to quality filter and it gave me the same segmentation fault. So something is clearly wrong and it's hard to imagine it's the file. Must be one of the dependencies? I have a feeling it's Java...

            If I convert to FASTA I lose all my quality scores though, correct?

            Comment

            • VinceM
              Junior Member
              • Mar 2016
              • 7

              #7
              Originally posted by 7tbear7 View Post

              If I convert to FASTA I lose all my quality scores though, correct?


              Yes, you would lose all your quality data.

              Comment

              • gringer
                David Eccles (gringer)
                • May 2011
                • 845

                #8
                Originally posted by 7tbear7 View Post
                I copied the first 3 lines (head) into a new file and tried to use the fastx toolkit to quality filter and it gave me the same segmentation fault.
                Er... don't use fastx-toolkit for nanopore files. It was designed more than a few years ago, and is probably not coded to be able to handle lines with tens of thousands of bases.

                And before the question is asked, "Well, what should be used instead?", have a think about what it is that you want to do and turn that desire into a question; ask about your problem, rather than your attempts at a solution.
                Last edited by gringer; 08-01-2017, 11:56 PM.

                Comment

                • jordi
                  Member
                  • Apr 2009
                  • 49

                  #9
                  Hi @7tbear7,
                  since you have MinION device, I am sure you have access to Nanopore Community. Look at the "Community-developed data-analysis tools" topic to find out which tools are devoted to deal with nanopore reads.
                  Regarding the usage of FASTQC, you can find te following option:

                  --nano Files come from naopore sequences and are in fast5 format. In
                  this mode you can pass in directories to process and theprogram
                  will take in all fast5 files within those directories and produce
                  a single output file from the sequences found in all files.

                  However, I think it is still under development and someone here will be more adequate to answer some questions about it

                  Hope this helps!

                  Comment

                  • maimounak
                    Junior Member
                    • Apr 2017
                    • 2

                    #10
                    hi! i am bioinformatic student in Mali i work in autism gene i am now need to determine an frequence of some autism gènes in another country eg europe america and so off i need to know if exist and genomics database who can help me, i hope on another issus if possible. thank you.

                    Comment

                    • gringer
                      David Eccles (gringer)
                      • May 2011
                      • 845

                      #11
                      If you find a functional gene in one human population, it is highly likely that that gene exists in all human populations. Perhaps you could post your question as a separate thread; it doesn't seem to be related to ONT Data Analysis.

                      Comment

                      • maimounak
                        Junior Member
                        • Apr 2017
                        • 2

                        #12
                        autism gene mutations identification

                        thank you for your answer,

                        now i need to know for this major autism gene mef2c mutation frequency for europeen, African and American autist. thank you

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          06-02-2026, 10:05 AM
                        • SEQadmin2
                          Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                          by SEQadmin2


                          With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                          Introduction

                          Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                          05-22-2026, 06:42 AM
                        • SEQadmin2
                          Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                          by SEQadmin2

                          Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                          Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                          05-06-2026, 09:04 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, 06-02-2026, 12:03 PM
                        0 responses
                        19 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-02-2026, 11:40 AM
                        0 responses
                        14 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-28-2026, 11:40 AM
                        0 responses
                        29 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-26-2026, 10:12 AM
                        0 responses
                        31 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...