Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • QTL overlap (bedtools)

    Dear all, I am trying to compare some QTL regions with specific genome regions that I have, for it I am using bedtools.
    I used before bedtools with a list of genes and it worked well, but this time I am getting the error message: "***** ERROR: too many digits/characters for integer conversion in string . Exiting...".

    My input files are:
    QTL file (with about 5,000 regions):
    Chr1 241009 241109 GROW
    Chr1 241009 241109 PLASCO
    Chr1 2095900 3104645 BW
    Chr1 2329958 2400142 AF
    Chr1 2329958 2400142 BW


    and my genomic regions input (with about 700 regions):
    Chr1 18340001 18360000
    Chr1 18350001 18370000
    Chr1 18570001 18590000
    Chr1 18960001 18980000
    Chr1 18970001 18990000
    Chr1 18980001 19000000

    I tried different command lines, and I got the same error every time (and some times I had a output until the chromosome 8, and sometimes no result):
    Code:
    intersectBed -c -a QTLfile.bed  -b regionFile.bed > outfile
    I am not sure what is the problem, I checked my files and it seems that they are ok....I tried also to increase the memory and split the chromosomes, and I also had the same error..

    thanks
    Last edited by clarissaboschi; 06-01-2016, 12:27 PM.

  • #2
    Sounds like an error with your input files. MAybe the separators are wrong on one line.

    Try reading both files into R / Calc etc and doing some summary statistics. You'll probably find that there is a space not a tab on one line.

    Comment


    • #3
      Thanks Colindaven, yes I was thinking that the issue is on my QTL file that I got from a database.
      I already found different issues in this file, like start position bigger than end position and also lines with no positions, but I will do more checks!

      Comment


      • #4
        Originally posted by clarissaboschi View Post
        Thanks Colindaven, yes I was thinking that the issue is on my QTL file that I got from a database.
        I already found different issues in this file, like start position bigger than end position and also lines with no positions, but I will do more checks!
        Hi, the problem is caused by the redundancy of the QTL file. There are many line with the same start and end position, if you using the abbreviation as ID, there will be all the same, just like the following lines:

        Code:
        1       7232667 7273886 DRIPL   +
        1       7232667 7273886 DRIPL   +
        you can use "sort -u" to remove the redundancy.

        Another reason is the NULL position, like the following line:
        Code:
        Chr.10  Animal QTLdb    Meat_and_Carcass_Association                    .       .       .       QTL_ID=65998;Name=Juiciness score;Abbrev=JUICE;PUBMED_ID=22297614;trait_ID=65;trait=Juiciness score;breed=yorkshire;FlankMarkers;PTO_name=meat juiciness;Map_Type=Linkage;Model=Mendelian;Test_Base=-;peak_cM=73.1;Significance=Significant;P-value=0.0433;gene_ID=100517235;gene_IDsrc=NCBIgene
        Just remove these lines.
        Last edited by pengchy; 07-04-2016, 04:31 PM.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Best Practices for Single-Cell Sequencing Analysis
          by seqadmin



          While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
          Yesterday, 07:15 AM
        • seqadmin
          Latest Developments in Precision Medicine
          by seqadmin



          Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

          Somatic Genomics
          “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
          05-24-2024, 01:16 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:58 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 08:18 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 08:04 AM
        0 responses
        18 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 06-03-2024, 06:55 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Working...
        X