Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • kjaja
    Member
    • Aug 2011
    • 58

    gender check using sequencing data

    Hi All,

    I have exome sequencing data from siblings and would like to confirm their gender and relationship using genetics to make sure they are in fact siblings. I also want to confirm that the gender information is correct. Are there tools out there that can do this using exome seq data?

    Thanks,
  • vivek_
    PhD Student
    • Jul 2012
    • 164

    #2
    To verify pedigree this post might help:

    The Center for Public Health Genomics at UVA is focused on translational and personalized medicine — moving gene discovery into the delivery of health care.


    To check gender, may be you could check the number of aligned reads to Y chromsome?

    Comment

    • jgibbons1
      Senior Member
      • Oct 2009
      • 135

      #3
      I would second that -- map reads against a panel of Y-chromosome genes/exons.

      Comment

      • xied75
        Senior Member
        • Feb 2012
        • 129

        #4
        But I have pure female reads that can map a lot Y?

        Comment

        • LiLin
          Member
          • May 2011
          • 15

          #5
          The average sequencing depth of chrY and chrX

          Comment

          • balaji
            Junior Member
            • Feb 2011
            • 9

            #6
            in plink there is an option to check sex using X chr, I hope you are looking for this..
            plink --bfile data --impute-sex --make-bed --out newfile

            Comment

            • husamia
              Member
              • Apr 2010
              • 66

              #7
              I also use the average depth of X and Y

              Comment

              • bw.
                Member
                • Mar 2012
                • 21

                #8
                Testing sex determination with CCLE samples

                I've tried using [num reads mapped to chrX] / [num reads mapped to chrY]
                to determine sex in some CCLE exome-seq samples. The ratios turned out to be:

                9.4 -- s1
                304.6 -- s2
                272.9 -- s3
                168.3 -- s4
                220.6 -- s5
                297.8 -- s6
                226.1 -- s7
                257.1 -- s8
                241.9 -- s9
                287.0 -- s10
                278.6 -- s11
                260.3 -- s12
                9.7 -- s13
                8.7 -- s14
                261.2 -- s15
                279.3 -- s16
                9.0 -- s17
                8.5 -- s18
                260.7 -- s19
                297.4 -- s20
                8.7 -- s21
                261.8 -- s22
                189.0 -- s23
                147.4 -- s24
                291.2 -- s25
                So it looks like the difference is pretty wide -
                [num reads mapped to chrX] / [num reads mapped to chrY] is < 10 for all male samples and > 100 for all female samples.

                Still, I'm not sure whether these thresholds are stable across exome-seq kits, gene panels, etc. I wonder if there's a more robust way to determine sex.
                Last edited by bw.; 02-13-2014, 11:07 AM.

                Comment

                • oyvindbusk
                  Member
                  • Jan 2011
                  • 14

                  #9
                  How about using % heterozygosity on X (without the pseudoautosomal regions (X:60000-2699520 and X:154931043-155260560). In our lab, male = < 30 % and female = > 50 %.

                  Comment

                  • swbarnes2
                    Senior Member
                    • May 2008
                    • 910

                    #10
                    Gender = biological sex + culture. You don't care about people's gender, you care about their sex. (And even the biology is not black and white 100% of the time)

                    Comment

                    • bw.
                      Member
                      • Mar 2012
                      • 21

                      #11
                      @swbarnes2 cool. never realized there was a difference.

                      @oyvindbusk thanks, I also tried this and ended up with similar thresholds (male < 40% and female > 50%). I didn't try to filter out pseudoautosomal regions since their coordinates differ across species and assembly versions (based on PAR coordinates at:


                      ).


                      Looking at 322 CCLE samples, 233 were called Male, 73 Female, and 10 Unknown (which is >= 40% and <= 50%). Out of the 233 Male, only 5 would have been called differently with your thresholds. I will see if I can check the thresholds against a different approach. Also, a lot of the CCLE cells have copy number amplifications / deletions, so these results might be skewed by that.

                      Here is the distribution of nHet / nHomo for chrX in CCLE samples (I used this instead of nHet/(nHet+nHomo)). The 2 vertical blue lines are equivalent to 40% and 50% thresholds, and the 30% threshold is the red line.

                      Last edited by bw.; 02-12-2014, 11:48 PM.

                      Comment

                      Latest Articles

                      Collapse

                      • SEQadmin2
                        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                        by SEQadmin2


                        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                        Here are nine questions we think about, in roughly the order they matter, before...
                        06-18-2026, 07:11 AM
                      • SEQadmin2
                        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                        by SEQadmin2


                        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                        ...
                        06-02-2026, 10:05 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by SEQadmin2, 06-17-2026, 06:09 AM
                      0 responses
                      26 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-09-2026, 11:58 AM
                      0 responses
                      43 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-05-2026, 10:09 AM
                      0 responses
                      48 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-04-2026, 08:59 AM
                      0 responses
                      49 views
                      0 reactions
                      Last Post SEQadmin2  
                      Working...