
  • #61
    Originally posted by beajorrin
    OK! In fact I have an insert size of 500bp, so I have to change it. I have to check the trimmed fastq to reduce the mispaired reads.
    Thanks
    Even if you are size selecting at 500bp it's probably best to give yourself some leeway for slightly longer inserts. Size selection isn't as exact as you might think and a 1kb cutoff should still remove most of the mapping noise which might otherwise be a problem.



    • #62
      Advanced SeqMonk course

      After promising to do this for ages I've finally finished writing an Advanced SeqMonk Course. It won't get its first official outing for a couple of weeks, but I've released the course material onto our web site so everyone can have a look.

      There are a couple of things in the course which require features which won't be released until v0.21.0 - but that should be coming fairly soon now.



      • #63
        Thanks for that Simon, it's a very nice document.



        • #64
          Originally posted by simonandrews
          After promising to do this for ages I've finally finished writing an Advanced SeqMonk Course. It won't get its first official outing for a couple of weeks, but I've released the course material onto our web site so everyone can have a look.

          There are a couple of things in the course which require features which won't be released until v0.21.0 - but that should be coming fairly soon now.
          Hi Simon,
          That advanced course is really helpful, thanks! When using the difference filter to identify differentially expressed genes, do you know what the appropriate interval is for RNA-Seq experiments? I have four KO samples and four WT samples, and I have calculated RPKM for all of them. Thank you in advance!



          • #65
            Originally posted by mediator
            Hi Simon,
            That advanced course is really helpful, thanks! When using the difference filter to identify differentially expressed genes, do you know what the appropriate interval is for RNA-Seq experiments? I have four KO samples and four WT samples, and I have calculated RPKM for all of them. Thank you in advance!
            For this type of experiment we'd recommend using the intensity difference filter rather than a straight difference filter. The intensity difference filter is a statistical filter where cutoffs are set as p-values, and we'd normally go with the default 0.05 cutoff. Details of how the filter works are in the advanced course.

            In your case, as you have 4 x 4 replicates, you could combine the replicate stats filter (for a conventional statistical analysis) with the intensity difference filter between the two replicate groups (to find differences which deviate significantly from 0). Run the intensity difference filter first, though, since it relies on seeing the whole distribution of points.
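
To give a feel for why the order matters, here is a rough Python sketch of a filter of this general kind. It is not SeqMonk's implementation (the advanced course has the actual details); the function name, the `window` parameter and the choice of a normal model centred on 0 are all assumptions made for illustration. Each point's difference is judged against the spread of differences among points of similar average intensity, which is why such a filter needs to see the whole, unfiltered distribution.

```python
import math


def intensity_difference_pvalues(a, b, window=50):
    """Two-sided p-values for a[i] - b[i], judged against points of similar intensity.

    a, b: equal-length lists of (log scale) quantitated values for two conditions.
    window: hypothetical number of intensity-ranked neighbours used as the local
            background distribution.
    """
    n = len(a)
    diffs = [x - y for x, y in zip(a, b)]
    # Rank points by average intensity so neighbours in this order are comparable.
    order = sorted(range(n), key=lambda i: (a[i] + b[i]) / 2)
    pvals = [1.0] * n
    half = window // 2
    for rank, idx in enumerate(order):
        lo = max(0, min(rank - half, n - window))
        hi = min(n, lo + window)
        local = [diffs[order[j]] for j in range(lo, hi)]
        # Assumption: local differences are centred on 0, so only the spread is estimated.
        sd = math.sqrt(sum(d * d for d in local) / len(local))
        if sd > 0:
            z = abs(diffs[idx]) / sd
            pvals[idx] = math.erfc(z / math.sqrt(2))  # two-sided normal tail
    return pvals
```

If most points had already been removed by an earlier filter, the local background would no longer represent the typical noise at each intensity, and the p-values would be distorted. A multiple testing correction would normally be applied to the returned values before using a cutoff such as 0.05.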



            • #66
              Thank you Simon!



              • #67
                Originally posted by simonandrews
                For this type of experiment we'd recommend using the intensity difference filter rather than a straight difference filter. The intensity difference filter is a statistical filter where cutoffs are set as p-values, and we'd normally go with the default 0.05 cutoff. Details of how the filter works are in the advanced course.

                In your case, as you have 4 x 4 replicates, you could combine the replicate stats filter (for a conventional statistical analysis) with the intensity difference filter between the two replicate groups (to find differences which deviate significantly from 0). Run the intensity difference filter first, though, since it relies on seeing the whole distribution of points.
                Hi Simon,
                Do you know if SeqMonk can show the exact base pairs for each read? It would be very helpful for detecting indels and de novo mutations. Thank you!



                • #68
                  Hello Simon,

                  Can we use SeqMonk to visualize CNVs? I know there are several tools for predicting copy number changes, but I am just wondering if there is a way of visualizing these copy number changes from NGS data using SeqMonk.

                  Thanks,
                  Praful



                  • #69
                    Originally posted by mediator
                    Hi Simon,
                    Do you know if SeqMonk can show the exact base pairs for each read? It would be very helpful for detecting indels and de novo mutations. Thank you!
                    Sorry, but no it can't. SeqMonk operates purely on mapped positions. This allows it to analyse a billion plus reads on a normal desktop PC, but does mean that there's no direct connection to the original sequences of the submitted reads. We've thought about allowing it to keep a connection to the original genomic sequence (so you could, for example, look for trends vs specific motifs, GC content etc.), but it's very unlikely we're ever going to add mutation information to each read, since this would kill the highly optimised data model we have for storing and manipulating these reads.



                    • #70
                      Originally posted by aggp11
                      Hello Simon,

                      Can we use SeqMonk to visualize CNVs? I know there are several tools for predicting copy number changes, but I am just wondering if there is a way of visualizing these copy number changes from NGS data using SeqMonk.
                      Hi Praful,

                      SeqMonk should certainly be able to do this. You'd probably want to do a simple read count over tiled probes which are large enough to contain enough data to get a reliable measure of the read depth, but small enough to catch smaller deletions. There are then a number of different tools to allow you to compare different samples and find differences between samples, or outliers from the normal coverage distribution in a single sample.

                      This isn't something our group works on much, but we've certainly used the program to confirm targeted knockouts that we've made, so the same principles could be used to find novel deletions or duplications.
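
Outside the program, the tiled read count idea described above can be sketched in a few lines of Python. This is only an illustration of the principle, not what SeqMonk does internally: the function names, the 0-based read start positions, the pseudocount and the log2 threshold are all assumptions chosen for the example.

```python
import math
from collections import Counter
from statistics import median


def tiled_read_counts(read_starts, chrom_length, tile_size):
    """Count reads per fixed-size tile; tile i covers [i*tile_size, (i+1)*tile_size)."""
    n_tiles = (chrom_length + tile_size - 1) // tile_size
    counts = Counter(
        pos // tile_size for pos in read_starts if 0 <= pos < chrom_length
    )
    return [counts.get(i, 0) for i in range(n_tiles)]


def flag_outlier_tiles(counts, min_abs_log2=1.0, pseudocount=0.5):
    """Return (tile_index, count, log2_ratio) for tiles far from the median depth."""
    med = median(counts)
    flagged = []
    for i, c in enumerate(counts):
        # The pseudocount keeps the ratio finite for empty tiles.
        ratio = math.log2((c + pseudocount) / (med + pseudocount))
        if abs(ratio) >= min_abs_log2:
            flagged.append((i, c, ratio))
    return flagged
```

The tile size is the trade-off mentioned above: larger tiles give a steadier depth estimate, smaller tiles can pick up smaller deletions. Comparing against the median of a single sample corresponds to looking for outliers from its normal coverage distribution; comparing two samples would use the ratio of their per-tile counts instead.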



                      • #71
                        Hi Simon,
                        first, once again, lots of compliments for SeqMonk. I don't feel like I can fully grasp a new RNA-seq experiment until I've viewed it in SeqMonk!

                        That said, I have a question, maybe a trivial one: is there a way to load a custom set of genes (let's say a particular class of genes) to get, for example, a chromosome overview of their expression and mapping across chromosomes?

                        If I can also suggest an improvement, I'd like to be able to resize the sample window. If I have lots of samples, I may want to focus on one interesting sample so that its mapped reads are fully visible. With more than 5-6 samples it is hard to visualize everything, so it's better to select one or a few samples (e.g. for deciphering alternative splicing claims). I know I can delete a sample, but resizing or hiding one or more samples may be a better solution?
                        Thanks a lot for considering these notes!
                        pbseq



                        • #72
                          Originally posted by pbseq
                          Hi Simon,
                          first, once again, lots of compliments for SeqMonk. I don't feel like I can fully grasp a new RNA-seq experiment until I've viewed it in SeqMonk!
                          Thanks! It's always great for us to hear feedback from other people using the program.

                          Originally posted by pbseq
                          That said, I have a question, maybe a trivial one: is there a way to load a custom set of genes (let's say a particular class of genes) to get, for example, a chromosome overview of their expression and mapping across chromosomes?
                          Sure, but I guess this will depend on how you're defining your group. The method we use most commonly is the feature search tool (Edit > Find Feature) to identify a group of genes/transcripts based on their annotation. This would include things like GeneOntology terms or anything else you find in the annotation. Once you have the list of hits visible you can use the option at the bottom to turn the hits into a new annotation track. Once you have a track containing just your features of interest you can either quantitate over these features directly, or do a wider quantitation and then use the feature filter to pull out just the probes which overlap with your selected set of features.

                          Originally posted by pbseq
                          If I can also suggest an improvement, I'd like to be able to resize the sample window. If I have lots of samples, I may want to focus on one interesting sample so that its mapped reads are fully visible. With more than 5-6 samples it is hard to visualize everything, so it's better to select one or a few samples (e.g. for deciphering alternative splicing claims). I know I can delete a sample, but resizing or hiding one or more samples may be a better solution?
                          I'm not sure I get what you mean here. You can remove a sample from the main chromosome view without deleting it from your project. Just go to View > Set Data Tracks and you can choose which samples you want to have visible, and in which order. The removed samples are still in your project and can be added back to the view whenever you like.

                          I suspect I may be missing the point you're making though.

                          If you're interested in looking at alternative splicing and haven't seen this already, a really neat option is to import just the spliced introns into your project. If you have a spliced mapped SAM/BAM file (e.g. from TopHat), import it and select "Split Spliced Reads" and "Import Introns rather than exons", and you'll see just the splices which you've observed. You can quantitatively analyse these by using the Read Position Probe Generator followed by the Exact Overlap Count Quantitation. We've found this way of looking at the data to be really helpful in deciding if there really is a change in the splicing pattern between samples.
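
For anyone curious what "importing introns" amounts to, here is a small Python sketch of the idea. It is an illustration under stated assumptions rather than SeqMonk's code: it takes the 1-based POS and CIGAR fields from SAM records, treats every `N` operation as an observed intron, and counts how many reads support each identical intron. The function names and the `(chrom, pos, cigar)` input format are made up for the example.

```python
import re
from collections import Counter

# One CIGAR operation: a length followed by an operation code.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def introns_from_alignment(pos, cigar):
    """Return 1-based inclusive (start, end) reference spans of each N operation."""
    introns = []
    ref = pos  # leftmost mapped reference position (SAM POS field)
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op == "N":
            introns.append((ref, ref + length - 1))
        # Only these operations advance along the reference.
        if op in "MDN=X":
            ref += length
    return introns


def count_introns(alignments):
    """alignments: iterable of (chrom, pos, cigar); returns read support per intron."""
    counts = Counter()
    for chrom, pos, cigar in alignments:
        for start, end in introns_from_alignment(pos, cigar):
            counts[(chrom, start, end)] += 1
    return counts
```

Counting reads per identical intron is roughly analogous to placing a probe over each observed read position and counting exact overlaps, which makes it easy to compare the usage of individual splice junctions between samples.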



                          • #73
                            Thanks a lot Simon, great hints. SeqMonk really has a lot of features to explore!

                            pbseq



                            • #74
                              Hi Simon,
                              For a BED file (generated by Scripture from RNA-Seq data), which quantification pipeline would you recommend? I am trying to compare BED files between patients and healthy controls in order to find splice variants unique to patients. Thank you!



                              • #75
                                Originally posted by mediator
                                Hi Simon,
                                For a BED file (generated by Scripture from RNA-Seq data), which quantification pipeline would you recommend? I am trying to compare BED files between patients and healthy controls in order to find splice variants unique to patients. Thank you!
                                I've not used Scripture before, but looking at the documentation it looks like the data you get out of Scripture is probably more processed than you'd want to put into SeqMonk as a data track. We'd normally import the output of TopHat into the program, importing either the spliced exonic reads or the introns, depending on what we were looking for.

                                From what I can see Scripture tries to create assembled transcripts from your raw data, so I guess the best way to handle this would be to import it as an annotation track rather than a data track. If the features it produces are spliced then you'd need to import them as GTF or GFFv3 files, since none of the other annotation formats supported by SeqMonk can handle multi-location features.

                                Once you have these elements in place you could quantitate the various Scripture transcripts in your datasets and then compare these. You could use the standard RNA-Seq quantitation pipeline and follow the basic RNA-Seq methodology (I'm actually in the process of producing an improved RNA-Seq guide, since we have a pretty solid way of dealing with this data now).
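
If the spliced transcripts only exist as BED, one way to get them into GTF is a small conversion script. The Python sketch below assumes the BED file uses the standard 12-column layout (with block sizes and block starts in columns 11 and 12); I haven't checked this against actual Scripture output, and the function name, the `source` label and the attribute layout are illustrative choices rather than anything SeqMonk requires.

```python
def bed12_to_gtf_lines(bed_line, source="scripture"):
    """Convert one 12-column BED record into GTF exon lines, one per block."""
    f = bed_line.rstrip("\n").split("\t")
    chrom, start, name, score, strand = f[0], int(f[1]), f[3], f[4], f[5]
    sizes = [int(x) for x in f[10].split(",") if x]
    offsets = [int(x) for x in f[11].split(",") if x]
    # Using the BED name for both IDs is an assumption made for this sketch.
    attrs = f'gene_id "{name}"; transcript_id "{name}";'
    lines = []
    for size, offset in zip(sizes, offsets):
        # BED is 0-based half-open; GTF is 1-based inclusive.
        exon_start = start + offset + 1
        exon_end = start + offset + size
        lines.append("\t".join([
            chrom, source, "exon", str(exon_start), str(exon_end),
            score, strand, ".", attrs,
        ]))
    return lines
```

Each exon line carries the same transcript_id, which is how GTF represents a multi-location feature. It would be worth checking the first few converted records against the original BED coordinates in a genome browser before importing the whole file.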

