Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #31
    Originally posted by GenoMax View Post
    @michaellim: What is your ultimate aim with this RNAseq study? Are you looking to do differential expression or just checking to see what is expressed under some specific condition(s)?

    For the immediate issue of not being able to see annotations you can use the gff file from Piet's example and see if that works with IGV. You should compare the pre- and post-trimming FastQC plots to see if there is an improvement in stats. Plots you have posted don't look bad but it is difficult to say if you have adapter contamination unless you try the trimming. No fastq grooming in galaxy should be necessary with MiSeq data. It is already in sanger fastq format.

    Once you go away from "model" organisms tools such as galaxy start becoming limiting (as you have already discovered). Depending on your overall goals it may be beneficial to start learning how to do these analyses on command line. If this is a small part of whatever you are trying to do then enlisting the help of a friend/local bioinformatics support folks may be the easiest thing to do so you can get a set of hypotheses to test at the bench and move on.
    Hi Genomax,

    Thank you for the advise, do you have any recommended handbook/guides/manual/website which teaches some RNA-seq analyses using command lines/softwares?

    I would really like to get a bioinformatician who knows about RNA-seq to help, however, no one in my department knows about it. That's why I have been hunting for information online.

    It would be easier if all the sequences has been input into Galaxy and IGV, however like what you've said, they only have certain 'model organism' and the rest are not in there. That's why I was thinking of inputting the reference genome (chromosome and plasmid; separately or combined,whichever more appropriate for analyses) myself.

    Thank you.

    Comment


    • #32
      Originally posted by michaellim View Post
      But how do I add the plasmid sequences too
      usually you will map your reads against a set of reference sequences. All reference sequences are stored in a single FASTA file and this FASTA file is passed to the mapping program (bwa for example). If the genome you want to use as a reference comprises one chromosome and two plasmids than you have to copy the three sequences into a single file. This is called a concatenation and is often done with the Unix command line program 'cat'. For a small genome like E.coli you may also do it with a text editor like notepad.

      Comment


      • #33
        Originally posted by piet View Post
        usually you will map your reads against a set of reference sequences. All reference sequences are stored in a single FASTA file and this FASTA file is passed to the mapping program (bwa for example). If the genome you want to use as a reference comprises one chromosome and two plasmids than you have to copy the three sequences into a single file. This is called a concatenation and is often done with the Unix command line program 'cat'. For a small genome like E.coli you may also do it with a text editor like notepad.
        Hi Piet,

        Does that mean I just paste all the chromosome & plasmid sequence together without breaks using one heading, example

        ">combinedsequences
        ccccccccccccccp1p1p1p1p1p1p1p1p2p2p2p2p2p2p2p2p2"

        c is chromosome sequence
        p1 is plasmid one sequence
        p2 is second plasmid's sequence

        OR

        in the notepad file, there are three headings?
        ">chromosomeseq
        cccccccccccccccccccccccc

        >plasmid1
        p1p1p1p1p1p1p1p1

        >plasmid2
        p2p2p2p2p2p2p2p2"

        Many thanks!

        Comment


        • #34
          ">chromosomeseq
          cccccccccccccccccccccccc
          >plasmid1
          p1p1p1p1p1p1p1p1
          >plasmid2
          p2p2p2p2p2p2p2p2"

          ...would be correct.

          Comment


          • #35
            Originally posted by Brian Bushnell View Post
            ">chromosomeseq
            cccccccccccccccccccccccc
            >plasmid1
            p1p1p1p1p1p1p1p1
            >plasmid2
            p2p2p2p2p2p2p2p2"

            ...would be correct.

            Thanks Brian,

            By the way, I've downloaded the BBMap, unzipped it, but I must be not doing it correctly, I cannot seem to find an executable file to click to open the software. Can you please advise?

            Thank you.

            Comment


            • #36
              Michael,

              It is written in Java, so the executable is actually java.exe, which needs to already be on the computer. If not you can get it here.

              As for installing BBMap, you need to unzip it (producing a .tar file), then untar it (producing a directory "bbmap"). After that it depends on your operating system, but in any case you need a command prompt (which you get in Windows by generally going to start->run->"cmd" (then push enter).

              For Linux and OSX running is very easy, just execute a shellscript (such as "bbmap.sh") by typing the name then hitting enter, which will give you instructions. For Windows it's a little more involved and depends on the specific program, but for BBMap, the command would be something like this, if you unzipped it in C:

              java -ea -Xmx1g -cp C:\bbmap\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

              If you still have any trouble, please let me know your operating system, where you unzipped the program, and where the reference and read files are located so I can give more specific instructions.

              Comment


              • #37
                Originally posted by Brian Bushnell View Post
                Michael,

                It is written in Java, so the executable is actually java.exe, which needs to already be on the computer. If not you can get it here.

                As for installing BBMap, you need to unzip it (producing a .tar file), then untar it (producing a directory "bbmap"). After that it depends on your operating system, but in any case you need a command prompt (which you get in Windows by generally going to start->run->"cmd" (then push enter).

                For Linux and OSX running is very easy, just execute a shellscript (such as "bbmap.sh") by typing the name then hitting enter, which will give you instructions. For Windows it's a little more involved and depends on the specific program, but for BBMap, the command would be something like this, if you unzipped it in C:

                java -ea -Xmx1g -cp C:\bbmap\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                If you still have any trouble, please let me know your operating system, where you unzipped the program, and where the reference and read files are located so I can give more specific instructions.
                Hi Brian,

                Yup in C:, copied the entire command but it states "could not find or load main class align2.bbmap"

                any idea what should I do? Maybe you can devise an executable icon file like those ClustalW alignment which we can just click and run for commandline dummies like me! haha..

                Thank you.

                Comment


                • #38
                  Is there an extra space in "C:\bbmap\ align2.BBMap" after bbmap\? Don't have a PC handy to test.

                  Comment


                  • #39
                    Originally posted by GenoMax View Post
                    Is there an extra space in "C:\bbmap\ align2.BBMap" after bbmap\? Don't have a PC handy to test.
                    Yup, there is a space. am I missing something?

                    Comment


                    • #40
                      If you cut-and-pasted this:
                      "could not find or load main class align2.bbmap"

                      ...then the problem is capitalization; Java is case-sensitive (should be BBMap). If not, can you copy and paste the full command line and full error message?

                      Comment


                      • #41
                        Originally posted by Brian Bushnell View Post
                        If you cut-and-pasted this:
                        "could not find or load main class align2.bbmap"

                        ...then the problem is capitalization; Java is case-sensitive (should be BBMap). If not, can you copy and paste the full command line and full error message?
                        Hi Brian,

                        Here's the picture for you to have a look.

                        Thank you.
                        Attached Files

                        Comment


                        • #42
                          Oh, that's completely my mistake. Sorry! Command should be:

                          java -ea -Xmx1g -cp C:\bbmap\current\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                          However, you need to replace "ecoli.fasta" and "reads.fq" with the actual filenames and paths to your reference file and reads file(s).

                          Comment


                          • #43
                            Originally posted by Brian Bushnell View Post
                            Oh, that's completely my mistake. Sorry! Command should be:

                            java -ea -Xmx1g -cp C:\bbmap\current\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                            However, you need to replace "ecoli.fasta" and "reads.fq" with the actual filenames and paths to your reference file and reads file(s).
                            Hi Brian,

                            Something happened this time, but still something missing? Oh ok, got it. thank you... Silly me.. hahaha

                            Thank you
                            Attached Files

                            Comment


                            • #44
                              Originally posted by Brian Bushnell View Post
                              Oh, that's completely my mistake. Sorry! Command should be:

                              java -ea -Xmx1g -cp C:\bbmap\current\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                              However, you need to replace "ecoli.fasta" and "reads.fq" with the actual filenames and paths to your reference file and reads file(s).
                              Hi Brian,

                              By the way, so for the mapping to work, does that mean, all the FASTQ files and the reference genome (fasta file) must be in the "current" folder or anywhere in the computer?

                              Thank you

                              Comment


                              • #45
                                Originally posted by michaellim View Post
                                Hi Brian,

                                By the way, so for the mapping to work, does that mean, all the FASTQ files and the reference genome (fasta file) must be in the "current" folder or anywhere in the computer?

                                Thank you
                                No, that does not matter. If they are in the current folder you can use the file name alone (e.g. "reads.fq") and if they are in a different folder you can use the absolute path (e.g. "C:\genetics\reads.fq").

                                Comment

                                Latest Articles

                                Collapse

                                • seqadmin
                                  Strategies for Sequencing Challenging Samples
                                  by seqadmin


                                  Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                  03-22-2024, 06:39 AM
                                • seqadmin
                                  Techniques and Challenges in Conservation Genomics
                                  by seqadmin



                                  The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                  Avian Conservation
                                  Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                  03-08-2024, 10:41 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by seqadmin, Yesterday, 06:37 PM
                                0 responses
                                8 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, Yesterday, 06:07 PM
                                0 responses
                                8 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 03-22-2024, 10:03 AM
                                0 responses
                                49 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 03-21-2024, 07:32 AM
                                0 responses
                                67 views
                                0 likes
                                Last Post seqadmin  
                                Working...
                                X