Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • michaellim
    Member
    • Dec 2014
    • 28

    #31
    Originally posted by GenoMax View Post
    @michaellim: What is your ultimate aim with this RNAseq study? Are you looking to do differential expression or just checking to see what is expressed under some specific condition(s)?

    For the immediate issue of not being able to see annotations you can use the gff file from Piet's example and see if that works with IGV. You should compare the pre- and post-trimming FastQC plots to see if there is an improvement in stats. Plots you have posted don't look bad but it is difficult to say if you have adapter contamination unless you try the trimming. No fastq grooming in galaxy should be necessary with MiSeq data. It is already in sanger fastq format.

    Once you go away from "model" organisms tools such as galaxy start becoming limiting (as you have already discovered). Depending on your overall goals it may be beneficial to start learning how to do these analyses on command line. If this is a small part of whatever you are trying to do then enlisting the help of a friend/local bioinformatics support folks may be the easiest thing to do so you can get a set of hypotheses to test at the bench and move on.
    Hi Genomax,

    Thank you for the advise, do you have any recommended handbook/guides/manual/website which teaches some RNA-seq analyses using command lines/softwares?

    I would really like to get a bioinformatician who knows about RNA-seq to help, however, no one in my department knows about it. That's why I have been hunting for information online.

    It would be easier if all the sequences has been input into Galaxy and IGV, however like what you've said, they only have certain 'model organism' and the rest are not in there. That's why I was thinking of inputting the reference genome (chromosome and plasmid; separately or combined,whichever more appropriate for analyses) myself.

    Thank you.

    Comment

    • piet
      Member
      • Aug 2014
      • 21

      #32
      Originally posted by michaellim View Post
      But how do I add the plasmid sequences too
      usually you will map your reads against a set of reference sequences. All reference sequences are stored in a single FASTA file and this FASTA file is passed to the mapping program (bwa for example). If the genome you want to use as a reference comprises one chromosome and two plasmids than you have to copy the three sequences into a single file. This is called a concatenation and is often done with the Unix command line program 'cat'. For a small genome like E.coli you may also do it with a text editor like notepad.

      Comment

      • michaellim
        Member
        • Dec 2014
        • 28

        #33
        Originally posted by piet View Post
        usually you will map your reads against a set of reference sequences. All reference sequences are stored in a single FASTA file and this FASTA file is passed to the mapping program (bwa for example). If the genome you want to use as a reference comprises one chromosome and two plasmids than you have to copy the three sequences into a single file. This is called a concatenation and is often done with the Unix command line program 'cat'. For a small genome like E.coli you may also do it with a text editor like notepad.
        Hi Piet,

        Does that mean I just paste all the chromosome & plasmid sequence together without breaks using one heading, example

        ">combinedsequences
        ccccccccccccccp1p1p1p1p1p1p1p1p2p2p2p2p2p2p2p2p2"

        c is chromosome sequence
        p1 is plasmid one sequence
        p2 is second plasmid's sequence

        OR

        in the notepad file, there are three headings?
        ">chromosomeseq
        cccccccccccccccccccccccc

        >plasmid1
        p1p1p1p1p1p1p1p1

        >plasmid2
        p2p2p2p2p2p2p2p2"

        Many thanks!

        Comment

        • Brian Bushnell
          Super Moderator
          • Jan 2014
          • 2709

          #34
          ">chromosomeseq
          cccccccccccccccccccccccc
          >plasmid1
          p1p1p1p1p1p1p1p1
          >plasmid2
          p2p2p2p2p2p2p2p2"

          ...would be correct.

          Comment

          • michaellim
            Member
            • Dec 2014
            • 28

            #35
            Originally posted by Brian Bushnell View Post
            ">chromosomeseq
            cccccccccccccccccccccccc
            >plasmid1
            p1p1p1p1p1p1p1p1
            >plasmid2
            p2p2p2p2p2p2p2p2"

            ...would be correct.

            Thanks Brian,

            By the way, I've downloaded the BBMap, unzipped it, but I must be not doing it correctly, I cannot seem to find an executable file to click to open the software. Can you please advise?

            Thank you.

            Comment

            • Brian Bushnell
              Super Moderator
              • Jan 2014
              • 2709

              #36
              Michael,

              It is written in Java, so the executable is actually java.exe, which needs to already be on the computer. If not you can get it here.

              As for installing BBMap, you need to unzip it (producing a .tar file), then untar it (producing a directory "bbmap"). After that it depends on your operating system, but in any case you need a command prompt (which you get in Windows by generally going to start->run->"cmd" (then push enter).

              For Linux and OSX running is very easy, just execute a shellscript (such as "bbmap.sh") by typing the name then hitting enter, which will give you instructions. For Windows it's a little more involved and depends on the specific program, but for BBMap, the command would be something like this, if you unzipped it in C:

              java -ea -Xmx1g -cp C:\bbmap\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

              If you still have any trouble, please let me know your operating system, where you unzipped the program, and where the reference and read files are located so I can give more specific instructions.

              Comment

              • michaellim
                Member
                • Dec 2014
                • 28

                #37
                Originally posted by Brian Bushnell View Post
                Michael,

                It is written in Java, so the executable is actually java.exe, which needs to already be on the computer. If not you can get it here.

                As for installing BBMap, you need to unzip it (producing a .tar file), then untar it (producing a directory "bbmap"). After that it depends on your operating system, but in any case you need a command prompt (which you get in Windows by generally going to start->run->"cmd" (then push enter).

                For Linux and OSX running is very easy, just execute a shellscript (such as "bbmap.sh") by typing the name then hitting enter, which will give you instructions. For Windows it's a little more involved and depends on the specific program, but for BBMap, the command would be something like this, if you unzipped it in C:

                java -ea -Xmx1g -cp C:\bbmap\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                If you still have any trouble, please let me know your operating system, where you unzipped the program, and where the reference and read files are located so I can give more specific instructions.
                Hi Brian,

                Yup in C:, copied the entire command but it states "could not find or load main class align2.bbmap"

                any idea what should I do? Maybe you can devise an executable icon file like those ClustalW alignment which we can just click and run for commandline dummies like me! haha..

                Thank you.

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #38
                  Is there an extra space in "C:\bbmap\ align2.BBMap" after bbmap\? Don't have a PC handy to test.

                  Comment

                  • michaellim
                    Member
                    • Dec 2014
                    • 28

                    #39
                    Originally posted by GenoMax View Post
                    Is there an extra space in "C:\bbmap\ align2.BBMap" after bbmap\? Don't have a PC handy to test.
                    Yup, there is a space. am I missing something?

                    Comment

                    • Brian Bushnell
                      Super Moderator
                      • Jan 2014
                      • 2709

                      #40
                      If you cut-and-pasted this:
                      "could not find or load main class align2.bbmap"

                      ...then the problem is capitalization; Java is case-sensitive (should be BBMap). If not, can you copy and paste the full command line and full error message?

                      Comment

                      • michaellim
                        Member
                        • Dec 2014
                        • 28

                        #41
                        Originally posted by Brian Bushnell View Post
                        If you cut-and-pasted this:
                        "could not find or load main class align2.bbmap"

                        ...then the problem is capitalization; Java is case-sensitive (should be BBMap). If not, can you copy and paste the full command line and full error message?
                        Hi Brian,

                        Here's the picture for you to have a look.

                        Thank you.
                        Attached Files

                        Comment

                        • Brian Bushnell
                          Super Moderator
                          • Jan 2014
                          • 2709

                          #42
                          Oh, that's completely my mistake. Sorry! Command should be:

                          java -ea -Xmx1g -cp C:\bbmap\current\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                          However, you need to replace "ecoli.fasta" and "reads.fq" with the actual filenames and paths to your reference file and reads file(s).

                          Comment

                          • michaellim
                            Member
                            • Dec 2014
                            • 28

                            #43
                            Originally posted by Brian Bushnell View Post
                            Oh, that's completely my mistake. Sorry! Command should be:

                            java -ea -Xmx1g -cp C:\bbmap\current\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                            However, you need to replace "ecoli.fasta" and "reads.fq" with the actual filenames and paths to your reference file and reads file(s).
                            Hi Brian,

                            Something happened this time, but still something missing? Oh ok, got it. thank you... Silly me.. hahaha

                            Thank you
                            Attached Files

                            Comment

                            • michaellim
                              Member
                              • Dec 2014
                              • 28

                              #44
                              Originally posted by Brian Bushnell View Post
                              Oh, that's completely my mistake. Sorry! Command should be:

                              java -ea -Xmx1g -cp C:\bbmap\current\ align2.BBMap ref=ecoli.fasta in=reads.fq out=mapped.sam

                              However, you need to replace "ecoli.fasta" and "reads.fq" with the actual filenames and paths to your reference file and reads file(s).
                              Hi Brian,

                              By the way, so for the mapping to work, does that mean, all the FASTQ files and the reference genome (fasta file) must be in the "current" folder or anywhere in the computer?

                              Thank you

                              Comment

                              • Brian Bushnell
                                Super Moderator
                                • Jan 2014
                                • 2709

                                #45
                                Originally posted by michaellim View Post
                                Hi Brian,

                                By the way, so for the mapping to work, does that mean, all the FASTQ files and the reference genome (fasta file) must be in the "current" folder or anywhere in the computer?

                                Thank you
                                No, that does not matter. If they are in the current folder you can use the file name alone (e.g. "reads.fq") and if they are in a different folder you can use the absolute path (e.g. "C:\genetics\reads.fq").

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  Yesterday, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, Yesterday, 12:03 PM
                                0 responses
                                19 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, Yesterday, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-26-2026, 10:12 AM
                                0 responses
                                31 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...