C3G Resource Usage


Where can I find more details on CCDB servers, file system, usage guidelines?

You can visit Digital Research Alliance of Canada, formerly Compute Canada (CCDB), Technical Documentation Wiki Site to learn more about the list of available servers, systems and services available, How-to guides and usage policy.

Is there a list of software installed on Digital Research Alliance servers?

See list of available software and globally deployed modules on Digital Research Alliance of Canada servers.

What are modules and why do we need them for GenPipes?

GenPipes is developed in Python. Modules in Python are a way to load software or tools just in time, when a program needs them.

The modules that come along with GenPipes allow you to use bioinformatics tools like Samtools, Homer, MACS2, without installing them yourself.

For details on why we need modules for GenPipes and which ones are required, deployed and pre-installed the Digital Research Alliance Canada Servers, see the list of available modules.

What are GenPipes genomes? Where can I access them from?

GenPipes pipelines are used for genomic analysis and they require reference genomes. C3G, in partnership with Digital Research Alliance Canada, maintains several genomes that are available on several HPC centres including Rorqual, Nibi, Fir, Trillium and Narval. In addition to the FASTA sequence, many genomes include aligner indices and annotation files.

To access these genomes, you need to add the following lines to your .bashrc file:

## GenPipes/MUGQIC genomes and modules
user@machine:~$ export MUGQIC_INSTALL_HOME=/cvmfs/soft.mugqic/CentOS6

To explore the available genomes, you can type:

user@machine:~$ ls $MUGQIC_INSTALL_HOME/genomes/species/

C3G, in partnership with Compute Canada, maintains several genomes that are available on several HPC centres. For a list of available genomes, visit Bioinformatics resources - genomes. In addition to the fasta sequence, many genomes include aligner indices and annotation files.

See genome files here.

What is meant by test dataset? Where can I find available test datasets?

GenPipes pipelines can be run using your sequencing instruments generated, measured, sampled read datasets in respective formats as required by individual pipelines or test datasets. Refer to GenPipes Test Datasets for various available test datasets that can be used to run various GenPipes pipelines, in case you don’t have your own dataset to be processed.