User Manual for the Enrichment Cytoscape Map Plugin

Overview

The Enrichment Map Cytoscape Plugin enables to visualize the results of gene-set enrichment as a network. Nodes represent gene-sets and edges represent mutual overlap; in this way, highly redundant gene-sets are grouped together as clusters, dramatically improving the capability to navigate and interpret results. Gene Set Enrichment

Gene-set enrichment is a data analysis technique receiving in input

and generating in output the list of enriched gene-sets, i.e. best summarizing the gene-list. It is common to refer to gene-set enrichment as functional enrichment because functional categories (e.g. Gene Ontology) are commonly used as gene-sets.

Installation

The Enrichment Map Plugin requires Cytoscape Version 2.6.x. If you don't have Cytoscape or an older Version (2.5 or older), please download the latest Release from http://www.cytoscape.org/ and install it on your computer.

Quick Start Guide

Creating an Enrichment Map

You have two main options:

The only difference between the two modes is the structure of the enrichment table(s). In either case, to use the plugin you will need the following files:

(*) GSEA saves the enrichment table as a .xls file; however, these are not true Excel files, they are tab-separated text files with a modified extension; Enrichment Map does not work with "true" Excel .xls files.

If your enrichment results were generated from GSEA, you will just have to pick the right files from your results folder. If you have generated the enrichment results using another method, you will have to go to the Full User Guide, File Format section, and make sure that the file format complies with Enrichment Map requirements.

You can use the parameter defaults. For a more careful choice of the parameter settings, please go to the Full User Guide, Tips on Parameter Choice.

Graphical Mapping of Enrichment

Exploring the Enrichment Map

Advanced tips

Full User Guide

File Formats

[#genesets -]

Gene sets file (GMT file)

Expression Data file (GCT, TXT or RNK file)

 $ replace_probeSetIDs.py --help
 Usage: replace_probeSetIDs.py [options] -i input.gct -o output.gct -c platform.chip

 Options:
   --version             show program's version number and exit
   -h, --help            show this help message and exit
   -i FILE, --input=FILE
                         input .gct file
   -o FILE, --output=FILE
                         output .gct file
   -c FILE, --chip=FILE  Chip File

Enrichment Results files

GSEA result files

Generic results files

Additional Information on GSEA File Formats

Additional Information on GSEA File Formats can be found here

RPT files

Advanced Settings

Tips on Parameter Choice

P-value and FDR Thresholds

Here are different sets of thresholds you may consider for GSEA:

We recommend to use permissive thresholds only if your having a hard time finding any enriched terms. For high quality, high coverage transcriptomic data, the number of enriched terms at the very conservative threshold is usually 100-250.

Jaccard vs. Overlap Coefficient

Overlap Thresholds

Jaccard Thresholds

The Input Panel

[ATTACH]

The Data Panel

  • The bottom (south) panel.

Expression Viewer

  • There are two different types of Expression Viewers, each is represnted as a separate tab in data panel:
    • EM Overlap - shows the expression of genes in the overlap (intersection) of all the genesets selected
    • EM Geneset - shows the expression of genes of the union of all the genesets selected.
  • Features of the Expression Viewer include:
    • Normalization
      • Data as is - represents the data as it was loaded
      • Row Normalize Data -
      • Log Transform Data - takes the log of each expression value
    • Sorting
      • Currently the data can be sorted by ranks loaded via the interface associated with dataset 1 or dataset2 * further functionality is currently under construction

    • Save Expression Set
      • The user can save the subset of expression values currently being viewed in the expression viewer as txt file.

Node Attributes

  • For each Enrichment map created the following attributes are created for each node:
    • EM#_Name - the gene set name
    • EM#_Formatted_name - a wrapped version of the gene set name so it is easy to visualize.

      • Note: This is the default label of the node but some users find it easier to arrange the network when the name is not wrapped. If this is the case in the vizmapper the user can switch the label mapping from EM#_formatted_name to EM#_name.

    • EM#_GS_DESCR - the gene set description (as specified in the second column of the gmt file)
  • Additionally there are attributes created for each dataset (a different set for each dataset if using two dataset mode):
    • EM#_pvalue_dataset(1 or 2) - Gene set p-value, as specified in GSEA enrichment result file.
    • EM#_qvalue_dataset(1 or 2) - Gene set q-value, as specified in GSEA enrichment result file.
    • EM#_Colouring_dataset(1 or 2) - Enrcihment map parameter calculated using the formulat 1-pvalue multiplied by the sign of the ES score (if using GSEA mode) or the phenotype (if using the Generic mode)
    • GSEA specific attributes (these attributes are not populated when creating an enrichment map using the generic mode)
      • EM#_ES_dataset(1 or 2) - Enrcihment score, as specified in GSEA enrichment result file.
      • EM#_NS_dataset(1 or 2) - Normalized Enrcihment score, as specified in GSEA enrichment result file.
      • EM#_fwer_dataset(1 or 2) - Family-wise error score, as specified in GSEA enrichment result file.

Edge Attributes

  • For each Enrichment map created the following attributes are created for each edge:
    • EM#_Overlap_size - the number of genes associated with the overlap of the two genesets that this edge connects.
    • EM#_jaccard_coeffecient - the calculated coeffecient for this edge.

      • Note: Although the name of this attribute specifies the jaccard, this attribute contains the coeffecient of either the jaccard or overlap depending on which was specified in the analysis.

The Results Panel

  • The right (east) panel

Parameters pane

  • under construction

Software/EnrichmentMap/UserManual (last edited 2009-09-03 15:44:44 by RuthIsserlin)