User Manual for the Enrichment Cytoscape Map Plugin

Overview

The Enrichment Map Cytoscape Plugin enables to visualize the results of gene-set enrichment as a network. Nodes represent gene-sets and edges represent mutual overlap; in this way, highly redundant gene-sets are grouped together as clusters, dramatically improving the capability to navigate and interpret results. Gene Set Enrichment

Gene-set enrichment is a data analysis technique receiving in input

and generating in output the list of enriched gene-sets, i.e. best summarizing the gene-list. It is common to refer to gene-set enrichment as functional enrichment because functional categories (e.g. Gene Ontology) are commonly used as gene-sets.

Installation

The Enrichment Map Plugin requires Cytoscape Version 2.6.x. If you don't have Cytoscape or an older Version (2.5 or older), please download the latest Release from http://www.cytoscape.org/ and install it on your computer.

Quick Start Guide

Creating an Enrichment Map

You have two main options:

The only difference between the two modes is the structure of the enrichment table(s). In either case, to use the plugin you will need the following files:

(*) GSEA saves the enrichment table as a .xls file; however, these are not true Excel files, they are tab-separated text files with a modified extension; Enrichment Map does not work with "true" Excel .xls files.

If your enrichment results were generated from GSEA, you will just have to pick the right files from your results folder. If you have generated the enrichment results using another method, you will have to go to the Full User Guide, File Format section, and make sure that the file format complies with Enrichment Map requirements.

You can use the parameter defaults. For a more careful choice of the parameter settings, please go to the Full User Guide, Tips on Parameter Choice.

Graphical Mapping of Enrichment

Exploring the Enrichment Map

Advanced tips

Full User Guide

File Formats

Gene sets file (GMT file)

Expression Data file (GCT, TXT or RNK file)

 $ replace_probeSetIDs.py --help
 Usage: replace_probeSetIDs.py [options] -i input.gct -o output.gct -c platform.chip

 Options:
   --version             show program's version number and exit
   -h, --help            show this help message and exit
   -i FILE, --input=FILE
                         input .gct file
   -o FILE, --output=FILE
                         output .gct file
   -c FILE, --chip=FILE  Chip File

Enrichment Results files

GSEA result files

Generic results files

Additional Information on GSEA File Formats

Additional Information on GSEA File Formats can be found here

RPT files

Advanced Settings

Tips on Parameter Choice

P-value and FDR Thresholds

Here are different sets of thresholds you may consider for GSEA:

We recommend to use permissive thresholds only if your having a hard time finding any enriched terms. For high quality, high coverage transcriptomic data, the number of enriched terms at the very conservative threshold is usually 100-250.

Jaccard vs. Overlap Coefficient

Overlap Thresholds

Jaccard Thresholds

The Data Panel

Expression viewer

* There are two different types of Expression Viewers, each is represnted as a separate tab in data panel
  • EM Overlap - shows the expression of genes in the overlap (intersection) of all the genesets selected
  • EM Geneset - shows the expression of genes of the union of all the genesets selected.
  • Features of the Expression Viewer include:
    • Normalization
      • Data as is - represents the data as it was loaded
      • Row Normalize Data -
      • Log Transform Data - takes the log of each expression value
    • Sorting
      • Currently the data can be sorted by ranks loaded via the interface associated with dataset 1 or dataset2.
      • further functionality is currently under construction

    • Save Expression Set
      • The user can save the subset of expression values currently being viewed in the expression viewer as txt file.
  • Node Attributes

    Edge Attributes

    • For each Enrichment map created the following attributes are created for each edge:
      • EM#_Overlap_size - the number of genes associated with the overlap of the two genesets that this edge connects.
      • EM#_jaccard_coeffecient - the calculated coeffecient for this edge.

        • ~-Note: Although the name of this attribute specifies the jaccard, this attribute contains the coeffecient of either the jaccard or overlap depending on which was specified in the analysis

    The Results Panel

    • The right (east) panel

    Parameters pane

    • under construction

    Geneset Summary panel

    • will be removed in the next version

    Software/EnrichmentMap/UserManual (last edited 2009-07-09 14:28:56 by RuthIsserlin)

    MoinMoin Appliance - Powered by TurnKey Linux