Revision 164 as of 2014-09-24 14:23:23
OICR Cancer Stem Cell program - Pathway and Network Analysis Service
The Pathway and Network Analysis Service is freely available to all OICR Cancer Stem Cell program members.
Goals of the service
High-throughput genomic experiments (e.g. gene expression, large-scale genetic screens) often lead to the identification of large gene lists. Interpreting the results and formulating consistent biological hypotheses from these gene lists can be challenging. Pathway and network analysis approaches (e.g. enrichment analysis) can aid interpretation by relating the gene list to knowledge about the biological system, such as pathways.
Our goal is to help researchers interpret results of genomics experiments. Analysis is conducted in close collaboration with researchers on each project to ensure correct input data and effective interpretation of results. Ideally researchers do as much of the analysis and interpretation as they can.
We are also developing training materials and sessions for researchers who want to perform these bioinformatics analyses themselves. Examples of published pathway maps and a list of tutorials to guide researchers can be found by following this link: RESOURCES AND EXAMPLES.
Standard types of pathway analysis offered
Pathway and network analysis: find pathways enriched in a list of genes (e.g. differentially expressed genes)
Gene-set enrichment analysis helps characterize large gene lists by finding functionally coherent gene-sets, such as pathways, that are statistically over-represented in a given gene list. We have also developed a method to visualize the results of this analysis, called Enrichment Map. Enrichment Map organizes gene-sets in a network so that the user can quickly identify the major enriched functional themes. Input: gene list from a genomics experiment (statistically analyzed). Output: enriched pathways, visually displayed.
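To make "statistically over-represented" concrete, here is a minimal sketch of one common over-representation test, the one-sided hypergeometric test. It is an illustration only and not necessarily the method the service uses; the function name, the toy gene identifiers and the background size are all assumptions made for this example.

```python
from math import comb

def overrepresentation_pvalue(gene_list, gene_set, background):
    """One-sided hypergeometric p-value: the probability of seeing at least
    the observed overlap between gene_list and gene_set by chance."""
    background = set(background)
    gene_list = set(gene_list) & background   # ignore genes outside the universe
    gene_set = set(gene_set) & background
    N = len(background)             # all genes that could have been detected
    K = len(gene_set)               # genes annotated to the pathway
    n = len(gene_list)              # genes in the experimental list
    k = len(gene_list & gene_set)   # observed overlap
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(K, n) + 1))
    return tail / comb(N, n)

# Toy data (hypothetical identifiers): 20 background genes, a 5-gene pathway,
# and a 5-gene experimental list that shares 4 genes with the pathway.
background = [f"g{i}" for i in range(20)]
pathway = ["g0", "g1", "g2", "g3", "g4"]
hits = ["g0", "g1", "g2", "g3", "g10"]
p_value = overrepresentation_pvalue(hits, pathway, background)
```

In practice many gene-sets are tested at once, so the resulting p-values would also need a multiple-testing correction before interpretation.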
Example of pathway and network analysis: (MORE EXAMPLES)
- A typical example (see figure below the text) comes from gene expression data comparing treated samples with non-treated samples. The first step is to identify differential gene expression statistically: genes are ranked by their t-test statistic, with up-regulated genes at the top of the list and down-regulated genes at the bottom.
- Next, GSEA is run to find out whether gene-sets contain mostly up-regulated or mostly down-regulated genes. [A gene-set is a group of genes that have been annotated as having a similar biological function or belonging to the same biological pathway, e.g. mitosis. Gene-sets are collected from multiple databases].
- Then, Enrichment Map helps visualize all the gene-sets that are significantly enriched in the treated samples (red circles) or in the non-treated samples (blue circles). [Each gene-set is represented by a circle, also known as a node]. If gene-sets have similar annotations, they cluster together on the map [e.g. all gene-sets related to chromosome condensation and the replication fork cluster together], which eases interpretation of the map. In this example, many gene-sets related to mitosis and DNA replication/damage, or involved in the replication fork complex, are enriched in the treated samples (red nodes; genes in these gene-sets are mostly up-regulated). Gene-sets involved in ossification/bone morphogenesis are enriched in the non-treated samples (blue nodes).
- As a result, the analysis output summarizes the known biological functions/pathways that are changing in a particular experiment, and more detailed analyses can be performed as a next step to validate or to generate new hypotheses.
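The three steps above can be sketched in a few lines of Python. This is a simplified illustration, not the GSEA or Enrichment Map software. It assumes Welch's t statistic for the ranking, an unweighted running-sum score in place of GSEA's weighted statistic and permutation testing, and the Jaccard index as the gene-set similarity. All gene names and expression values are made-up toy data.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(treated, control):
    """Welch's t statistic for one gene (positive = higher in treated)."""
    se = sqrt(variance(treated) / len(treated) + variance(control) / len(control))
    return (mean(treated) - mean(control)) / se

def rank_genes(expression):
    """expression maps gene -> (treated values, control values).
    Returns genes sorted from most up-regulated to most down-regulated."""
    scores = {gene: welch_t(t, c) for gene, (t, c) in expression.items()}
    return sorted(scores, key=scores.get, reverse=True)

def enrichment_score(ranked_genes, gene_set):
    """Walk down the ranked list, stepping up on gene-set members and down
    on other genes; return the largest deviation from zero (signed)."""
    members = set(gene_set) & set(ranked_genes)
    n_hit, n_miss = len(members), len(ranked_genes) - len(members)
    if n_hit == 0 or n_miss == 0:
        raise ValueError("gene-set must cover some, but not all, ranked genes")
    running, best = 0.0, 0.0
    for gene in ranked_genes:
        running += 1 / n_hit if gene in members else -1 / n_miss
        if abs(running) > abs(best):
            best = running
    return best

def jaccard(set_a, set_b):
    """Fraction of shared genes; similar gene-sets score close to 1."""
    a, b = set(set_a), set(set_b)
    return len(a & b) / len(a | b)

# Toy expression data: gene -> (treated replicates, non-treated replicates).
expression = {
    "M1": ([9.0, 9.2, 9.1], [5.0, 5.1, 4.9]),
    "M2": ([8.0, 8.3, 8.1], [6.0, 6.2, 5.9]),
    "M3": ([7.5, 7.4, 7.6], [6.5, 6.6, 6.4]),
    "H1": ([10.0, 10.1, 9.9], [10.0, 9.9, 10.1]),
    "O1": ([4.0, 4.1, 3.9], [6.0, 6.2, 6.1]),
    "O2": ([3.0, 3.2, 3.1], [6.0, 5.9, 6.1]),
}
ranked = rank_genes(expression)
es_mitosis = enrichment_score(ranked, {"M1", "M2", "M3"})   # top of the list
es_ossification = enrichment_score(ranked, {"O1", "O2"})    # bottom of the list
similarity = jaccard({"M1", "M2", "M3"}, {"M2", "M3", "X9"})
```

A positive score corresponds to a gene-set enriched in the treated samples (a red node) and a negative score to one enriched in the non-treated samples (a blue node). Gene-sets with a high similarity would be drawn close together on the map.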
Predict the function of an unknown gene
GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional association data. Input: a gene or set of genes. Output: connections between input genes and suggestions for additional related genes.
Related publications: GeneMANIA
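The general idea of suggesting related genes from functional association data can be illustrated with a very small "guilt by association" sketch. This is not GeneMANIA's algorithm or API. It simply scores each candidate gene by the total weight of its direct links to the input genes. The function name, the gene labels and the edge weights are all hypothetical.

```python
from collections import defaultdict

def suggest_related_genes(edges, query_genes, top_n=3):
    """edges: iterable of (gene_a, gene_b, weight) functional associations.
    Returns up to top_n (gene, score) pairs for genes outside the query,
    ranked by the summed weight of their links to the query genes."""
    query = set(query_genes)
    scores = defaultdict(float)
    for a, b, weight in edges:
        if a in query and b not in query:
            scores[b] += weight
        elif b in query and a not in query:
            scores[a] += weight
    # Sort by descending score, then by name for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))[:top_n]

# Toy association network with made-up weights.
edges = [
    ("Q1", "A", 0.9),
    ("Q1", "B", 0.6),
    ("Q2", "B", 0.8),
    ("Q2", "C", 0.7),
    ("D", "E", 0.9),   # not connected to the query, so never suggested
    ("Q1", "Q2", 0.5), # link between two query genes, ignored for scoring
]
suggestions = suggest_related_genes(edges, ["Q1", "Q2"])
```

Here gene B ranks first because it is linked to both input genes, which mirrors the kind of output described above: connections between the input genes plus a ranked list of additional candidates.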
We are interested in discussing custom analysis - it is how we learn what you need.
Statistical Analysis
- Pathway and network analysis begins once a gene list has been generated from high-throughput omics experiments and needs to be functionally interpreted. The data should therefore already have been statistically analyzed. If your list contains mostly true positives, you can be more confident in the output of the pathway analysis. If the gene list contains more noise, we will have to be more cautious in interpreting the results, and additional analyses will be needed, which delays the overall interpretation. Our experience shows that taking great care in the early steps of the statistical analysis (using the statistical method that best fits your data, including normalization and outlier removal) improves the pathway and network analysis results. For these reasons, we have also developed a biostatistics service that can help you choose a method or put your data into the correct format for subsequent pathway and network analyses:
Please look at http://www.baderlab.org/CSCBiostatService for more information.
You are also encouraged to contact us as soon as you plan your experiment: genomics technologies can be very sensitive to noise, and a well-designed experiment is very important for best results. Statistical consultation at the design stage is crucial for improving data quality and results.
How to use the service
Please follow THIS LINK to get more details about what to expect from the service and suggested data requirements.
Link to Tutorials
Please follow THIS LINK for Enrichment Map examples, tutorial slides, workflows and tips.
Contact Dr. Veronique Voisin (Ph.D Biology) veronique.voisin@gmail.com