Flexible, affordable statistics education.
Designed to help you master the software you need to enhance your skills and the practical experience you need to get ahead.

October 18, 2013 to November 15, 2013
Statistical Analysis of Microarray Data with R
taught by Sudha Purohit
Aim of Course: In this course, participants will learn the statistical tools required for the analysis of microarray data, how to apply them using R software, and how to interpret the results meaningfully. We will review the biology relevant to microarray data, then cover microarray experiment setup, quantification of the information generated by the experiment, and preprocessing of data, including statistical tools for between-array and within-array normalization. We then introduce Bioconductor and the use of Bioconductor packages for preprocessing Affymetrix data. This will be followed by statistical inference procedures to identify differentially expressed genes under two different conditions, and their extension to situations involving more than two conditions, using the classical t-test and ANOVA. We also use the limma package of Bioconductor to identify differentially expressed genes under two or more conditions. This will be followed by a discussion of two commonly used microarray-specific designs and the identification of differentially expressed genes using the marray and limma packages of Bioconductor. The course will also introduce multivariate statistical methods, such as principal component analysis and cluster analysis. These methods help to identify differentially expressed genes and sets of co-regulated genes, which in turn helps to assign functions to genes.
Course Program: Note: This course is not intended as a comprehensive introduction to either statistics or the biology of genetics. Rather, it is intended for participants who have some background in one or the other or both. Recognizing that this background may be varied, considerable review material is provided in both biology and statistics, as part of the regular course readings, as noted below. It is anticipated that participants will pick and choose to focus their attention on areas of need. The more of this material you need to cover in the review, the more time (perhaps even beyond the projected 15 hours per week) you should budget for the course.
Supplementary Background in Biology: Genome project, structure of eukaryotic cell, DNA, RNA, gene expression, transcription, splicing, translation, microarray experimental setup, quantification of information generated by microarray experiment.
Supplementary Background in Statistics: Descriptive statistics for univariate data, correlation and regression for bivariate data, basics of statistical hypothesis testing, one-sample and two-sample t-tests, paired t-test, F-test for equality of variances, Welch test, Shapiro-Wilk test, Wilcoxon rank sum test, signed rank test, one-way ANOVA, Bartlett's test, problem of multiple hypothesis testing, false discovery rate, principal component analysis, cluster analysis.
HOMEWORK:
Homework in this course consists of short answer questions to test concepts and guided data analysis problems using software.
Add a $50 service fee if you require a prior invoice, need to submit a purchase order or voucher, pay by wire transfer or EFT, or need to refund and reprocess a prior payment. Please use the printed registration form for these and other special orders.
Courses may fill up at any time, and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise. Those registering for multiple courses, Statistics.com's PASS students, and those affiliated with other academic institutions may be entitled to tuition discounts.
Have you reviewed the REQUIREMENTS for this course?
Who Should Take This Course: Biologists and geneticists who need to use statistical methods to analyze microarray data; also computer scientists and statisticians involved in microarray analysis projects. The course is designed to bridge the gap between several disciplines by providing the necessary information to participants with varied backgrounds.
Level: Intermediate
Please also read the note at the end of the course outline concerning the course's review materials in biology and statistics, and the time that you should budget for this course. If you are not skilled in the use of R, Statistics.com's Introduction to R is a prerequisite to this course.
This course takes place online at the Institute over 4 weeks. During each course week, you participate at times of your own choosing; there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor.
The course typically requires 15 hours per week. At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials, work through exercises, and submit answers. Discussion among participants is encouraged. The instructor will provide answers and comments, and at the end of the week, you will receive individual feedback on your homework answers.
The software used in course illustrations and assignments is R, an open-source, freely available statistical programming environment. Participants should download and install the R software prior to the beginning of the course. If you are not confident and comfortable using R software, you should consider taking Statistics.com's Introduction to R as a prerequisite to this course.
"You really have come up with an ideal method for working academicians to improve their quantitative skills without spending a fortune and taking time off from work to travel."