Skip to content

Explore Courses | Elder Research | Contact | LMS Login

Statistics.com
  • Curriculum
    • Curriculum
    • About Us
    • Testimonials
    • Management Team
    • Faculty Search
    • Teach With Us
    • Credit & Credentialing
  • Courses
    • Explore Courses
    • Course Calendar
    • About Our Courses
    • Course Tour
    • Test Yourself!
  • Mastery Series
    • Mastery Series Program
    • Bayesian Statistics
    • Business Analytics
    • Healthcare Analytics
    • Marketing Analytics
    • Operations Research
    • Predictive Analytics
    • Python for Analytics
    • R Programming
    • Rasch & IRT
    • Spatial Statistics
    • Statistical Modeling
    • Survey Statistics
    • Text Mining and Analytics
  • Certificates
    • Certificate Program
    • Analytics for Data Science
    • Biostatistics
    • Programming for Data Science – R (Novice)
    • Programming for Data Science – R (Experienced)
    • Programming for Data Science – Python (Novice)
    • Programming for Data Science – Python (Experienced)
    • Social Science
  • Degrees
    • Degree Programs
    • Computational Data Analytics Certificate of Graduate Study from Rowan University
    • Health Data Management Certificate of Graduate Study from Rowan University
    • Data Science Analytics Master’s Degree from Thomas Edison State University (TESU)
    • Data Science Analytics Bachelor’s Degree – TESU
    • Mathematics with Predictive Modeling Emphasis BS from Bellevue University
  • Enterprise
    • Organizations
    • Higher Education
  • Resources
    • Blog
    • FAQs & Knowledge Base
    • Glossary
    • Site Map
    • Statistical Symbols
    • Weekly Brief Newsletter Signup
    • Word of the Week
Menu Close
  • Curriculum
    • Curriculum
    • About Us
    • Testimonials
    • Management Team
    • Faculty Search
    • Teach With Us
    • Credit & Credentialing
  • Courses
    • Explore Courses
    • Course Calendar
    • About Our Courses
    • Course Tour
    • Test Yourself!
  • Mastery Series
    • Mastery Series Program
    • Bayesian Statistics
    • Business Analytics
    • Healthcare Analytics
    • Marketing Analytics
    • Operations Research
    • Predictive Analytics
    • Python for Analytics
    • R Programming
    • Rasch & IRT
    • Spatial Statistics
    • Statistical Modeling
    • Survey Statistics
    • Text Mining and Analytics
  • Certificates
    • Certificate Program
    • Analytics for Data Science
    • Biostatistics
    • Programming for Data Science – R (Novice)
    • Programming for Data Science – R (Experienced)
    • Programming for Data Science – Python (Novice)
    • Programming for Data Science – Python (Experienced)
    • Social Science
  • Degrees
    • Degree Programs
    • Computational Data Analytics Certificate of Graduate Study from Rowan University
    • Health Data Management Certificate of Graduate Study from Rowan University
    • Data Science Analytics Master’s Degree from Thomas Edison State University (TESU)
    • Data Science Analytics Bachelor’s Degree – TESU
    • Mathematics with Predictive Modeling Emphasis BS from Bellevue University
  • Enterprise
    • Organizations
    • Higher Education
  • Resources
    • Blog
    • FAQs & Knowledge Base
    • Glossary
    • Site Map
    • Statistical Symbols
    • Weekly Brief Newsletter Signup
    • Word of the Week

Wilcoxon – Mann – Whitney U Test

Home » Glossaries » Wilcoxon – Mann – Whitney U Test

Wilcoxon – Mann – Whitney U Test

Wilcoxon - Mann - Whitney U Test:

The Wilcoxon-Mann-Whitney test uses the ranks of data to test the hypothesis that two samples of sizes m and n might come from the same population. The procedure is as follows:

  1. Combine the data from both samples
  2. Rank each value
  3. Take the ranks for the first sample and sum them
  4. Compare this sum of ranks to all the possible rank sums that could result from random rearrangements of the data into two samples.

If step 4 reveals that the rank sum for the observed first sample is larger (or smaller) than nearly all the random orderings, this indicates that the first sample is significantly different from the second sample.

Note: Hollander and Wolfe suggest that ties be resolved by using the average rank of the tied observations.

Here´s an example in Excel (in step 4, rather than comparing the observed sum of ranks to ALL POSSIBLE randomly ordered sums, thousands of randomly shuffled sums are used for comparison):

Is there a difference in the transfer of titrated water across a placental membrane between human fetuses at 12-26 weeks and at term? The permeability constant Pd of the membrane is used as the measure. (Source: Hollander & Wolfe, Nonparametric Statistical Methods, John Wiley and Sons, 1973)

Here are the data:

Here are the ranks; note the rank sum for the first sample is 30:

Here are the ranks, randomly shuffled using the Resampling Stats add-in for Excel (30-day trial available for download at <a href="http://www.resample.com">http://www.resample.com):

The column of 10,000 resulting values is sorted, and we see that 1291 of the 10,000 shufflings yielded a sum of ranks <= the observed value of 30. This translates into an estimated p-value of .1291 for a 1-sided test of the null hypothesis that the two samples might come from the same population (against the alternative that sample 1 is smaller than sample 2.)

Including the above method, there are several ways to determine this p-value:

  1. Compare the observed rank sum to the distribution of rank sums resulting from all possible orderings of the ranks (an exact permutation test)
  2. Compare the observed rank sum to repeatedly shuffled rank sums (an approximate or Monte Carlo permuitation test; the result approaches the result of #1 above as the number of repeats approaches infinity).
    This is the method described above.
  3. Transform the observed test statistic into an equivalent statistic, which is approximately normally-distributed.
  4. For certain sample sizes, you can consult tables of the exact distribution of the test statistic.

With the advent of high speed computing and the availability of resampling and permutation software, methods 1 and 2 have increasingly come to dominate 3 and 4. <!--

Special Offer for Statistics.com Users!

Buy the Resampling Stats Excel Add-In now and save $25 off the regular price!

Fax a copy of this printable order form to 703-522-5846 to take advantage of this special offer - $124 (regular price: $149) -->

This expanded glossary entry is sponsored by Resampling Stats.

<a href="http://www.resample.com/excel/">Click here for more information on the Resampling Stats Add-In for Excel

Browse Other Glossary Entries

Courses Using This Term

Biostatistics 1 – For Medical Science and Public Health
This course will teach you the principal statistical concepts used in medical and health sciences. Basic concepts common to all statistical analysis are reviewed, and those concepts with specific importance in medicine and health are covered in detail.
Return to Glossary Search

About Statistics.com

Statistics.com offers academic and professional education in statistics, analytics, and data science at beginner, intermediate, and advanced levels of instruction. Statistics.com is a part of Elder Research, a data science consultancy with 25 years of experience in data analytics.

Latest Blogs

  • Dec 14: Statistics in Practice
    December 11, 2020/
    0 Comments
  • PUZZLE OF THE WEEK – School in the Pandemic
    December 11, 2020/
    0 Comments
  • From Kaggle to Cancel: The Culture of AI
    December 11, 2020/
    0 Comments

Social Networks

Linkedin
Twitter
Facebook
Youtube

Contact

The Institute for Statistics Education
4075 Wilson Blvd, 8th Floor
Arlington, VA 22203
(571) 281-8817

ourcourses@statistics.com

© Copyright 2021 - Statistics.com, LLC | All Rights Reserved | Privacy Policy | Terms of Use

By continuing to use this website, you consent to the use of cookies in accordance with our Cookie Policy.

Accept