Structured vs. unstructured data:
Structured data is data that is in a form that can be used to develop statistical or machine learning models (typically a matrix where rows are records and columns are variables or features). Or data that is in a form that can be extracted and turned into such a matrix fairly easily (e.g. database tables). Unstructured data is data, often text data, that is heterogeneous in format and requires considerable pre-processing before it can be used in a model. Examples are tweets, social network profiles and postings, and tech support cases or maintenance requests.
Browse Other Glossary Entries
Planning on taking an introductory statistics course, but not sure if you need to start at the beginning? Review the course description for each of our introductory statistics courses and estimate which best matches your level, then take the self test for that course. If you get all or almost all the questions correct, move on and take the next test.
Find the right course for you
We'd love to answer your questions
Our mentors and academic advisors are standing by to help guide you towards the courses or program that makes the most sense for you and your goals.
300 W Main St STE 301, Charlottesville, VA 22903