SPSS Focus

Data sets

The best way to learn statistical modeling in SPSS is to practice with real data sets. The data sets below can be used to practice the techniques discussed in this book.
For each data set, the table gives a brief description, the main variables, the source, a suggested modeling method, and a link to download the data in CSV format.

| Data | Variables | Source | Suggested modeling | Download link |
|---|---|---|---|---|
| A study on the effectiveness of radiosurgery treatment of primary brain tumor patients. | Survival time, Survival status | Selinjerova et al. (2016) | Kaplan-Meier, Cox regression | Brain tumor |
| Pima Indians Diabetes, a study on the development of diabetes among women of the Pima ethnic group. | Diabetes status, Blood pressure, BMI, Glucose level | National Institute of Diabetes and Digestive and Kidney Diseases | Logistic regression | Pima Indians Diabetes |
| A study that compares the arsenic level in water pipes with the acceptable level. | Arsenic level | Hypothetical | One-sample t-test | Arsenic |
| Estimating body fat from circumference measures of different body parts. | Body fat, Circumference measures | Penrose et al. (1985) | Multiple regression | Body fat |
| Classification of breast cancers using cell measurements. The data were obtained from Dr. William H. Wolberg at the University of Wisconsin Hospitals, Madison. | Breast cancer type (benign, malignant) | Mangasarian and Wolberg (1990) | Logistic regression | Breast cancer |
| Does cardamom have any effect on blood pressure? | Blood pressure | Hypothetical | Dependent-samples t-test | Cardamom |
| Research on the relationship between exercise and optimism. | Exercise frequency, Optimism scale | Hypothetical | Kendall Tau correlation | Exercise and Optimism |
| A randomized trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients in Germany. | Recurrence-free survival time (in days), Censored status, Therapy | Schumacher et al. (1994) | Kaplan-Meier, Cox regression | Breast tumor |
| Study on the relationship between hours of study and exam scores. | Hours of study, Exam score | Hypothetical | Pearson correlation | Study hours and exam scores |
| Study on the relationship between exercise and weight loss. | Exercise hours, Weight loss | Hypothetical | Spearman correlation | Exercise and weight loss |
| Study on the effectiveness of two math teaching methods. | Teaching method, Math score | Hypothetical | Independent-samples t-test | Math scores |
| A study on the survival time of patients with recurrent malignant gliomas. | Survival time, Survival status, Cancer types | Rostomily et al. (1994) | Kaplan-Meier, log-rank test, Cox regression | Malignant Gliomas |
| Study on the effectiveness of three physical therapy methods on recovery time. | PT method, Recovery time | Hypothetical | One-way ANOVA | Physical therapy methods |
| Investigating the interaction effect of physical therapy method and injury severity on recovery time. | PT method, Injury severity, Recovery time | Hypothetical | Two-way ANOVA | Physical therapy and Injury Severity |
| Does providing a school-based yoga program to school children reduce their math anxiety? | Math anxiety | Hypothetical | Repeated measures ANOVA | Math anxiety |
| Is there a relationship between sleep position (sleeping on the side versus on the back) and backache complaints? | Sleep position, Backache | Hypothetical | Chi-square test | Backache |
| What is the relationship between the number of hours students dedicate to studying and their test scores? Can the number of study hours predict test scores? | Study hours, Test score | Hypothetical | Simple linear regression | Study hours |
| What is the relationship between the number of hours students study, their academic motivation, and their test scores? | Study hours, Motivation, Test score | Hypothetical | Multiple linear regression | Study hours and motivation |
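
Before importing a downloaded CSV file into SPSS, it can help to confirm that the file has the expected variable names and number of cases. The following is a minimal Python sketch of such a check, not part of the SPSS workflow itself. The helper name `summarize_csv`, the column names `study_hours` and `test_score`, and the sample values are hypothetical and used only for illustration; the actual column names in the downloaded files may differ.

```python
import csv
import io


def summarize_csv(text_stream):
    """Return (column names, number of data rows) for a CSV stream with a header row."""
    reader = csv.reader(text_stream)
    header = next(reader, [])                 # first row holds the variable names
    n_rows = sum(1 for row in reader if row)  # count data rows, skipping blank lines
    return header, n_rows


# Hypothetical in-memory sample standing in for a downloaded file.
sample = io.StringIO("study_hours,test_score\n2,65\n4,78\n6,90\n")
columns, n_rows = summarize_csv(sample)
print(columns, n_rows)
```

For a file on disk, the same function could be called as `summarize_csv(open("study_hours.csv", newline=""))`, where `study_hours.csv` is an assumed file name.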