Datafiles



  • Saratoga_house_prices
  • Methods: Comparing Two Groups, Confidence Intervals for Proportions, Multiple Regression Inference, Regression, Regression Inference
  • Source: public records
  • Number of Cases: 1063
  • Excerpt: Prices of homes in Saratoga, NY, along with facts about them. A good basis for multiple regressions that predict the Price of a house, although several predictors are collinear.


  • Saratoga_houses
  • Methods: Correlation, Regression, Scatterplot
  • Source: public records Saratoga County. Also in R library mosaic
  • Number of Cases: 1728
  • Excerpt: Prices of homes in Saratoga, NY, along with facts about them. A good basis for multiple regressions that predict the Price of a house, although several predictors are collinear.
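
A quick way to spot collinear predictors is to look at the correlation between pairs of them before fitting a multiple regression. The sketch below is only an illustration: the two predictors (`living_area`, `rooms`) and their values are made up for the example and are not taken from the data file, and `pearson_r` is a helper written here, not part of any package.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Made-up values for two hypothetical predictors (NOT from the data file).
living_area = [900, 1200, 1500, 1800, 2100, 2400]
rooms = [4, 5, 6, 8, 8, 10]

r = pearson_r(living_area, rooms)
print(f"correlation between predictors: {r:.3f}")
```

A correlation near 1 or -1 between two predictors suggests that their individual coefficients in a multiple regression will be hard to interpret separately.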


  • SAT_scores
  • Methods: R-squared, Regression, Residuals
  • Source: Real data—source unspecified to protect privacy
  • Number of Cases: 162
  • Excerpt: Scores on SAT tests for 162 students at the same school. (The identity of the school is not provided for privacy.) How are Math and Verbal scores related? Would a regression model be appropriate? Is there a difference in male and female scores? How would that difference be modeled?


  • School_system
  • Methods: Analysis of Variance
  • Source: http://www.scottishhillracing.co.uk
  • Number of Cases: 120
  • Excerpt: A school district superintendent wants to test a new method of teaching arithmetic in the fourth grade at his 15 schools. He plans to select 8 students from each school to take part in the experiment, but to make sure they are roughly of the same ability, he first gives a test to all 120 […]


  • Scorecard
  • Methods: Boxplots, Center, Comparing Groups, Conditional Distribution, Contingency Tables, Display Quantitative Variable, Outliers, Re-expression, Shape, Spread, Summaries
  • Source: https://collegescorecard.ed.gov/
  • Number of Cases: 3676
  • Excerpt: President Obama announced the redesigned College Scorecard to give students and families the most reliable, comprehensive, nationally comparable data ever produced on institutional outcomes. These include statistics on debt, federal loan repayment, completion rates, and post-college earnings of alumni in an easy-to-understand format. Key to boosting college completion is ensuring that students and families have […]


  • Scottish_Hill_Races
  • Methods: Indicator Variables, Multiple Regression, Partial Regression Plots
  • Source: http://www.scottishhillracing.co.uk
  • Number of Cases: 94
  • Excerpt: Hill races are races up generally steep hills, held across Scotland throughout the year. The file holds records for men and women in these races, as of the last time they were posted in an accessible table, along with facts about the races. In particular, we know the length (km) and total climb (m). These are two independent […]


  • Sea_ice
  • Methods: Regression
  • Source: Ice extent from http://nsidc.org Temperature from http://climate.nasa.gov/system/internal_resources/details/original/647_Global_Temperature_Data_File.txt
  • Number of Cases: 38
  • Excerpt: Climate scientists have been observing the extent of sea ice using satellite observations. Many have expressed concern because, since 1980, the extent of sea ice has declined precipitously—possibly due to global climate change. But a multiple regression of Extent on temp and year gives a coefficient for temp that is essentially zero.


  • Sea_ice_2020
  • Methods: Multiple Regression Inference, Regression Inference
  • Source: Ice extent from http://nsidc.org Temperature from http://climate.nasa.gov/system/internal_resources/details/original/647_Global_Temperature_Data_File.txt
  • Number of Cases: 42
  • Excerpt: Climate scientists have been observing the extent of sea ice using satellite observations. Many have expressed concern because, since 1980, the extent of sea ice has declined precipitously—possibly due to global climate change. But a multiple regression of Extent on temp and year gives a coefficient for temp that is essentially zero.
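
The excerpt does not say why the temp coefficient comes out near zero. One way this can happen is when temp and year are themselves strongly correlated, so that year already accounts for the trend. The sketch below illustrates that possibility with made-up numbers (not the actual data): extent is constructed to depend on year only, and temp rises with year plus a small wiggle. The function `fit_two_predictors` is a helper written for this example.

```python
def fit_two_predictors(x1, x2, y):
    """Least-squares fit of y = b0 + b1*x1 + b2*x2 via centered sums."""
    n = len(y)
    m1, m2, my = sum(x1) / n, sum(x2) / n, sum(y) / n
    d1 = [v - m1 for v in x1]
    d2 = [v - m2 for v in x2]
    dy = [v - my for v in y]
    s11 = sum(a * a for a in d1)
    s22 = sum(b * b for b in d2)
    s12 = sum(a * b for a, b in zip(d1, d2))
    s1y = sum(a * c for a, c in zip(d1, dy))
    s2y = sum(b * c for b, c in zip(d2, dy))
    det = s11 * s22 - s12 ** 2  # zero only if x1 and x2 are perfectly collinear
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    b0 = my - b1 * m1 - b2 * m2
    return b0, b1, b2

# Made-up illustration data (NOT the sea ice file).
years = list(range(10))
wiggle = [-0.02, 0.0, 0.02, -0.01, 0.01, -0.02, 0.0, 0.02, -0.01, 0.01]
temp = [0.02 * t + w for t, w in zip(years, wiggle)]   # rises with year
extent = [12.0 - 0.05 * t for t in years]              # depends on year only

# Simple regression of extent on temp alone.
n = len(years)
mt, me = sum(temp) / n, sum(extent) / n
slope_temp_alone = (sum((a - mt) * (c - me) for a, c in zip(temp, extent))
                    / sum((a - mt) ** 2 for a in temp))

# Multiple regression of extent on temp and year.
b0, b_temp, b_year = fit_two_predictors(temp, years, extent)

print(f"temp alone:     slope = {slope_temp_alone:.3f}")
print(f"temp with year: b_temp = {b_temp:.6f}, b_year = {b_year:.3f}")
```

In this constructed example temp has a clearly negative slope on its own, but its coefficient drops to essentially zero once year is in the model. Whether the same explanation applies to the real data is something to check with the file itself.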




  • Seat_belts_2015
  • Methods: Inference
  • Source: National Highway Traffic Safety Administration. 2016. Seat belt use in 2015: use rates in the states and territories. Report no. DOT HS-812-274. Washington, DC: U.S. Department of Transportation. Found at http://www.iihs.org/iihs/topics/t/general-statistics/fatalityfacts/state-by-state-overview#Rural-versus-urban
  • Number of Cases: 51
  • Excerpt: The National Highway Traffic Safety Administration reports seat belt use and fatalities in car accidents by state. How do fatalities relate to seat belt use?


  • Sex_sells
  • Methods: Blocking, Paired Data
  • Source: Real study - unknown source
  • Number of Cases: 39
  • Excerpt: A group of Statistics students cut ads out of magazines. They were careful to find two ads for each of 10 similar items, one with a sexual image and one without. They arranged the ads in random order and had 39 subjects look at them for one minute. Then they asked the subjects to list […]
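
Because each subject sees both kinds of ad, the natural analysis works on within-subject differences rather than on two independent groups. The sketch below shows a paired t-statistic under the assumption that each subject yields one numeric response per ad type. The excerpt is truncated, so what that response is remains an assumption here, and the numbers are made up for illustration (six hypothetical subjects, not the 39 in the file).

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(first, second):
    """Return the paired t-statistic and degrees of freedom for two
    equal-length lists of measurements on the same subjects."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    se = stdev(diffs) / sqrt(n)  # standard error of the mean difference
    return mean(diffs) / se, n - 1

# Made-up responses for six hypothetical subjects (NOT from the data file).
with_image = [4, 6, 5, 7, 3, 6]
without_image = [3, 4, 5, 5, 2, 5]

t, df = paired_t(with_image, without_image)
print(f"t = {t:.2f} on {df} degrees of freedom")
```

Pairing removes subject-to-subject variation from the comparison, which is the same idea as blocking on subject.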