Datafiles

  • Fuel_economy_2016 Download .TXT file Open in Data Desk ?
  • Methods: Boxplots, Confidence Intervals for Means, Correlation, Regression, Scatterplot
  • Source: https://www.fueleconomy.gov/feg/download.shtml
  • Number of Cases: 1211
  • Excerpt: The U.S. government provides fuel economy (in miles per gallon) and other information about late model cars sold in the US. How would you model the relationship between fuel economy and engine displacement (in liters)? Are there any cars that don’t fit the model? Can you explain why?


  • Fuel_economy_sample_2016 Download .TXT file Open in Data Desk ?
  • Methods: Correlation, Scatterplot
  • Source: https://www.fueleconomy.gov/feg/download.shtml
  • Number of Cases: 35
  • Excerpt: The U.S. government provides fuel economy (in miles per gallon) and other information about late model cars sold in the US. How would you model the relationship between fuel economy and engine displacement (in liters)? Are there any cars that don’t fit the model? Can you explain why? This is a sample of 35 cars […]


  • Fuel_efficiency Download .TXT file Open in Data Desk ?
  • Methods: Correlation, Outliers, Re-expression, Regression, Residuals, Scatterplot
  • Source: Consumer Reports
  • Number of Cases: 84
  • Excerpt: We know from common sense and from Physics that heavier cars need more fuel, but exactly how does a car’s weight affect its fuel efficiency? The data set continues data on 38 cars including their fuel efficiency in miles per gallon measured on a track.








  • Holiday_spending Download .TXT file Open in Data Desk ?
  • Methods: Regression, Correlation, Scatterplot
  • Source: Data adapted from real spending data from American Expressamex.reasonable=Amex.full[Amex.full$Dec_2004<10000 & Amex.full$Jan_2005<10000 & Amex.full$Dec_2004>0,] set.seed(1000) amex.samp=sample(subset(amex.reasonable,!(Dec_2004<4000 & Jan_2005>4000)),750)
  • Number of Cases: 750
  • Excerpt:


  • Hopkins_Forest Download .TXT file Open in Data Desk ?
  • Methods: Boxplots, Comparing Groups, Correlation, Data Display, Indicator Variables, Multiple Regression, Outliers, Partial Regression Plots, Re-expression, Regression, Residuals, Scatterplot, Summaries
  • Source: hmf.williams.edu/researchacademics/data/
  • Number of Cases: 365
  • Excerpt: The Hopkins Memorial Forest is a 2500-acre reserve in Massachusetts, New York, and Vermont managed by the Williams College Center for Environmental Studies (CES). As part of its mission, the CES monitors forest resources and conditions over the long term.


  • Housing_prices Download .TXT file Open in Data Desk ?
  • Methods: Correlation, Indicator Variables, Multiple Regression, Partial Regression Plots, Re-expression, Regression, Scatterplot
  • Source: random sample of 1057 houses taken from full Saratoga Housing Data (De Veaux)
  • Number of Cases: 1057
  • Excerpt: House prices and properties in New York. What properties of a house can predict its price? Can we use such a model to identify houses that are extraordinarily expensive or inexpensive?


  • How_old_is_that_Tree Download .TXT file Open in Data Desk ?
  • Methods: Correlation, Re-expression, Regression
  • Source: unknown
  • Number of Cases: 27
  • Excerpt: One can determine how old a tree is by counting its rings, but that requires either cutting the tree down or extracting a sample from the tree’s core. Can we estimate the tree’s age simply from its diameter?A forester measured 27 trees of the same species that had been cut down, and counted the rings […]


  • Income_and_housing Download .TXT file Open in Data Desk ?
  • Methods: Correlation, Nonparametric Methods, R-squared, Re-expression, Regression, Residuals, Scatterplot
  • Source: Office of Federal Housing Enterprise Oversight
  • Number of Cases: 51
  • Excerpt: How are housing costs related to median family income?