Datafiles
- Cars Download .TXT file Open in Data Desk ?
- Methods: Analysis of Variance, Correlation, Experiment Design, Re-expression, Regression
- Source: Henderson and Velleman. (1981). "Building Regression ModelsInteractively". Biometrics 37, 400. Data originally collected from Consumer Reports.
- Number of Cases: 38
- Excerpt: Measurements on 38 1978-79 model automobiles. Gas mileage in miles per gallon as measured by Consumers’ Union on a test track. Other values as reported by automobile manufacturer. Used to illustrate regression model building and diagnosis. Be sure to check the residuals when predicting MPG.
- Case_study_I-PV Download .TXT file Open in Data Desk ?
- Methods: Regression, Correlation, Scatterplot
- Source: sample fromhttps://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html
- Number of Cases: 7600
- Excerpt:
- Case-Shiller_by_city Download .TXT file Open in Data Desk ?
- Methods: Displaying Quantitative Data, Summarizing Quantitative Data
- Source: unknown
- Number of Cases: 311
- Excerpt: The S&P/Case-Shiller Home Price Indices track changes in the value of residential real estate nationally and in 20 metropolitan regions. (Some of these indices are actually traded on the Chicago Mercantile Exchange.) The data set Case-Shiller by City gives the monthly index values for each of the 20 cities tracked by the Case-Shiller index and […]
- CEO_Compensation_2014 Download .TXT file Open in Data Desk ?
- Methods: Boxplots, Summarizing Quantitative Data, Displaying Quantitative Data, Normal Probability Plots, Confidence Intervals for Means
- Source: https://www.glassdoor.com/research/ceo-pay-ratio/
- Number of Cases: 434
- Excerpt: Beginning in 2017, public companies will be required to disclose the ratio of CEO pay to median worker pay. The Glassdoor Economic Research Blog has published the data for 2014. The data includes CEO identities, companies, CEO compensation, median worker compensation (compiled by Glassdoor), and the ratio of CEO to worker compensation.
- CEO_Compensation_2018 Download .TXT file Open in Data Desk ?
- Methods: Boxplots, Comparing Groups, Confidence Intervals for Means, Correlation, Normal model, Outliers, Re-expression, Scatterplot, Standard Deviation, Summarizing Quantitative Data
- Source: https://aflcio.org/paywatch/company-pay-ratios
- Number of Cases: 480
- Excerpt: Beginning in 2017, public companies have been required to disclose the ratio of CEO pay to median worker pay.This is the 2018/9 data from the AFL/CIO for S&P 500
- CEO_Salary_2012 Download .TXT file Open in Data Desk ?
- Methods:
- Source: Forbes
- Number of Cases: 500
- Excerpt:
- Cereal_company Download .TXT file Open in Data Desk ?
- Methods: Comparing Two Groups, Nonparametric Methods
- Source: Data collected in a super market
- Number of Cases: 27
- Excerpt:
- Cereals Download .TXT file Open in Data Desk ?
- Methods: Analysis of Variance, Center, Data Display, Display Quantitative Variable, Indicator Variables, Inference, Multiple Regression, Multiple Regression Inference, Outliers, Partial Regression Plots, R-squared, Re-expression, Regression, Regression Inference, Residuals, Shape, Spread, Summaries
- Source: Data collected in a super market
- Number of Cases: 77
- Excerpt: Nutritionists are concerned that people have a good breakfast. But what does that mean? students collected nutrition information from the nutrition labels of cereals in one supermarket.
- Chicago_taxi Download .TXT file Open in Data Desk ?
- Methods: Comparing Groups
- Source:
- Number of Cases: 13082
- Excerpt: Chicago’s Department of Business Affairs and Consumer Protection provides monthly reports of all taxi trips in Chicago, tagged with trip distances, trip durations,fare amounts, and tip amounts. The dataset Chicago taxi holds data on 13,082 taxi trips for which the duration exceeded 1 minute and for which payment was made either in cash or with […]
- Chips Download .TXT file Open in Data Desk ?
- Methods: Correlation, Re-expression, Regression
- Source: invented example
- Number of Cases: 15
- Excerpt: A start-up company has developed an improved electronicchip for use in laboratory equipment. The company needs to project the manufacturing cost, so it develops a spreadsheet model that takes into account the purchase of production equipment, overhead, raw materials, depreciation, maintenance, and other business costs. The spreadsheet estimates the cost of producing 10,000 to 200,000 […]
- Chips_Ahoy! Download .TXT file Open in Data Desk ?
- Methods: Confidence Intervals for Means, Hypothesis Tests, Hypothesis Tests for Means, Statistical Inference
- Source: Chance, 12, no. 1[1999]
- Number of Cases: 16
- Excerpt: In 1998, as an advertising campaign, the Nabisco Company announced a “1000 Chips Challenge,” claiming that every 18-ounce bag of their Chips Ahoy! cookies contained at least 1000 chocolate chips. Dedicated statistics students at the Air Force Academy randomly selected bags of cookies and counted the chocolate chips. The data report their counts.
- Cholesterol_and_smoking Download .TXT file Open in Data Desk ?
- Methods: Boxplots, Comparing Groups, Outliers, Re-expression
- Source: unknown
- Number of Cases: 43
- Excerpt: A study examined the health risks of smoking measured the cholesterol levels of people who had smoked for at least 25 years and people of similar ages who had smoked for no more than 5 years and then stopped