Math Forum | Urban Systemic Initiative


## Finding Data Sets on the Web

CHANCE Database Welcome Page
Materials for teaching a CHANCE case study course built around current chance events as reported in daily newspapers and journals, and for supplementing a more traditional probability or statistics course with current events.

The Data and Story Library (DASL)
An online searchable library of datafiles and stories that illustrate the use of basic statistics methods, providing data from a wide variety of topics so that statistics teachers can find real-world examples that will be interesting to their students. Stories by topic (Archeology through Zoology); by method (ANCOVA through Two sample t-test); datafiles by subject (Agriculture through Weather). A form is available for submitting your story.

The Data Library - Pat Daley
This site includes collaborative projects (specific data collection projects that teachers and their students can join); data sets that can be downloaded as ClarisWorks spreadsheets and then sorted, manipulated, and graphed on your computer; and other sources of data, such as the Bureau of Labor Statistics and the CHANCE Database, that offer many more data sets in other formats.

The El Niño Theme Page - NOAA
The National Oceanic and Atmospheric Administration offers everything from introductory general science to graduate school meteorological data. Beginning with the basics, the site provides a full range of illustrated articles that explain the science and impacts of this weather phenomenon. There are links to data from a moored buoy, drifting ocean buoys, satellites, and all sorts of other places. In Spanish, Portuguese, and English.

The Ocean Planet
See the Overview and Floor Plan for this traveling exhibition from the Smithsonian Institution's National Museum of Natural History, and Diving Records from the "Ocean Planet" Exhibition.

Journal of Statistics Education Data Archive
A large collection of datasets, many with an accompanying article from the Journal.

K-12 Statistics (MSTE)
A list of statistics resources, including lessons and useful data sets, organized to follow the NCTM Statistics Standards for grades 9-12. Links to the AP Statistics site and even more mathematics lessons. From the Office for Mathematics, Science, and Technology Education (MSTE) at the University of Illinois at Urbana-Champaign.

The SeaWiFS Project
The global ocean color monitoring mission of the National Aeronautics and Space Administration (NASA), at the Goddard Space Flight Center. SeaWiFS images and digital data are now available. See Studying Ocean Color From Space - a Teacher's Guide with Activities.

StatLib - Datasets Archive
From the Carnegie Mellon University Statistics Department. Datasets from articles and books, and other sets submitted by readers.

StatLib - JASA Data Archive
Contributed datasets from articles published in the Journal of the American Statistical Association. Examples include data for the following articles: "The Chilean Plebiscite: Projections Without Historic Data"; "Kriging and Splines: An Empirical Comparison of their Predictive Performance in Some Applications"; "Quantity discounts and quality premia for illicit drugs"; "Analytical Use of Data from Army Training Exercises: A Case Study of Tactical Reconnaissance".

Tide Predictions - NOAA
The Oceanographic Products and Services Division (OPSD) of NOAA's National Ocean Service, Office of Coast Survey, collects, analyzes and distributes historical and real-time observations and predictions of water levels, coastal currents and other meteorological and oceanographic data.

U.S. Census Bureau
Information from summary data files across cities, towns, counties, states, and other geographic and demographic boundaries, useful for many purposes from allocating representation in legislative bodies to tracking economic and population trends needed for strategic planning. The directory includes POPClock Projection, Census Bureau Bulletin Board, Press Releases, Center for Economic Studies, Financial Data for State and Local Governments and Schools, Census Bureau Art Gallery, and more.

United States Geological Survey (USGS), U.S. Department of the Interior
Locators & Catalogs, Collections, and Libraries include: geography information, geologic data, geospatial data, Water Data, and Water Use, etc.

### General Information, Courses, Interactive Resources, Software, etc.

ASA Student Poster Competition
Sponsored by the Center for Statistical Education and the Section on Statistical Graphics of the American Statistical Association, the Statistics Poster Competition encourages students K-12 in the United States and Canada to display data in creative ways. The purpose of the poster is to tell a statistical story visually, to illustrate conclusions that can be drawn from data. For more information on the contest, see What is a Statistical Poster? from the Cleveland chapter of the organization. The deadline for submissions is April 15.

Ask Dr. Math
An archive of questions answered by the "math doctors" of this Math Forum Internet project includes Middle School Probability and Statistics and High School Statistics. You may also search the Dr. Math archives by keyword.

AP Statistics - BB&N
A course from the Buckingham Browne & Nichols School. Advanced Placement Statistics acquaints students with the major concepts and tools for collecting, analyzing, and drawing conclusions from data, featuring work on projects involving the hands-on gathering and analysis of real world data. Ideas and computations presented in this course have immediate links and connections with actual events. Computers and calculators allow students to focus deeply on the concepts involved in statistics. Homework assignments include selected answers. The course prepares students for the AP exam in Statistics.

CMP Software - Michigan State Univ.
Downloadable software from the Connected Mathematics Project: The Factor Game, the Product Game, the Coin Game, The Tic-Tac4 Game, The Treasure Hunt Game. May be copied for classroom use only.

DAU Refresher Course - Probability and Statistics
An interactive tutorial covering basic probability, random variables, moments, distributions, data analysis including regression, moving averages, exponential smoothing, and clustering. The Probability and Statistics refresher module is intended for use by DAU course participants.

Exploring Data - Education Queensland
A website with activities, worksheets, overhead transparency masters, datasets, and assessment to support data exploration. It also contains an extensive collection of articles designed to enhance the statistics knowledge of the teacher. A resources page gives a select list of resources to support introductory statistics, including texts, websites, datasets, Java applets, and mailing lists. Most of the resources are available as web pages and as Word 2.0 documents. Topics: Patterns, Stemplots, Dotplots, Histograms (mean/median/mode), Measures of Spread, Boxplots, Normal Plots, Scatterplots, Assessment. Datasets are available in tab-delimited, Excel 4.0 and NCSS 6.0 Jr formats. Resources for teaching and learning statistics; Linear Regression; Normal Distribution, Probability, Sampling, Confidence Intervals, Hypothesis Testing, Curve Fitting.

Fisher's Exact Test
This page can be used to test statistically whether there is any relation between two categorical variables (each with two levels). Fill in the table and press COMPUTE.
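
For readers who want to see what a test of this kind computes, here is a minimal Python sketch of a two-sided Fisher's exact test for a 2x2 table. It is not the linked page's code: the function name `fisher_exact_2x2` is hypothetical, and the two-sided rule used here (summing the probabilities of all tables no more likely than the observed one) is one common convention, assumed for illustration.

```python
from math import comb


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher exact p-value for the table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2
    denom = comb(n, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell,
        # with all row and column totals held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    smallest = max(0, col1 - row2)   # smallest feasible top-left count
    largest = min(row1, col1)        # largest feasible top-left count
    p_observed = prob(a)

    # Add up every feasible table that is no more probable than the
    # observed one (tiny tolerance guards against float ties).
    return sum(
        p
        for p in (prob(x) for x in range(smallest, largest + 1))
        if p <= p_observed * (1 + 1e-9)
    )
```

For the table with rows (3, 1) and (1, 3), `fisher_exact_2x2(3, 1, 1, 3)` returns 34/70, about 0.486, so that table gives no evidence of a relation between the two variables.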

The Need For Statistical Literacy In Australian Society
Radio National Transcripts: Ockham's Razor, Sunday, 13th April, 1997. Some misconceptions: the way Australian children interpreted "average" after seeing a Ford Motor Co. ad on TV about a family with 2.3 children; a press release saying that "Seven in ten men who frequently eat canned tuna, sardines, salmon, mackerel or kippers admit to being ambitious" with a discussion of the underlying sample size and questions asked in the survey and why the statement was misleading; and a pie chart showing market share of groceries in Australia where a piece of the pie much less than half of the total was labelled 61% and the sum of all of the percentages in the chart yielded a total of 128.5%.
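
The pie chart problem described above is simple arithmetic: the slices of a pie chart must add up to 100%. A small Python sketch of that sanity check follows. The function name `check_pie_shares`, the slice labels, and all slice values other than the 61% slice and the 128.5% total are hypothetical, chosen only to reproduce the total mentioned in the transcript.

```python
def check_pie_shares(shares, tolerance=0.5):
    """Return (total, ok); ok is True when the shares sum to about 100%."""
    total = sum(shares.values())
    return total, abs(total - 100.0) <= tolerance


# Hypothetical labels and values. Only the 61% slice and the 128.5%
# total are taken from the description above.
shares = {"Chain A": 61.0, "Chain B": 30.0, "Chain C": 20.5, "Other": 17.0}

total, ok = check_pie_shares(shares)
print(f"total = {total}%, plausible pie chart: {ok}")
```

A chart that fails this check cannot be showing shares of a single whole, whatever the individual labels claim.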

Resampling Statistics (University of Maryland)
Resampling involves the use of data, or a data-generating mechanism such as a coin or set of cards, to randomly generate additional samples, the results of which can be examined. This encompasses Monte Carlo simulation, randomization tests, bootstrap procedures, and other techniques. Resampling can be used to solve just about any problem requiring probabilistic and statistical analysis. Perhaps if computers had been widely available 100 years ago, statistical theory would have been built on this foundation. Books and articles, and Resampling Stats software.
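
As an illustration of using a data-generating mechanism such as a coin, the Python sketch below runs a small Monte Carlo simulation: it estimates the chance of getting at least 8 heads in 10 tosses of a fair coin by simulating many sets of tosses. The function name, the question asked, the number of trials, and the seed are all assumptions made for this example and do not come from the Resampling Stats materials.

```python
import random


def simulate_at_least_k_heads(n_tosses, k, trials, seed=0):
    """Estimate P(at least k heads in n_tosses fair-coin tosses) by simulation."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = 0
    for _ in range(trials):
        # One trial: toss the coin n_tosses times and count the heads.
        heads = sum(rng.random() < 0.5 for _ in range(n_tosses))
        if heads >= k:
            hits += 1
    return hits / trials


estimate = simulate_at_least_k_heads(n_tosses=10, k=8, trials=20000)
print(f"estimated probability: {estimate:.4f}")
```

The exact binomial answer is 56/1024, about 0.0547, so the simulated estimate should land close to that value; more trials narrow the gap.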

Resampling Stats
A source for information about statistics and data analysis, including statistical software, and resampling methods in particular. Resampling is a statistical method of drawing repeated samples from the given data to determine the value of a parameter of interest. Some problems that are very difficult using traditional methods are easily solved using resampling. Resampling Stats is a computer program offered for sale (30-day free trial, discount for personal or academic use).
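
The idea of drawing repeated samples from the given data can be sketched in a few lines of Python. The example below is a bootstrap percentile interval for a mean, written with the standard library only; it is not the Resampling Stats program, and the function name, the sample values, the number of resamples, and the 90% level are all hypothetical choices for illustration.

```python
import random
from statistics import mean


def bootstrap_mean_interval(data, n_resamples=2000, level=0.90, seed=0):
    """Percentile bootstrap interval for the mean of data."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    n = len(data)
    # Resample with replacement, same size as the original sample,
    # and record the mean of each resample.
    means = sorted(mean(rng.choices(data, k=n)) for _ in range(n_resamples))
    tail = (1 - level) / 2
    lower = means[int(tail * n_resamples)]
    upper = means[int((1 - tail) * n_resamples) - 1]
    return lower, upper


# Hypothetical sample values, used only to exercise the function.
sample = [12, 15, 9, 20, 14, 11, 17, 13, 16, 10]
low, high = bootstrap_mean_interval(sample)
print(f"sample mean = {mean(sample)}, 90% bootstrap interval = ({low}, {high})")
```

The interval comes entirely from reshuffling the observed data, with no formula for the standard error, which is the appeal of the method for problems that are awkward to treat analytically.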

Statiscope - Mikael Bonnier
Statiscope presents summary statistics and descriptive statistical charts. In the applet you can enter data manually or download data over the Internet. Charts included: Distribution, Probability mass, Density, Box plot, Stem & leaf; also other features such as hypothesis testing and calculation of confidence intervals. Statiscope is intended both for people learning the concepts of statistics and for practical use by researchers. It lets people share data and do independent investigations.

Statistics for Writers
Numbers can't "talk," but they can tell you as much as human sources can. Here, described in plain English, are some basic concepts in statistics that every writer (and reader!) should know. Mean, Median, Percent, Per capita and Rates, Standard Deviation and Normal Distribution, Margin of Error and Confidence Interval, Data Analysis, Sample Sizes, and Correlation Coefficients.

Statistics Glossary
Definitions from the UCLA Statistics textbook on the Web, with an alphabetical index of all entries.

The Statistics Teacher Network - STN
A newsletter (beginning Winter 1997) for teachers including book reviews and articles on new teaching ideas, methods, and tools. Initial articles on: Review of Activity-Based Statistics; A Quick Introduction to the Statistical Capabilities of the TI-83 Calculator; Review of Graphing Statistics & Data: Creating Better Charts; A Statistician Goes to First Grade. Edited by Jerry Moreno.

A Taste for M&Ms - Ivars Peterson (MathLand)
The package of M&M's chocolate candies (plain) I had just opened contained 56 colorful, little oblate spheroids. It was obvious from a quick glance that certain colors were more common than others in this sample of M&M's. I counted up each color and confirmed my suspicion. Then I opened another packet to see if I would get comparable results. There were some interesting similarities and differences in the data, including the fact that the second package had three fewer M&Ms!

StatLib - Statistics (CMU)
A system for distributing statistical software, datasets, and information by electronic mail, FTP, gopher, and WWW, with quick and long summaries of contents. From the Carnegie Mellon University Statistics Department.

VassarStats Statistical Computation Website
Online programs on topics like statistical tests, simulation, correlation and regression: Procedures applicable to categorical frequency data, proportions, and ordinal data; t-Tests; analysis of variance; analysis of covariance; randomized sequence for the integers 1 through n.

ViSta: the Visual Statistics System - Forrest W. Young
Statistical software that features visual tools for revealing structure in data; for showing the results of statistical analyses; for guiding the analyst through an analysis; and for structuring the analyst's data analysis process. ViSta is designed for a wide range of users, particularly for those learning statistics and for their teachers.

Workshop Statistics: Discovery with Data - Allan J. Rossman
Links to a variety of information and teaching resources concerning Rossman's book Workshop Statistics, which contains activities to guide students to discover statistical concepts, explore statistical principles, and apply statistical techniques. Included are a guide for instructors, calculator materials, a sample syllabus, sample exams, brief answers to selected in-class activities, data sets, and Minitab macros.

WWW Resources for Teaching Statistics - Robin Lock
A paper presented at Technology in Statistics Education, a 1998 conference for teachers sponsored by the Boston Chapter of the American Statistical Association. Its purpose is to direct readers to typical statistics instruction support sites on the WWW. An Outline and an All-in-one-page version of the talk are provided. The contents include on-line course materials and texts, Java demonstrations, electronic journals and discussion lists, "Data, Data, and More Data" (dataset archives, pages of links to data sites, government and official agencies, data about the Web, and textbook data), and miscellaneous links.

WWW Virtual Library: Statistics (Univ. of Florida)
A list of known Web Information Systems relating to Statistics. Australia, Canada, The Netherlands, United Kingdom, United States. Others: CHANCE Database; CompStat; Central Statistical Office (Macro-Economic Time Series Data); JASA Data Archive; Netlib (Mathematical and Statistical Software); PASE; StatLib Index; U.S. Census Bureau.

Questions? Write to the workshop facilitators.