Making Conjectures Based on
Graphical and Basic Statistical Analysis of Data

Jay Yohe
Susquehanna Township High School
Grade 10-12
Selected Topics

Prerequisite skills:
    *Adept with TI-83 Graphics Calculator
    *Basic knowledge of graphical methods (line graphs, box plots, histograms, etc.)
    *Basic knowledge of mean, median and standard deviation.

Objectives:
 

Students will be able to graphically analyze data sets and be able to use statistics to make sound conjectures.
Students will use technology to create appropriate graphical representations of data using EXCEL, TI-83 calculators, Fathom or other software packages.
Students will use EXCEL, Fathom and the TI-83 calculator to calculate statistical measurements like mean, median, standard deviation, range, quartiles, etc.  Students will then use these statistical measurements to make conclusions.
Students will utilize the Internet and the Library Media Center to find more data and information to support and enhance their conjectures and they will create their own questions based on the results of their graphical analysis, statistical analysis and advanced research of the data.

PA Math Standards:
 
 

Academic Standard

Description

2.4.11 Mathematical Reasoning and Connections (B) Construct valid arguments from stated facts.
2.5.11 Mathematical Problem Solving 
and Communication
(B) Use symbols, mathematical terminology, standard notation, mathematical rules, graphing and other types of mathematical representations to communicate observations, predictions, concepts, procedures, generalizations, ideas and results.
2.5.11 Mathematical Problem Solving 
and Communication
(D) Conclude a solution process with a summary of results and evaluate the degree to which the results obtained represent an acceptable response to the initial problem and why the reasoning is valid.
2.6.11 Statistics and Data Analysis (B) Use appropriate technology to organize and analyze data taken from the local community.

Materials and Resources:

    *TI-83 calculators
    *Computers with MicroSoft EXCEL
    *Access to the Internet/Library Media Center
    *Attached MicroSoft WORD data sets
    *Fathom (Key Curriculum Press) --desired but optional

Instruction:

    Students will work together in small groups during the introductory problem and will also work together on the actual assignment.  Each student will be responsible for submitting individual written results and summaries.

Procedures:

    *Introduction Problem (below)
    *Explanation of tasks
    *Assignment of projects with explanation of assessment

Introductory Problem:

The following data set is used courtesy of (http://lib.stat.cmu.edu/DASL/)
Estimating the Speed of Light
Reference:
 

Moore, David S., and George P. McCabe (1989). Introduction to the Practice of Statistics.
Original source:
Stigler, S.M., "Do robust estimators work with real data?" 
Annals of Statistics, 5 (1977), pp. 1055-1078. 

Description:
Simon Newcomb measured the time required for light to travel from his laboratory on the Potomac River to a
mirror at the base of the Washington Monument and back, a total distance of about 7400 meters. These measurements were used to estimate the speed of light. The data file contains the estimated speed of light for 66 trials.

Number of cases: 66
Variable Name:

   1.Speed_of_Light: Speed of Light
    (Does this measurement actually represent the speed of light?)

The Data:

Speed_of_Light
28
22
36
26
28
28
26
24
32
30
27
24
33
21
36
32
31
25
24
25
28
36
27
32
34
30
25
26
26
25
-44
23
21
30
33
29
27
29
28
22
26
27
16
31
29
36
32
28
40
19
37
23
32
29
-2
24
25
27
24
16
29
20
28
27
39
23

Step 1.  Analyze the Data using EXCEL

***Adding Data Analysis to your Tool Bar

  1. From the "Tool" menu select "Add-Ins..."
  2. Check both options as illustrated below and click "OK"
Add-Ins tool Box



Step 2:  Calculate Statistical Data (in EXCEL)
 



Step 3:  Make conclusions and answer questions from the EXCEL data set and graph
 

  1. What kind of value is -44 in this data set?
  2. Newcomb is actually measuring the time that it took a light signal to pass from his laboratory on the Potomac River to a mirror on the base of the Washington Monument and back.  So what are the units of his measurements?
  3. We know that speed equals distance divided by time (d=r*t or r=d/t).  Newcomb used a distance of 7400 meters.  If your units are correct in the previous step, use them to calculate the speed of light based on the "mean" measurement.  What do you get?
  4. What is the actual speed of light?
  5. Assuming your units are correct from part 2 above, explain how different -44 is from the mean value calculated for time in Newcomb's speed of light test data.
  6. How many measured data points are within one standard deviation of the mean on both sides?
  7. Remove -44 and redo all the data including the Histogram graph and the formulas.  What does this do to the distribution of data?  Explain.
  8. What other experiments could be conducted to validate Newcomb's test data?  Can you find any other test data from other experiments that gives similar results?


Step 4:  Analyze Newcomb's data using the TI-83 Graphing Calculator
 



Step 5:  Observe the data with Fathom (optional)

This is a screen shot of Fathom with the Speed of Light Data: Fathom

Observe the Box Plot Created in the Fathom Software Package:

Fathom Box Plot


Closure Activities:

A -- Alternate Data for Calculating Speed of Light (class work)

Use the following data to calculate the speed of light.

Star/Planet/Object in Space

Time for Light to Travel from Space Entity to Earth

Moon 1.28 seconds
Sun 8.3 minutes
Communication Satellite (Geosynchronous Orbit) 0.12 seconds
Jupiter ?
Pluto ?

B -- Graded Project on Quarterback Statistics

Download the Quarterback File to your computer (this is a Word document)

Part 1:

Analyze 1991 Quarterback salaries across the football league using graphical and statistical analysis.

Requirements (Analyzing 1-Variable Statistics):
 

  1. You must create one graphical histogram representation on EXCEL and both the Histogram and Box Plot representation on your graphics calculator.  You may substitute Fathom for either EXCEL or the graphics calculator if available.  Use technology to calculate statistical formulas also.
  2. You must make one graph (of any type) by hand on poster board or construction paper.
  3. Create and answer 4 questions related to the observed data and support your results with the graphs and formulas.  You should write at least 3 descriptive sentences to answer each question.  Example:  What do the outliers mean if there are any?  What causes the outliers if there are any?
  4. Research the National Football League during the 1991 season.  Can you find reasons for the results suggested by the data?


Part 2: Two Variable Statistics

Use other graphical means to compare quarterback salaries to team salaries, NFC vs AFC differences, quarterback salaries vs wins and losses, etc.  Create and pose at least 4 questions related to this data even if you decide there is no correlation.  You may also compare 1-variable quarterback salaries with 1-variable team salaries to compare similarities or vast differences.  Summarize your data with graphs, charts and formulas.

Part 3:  Finding your own Data Set

Using the Internet or Library Media Center, find a set of 1-variable data to analyze.  Provide at least one graph and analyze this data set.  If the data is relative to the local community, you can gain extra credit.  You may also find additional data related to the current project from 1991 National Football League statistics.

Assessment:
 

Grade Requirements

50 points

All objectives for Part 1, 2 and 3 are complete.  All questions/answers are supported well by the data.  Students may also earn extra credit by doing extra research and by presenting creative questions (as long as they are supported by the data). All graphs and formulas have been completed.

45 points

One or two questions/answers are not answered adequately and/or the questions are not supported well by the data.  All graphs and formulas have been completed for part 1 and 2.

40 points

Questions/answers are not supported well by the data and/or some questions are missing altogether.  All graphs and formulas have been completed for part 1 and 2.

35 points

Questions/answers are not supported by the data. Questions are missing. Some graphs and formulas have not been completed.

 <35 points

Few objectives have been met!


References:

**Eisenhower Institute (July 2001)
**http://lib.stat.cmu.edu/DASL/    (data sets)
**http://home.earthlink.net/~ob1gui/    (some football statistics)
**http://astro.uchicago.edu/cara/outreach/resources/ysi94/light.html    (planet data)
**http://www.stolaf.edu/people/wallace/Courses/Statistics/backgroundinfo.htm     (Newcomb's summary)


A Shane/Greydon Production
Shane Greydon Production


Return to Hanna Web