Statistical Problem

The purpose of the paper is for you to independently develop research questions in an area of computer science, conduct a literature search to find data, use statistical tools to answer those questions, and present this important application of statistical knowledge to the class.

The paper must include a topic from each of the five sections listed below, all of which are covered in the first two chapters of the text:

Section 1.1: Displaying Distributions

Section 1.2: Describing Distributions with Numbers

Section 2.1: Scatter Plots

Section 2.2: Correlation

Section 2.3: Least-Squares Regression

The relevant section of the text must be cited in parentheses after each item in the Statistical Tools Summary section.

Provide a typed report that is no longer than three pages (front/back/front). Your data set must contain 25 paired “X” and “Y” data points.

Use the following section headings for the report:

Description: A one paragraph (or one sentence) description of the topic.

Statistical Tools Summary: Five bullets listing the statistical tools that were used to analyze the data. Be specific. For example, it could be set up like this:

Created a time series plot of the temperature data over time (sec. 1.1)

Calculated a five-number summary of the temperature data (sec. 1.2)

Determined the form of the relationship between time and temperature based on the creation of a scatterplot (sec. 2.1)

Determined the direction and strength of the relationship between time (X) and temperature (Y) based on the calculation of a Pearson correlation coefficient (sec. 2.2)

Determined the linear regression equation for time (X) and temperature (Y), calculated R-squared, and interpreted it by giving a non-statistical explanation (sec. 2.3)
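As a rough illustration of the numerical tools in the example bullets above (secs. 1.2, 2.2, and 2.3), here is a minimal Python sketch. It is not a required method. The function names and the time/temperature values are hypothetical placeholders, and the quartile rule (median of each half, leaving out the overall median when the count is odd) is an assumption; other texts and software packages use slightly different rules, so check the answers against the method taught in class.

```python
from math import sqrt
from statistics import mean, median


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) for at least two values.

    Assumption: Q1 and Q3 are the medians of the lower and upper halves,
    excluding the overall median when the number of values is odd.
    """
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    upper = ordered[half + 1:] if n % 2 else ordered[half:]
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])


def pearson_r(xs, ys):
    """Pearson correlation coefficient r for paired X and Y values."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def least_squares_line(xs, ys):
    """Return (slope, intercept) of the line y = intercept + slope * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical placeholder values, NOT real data. Replace them with the
# 25 paired X and Y values from the cited source.
time_sec = [0, 10, 20, 30, 40, 50, 60]
temp_c = [20.1, 22.4, 24.0, 26.3, 27.9, 30.2, 32.0]

summary = five_number_summary(temp_c)
r = pearson_r(time_sec, temp_c)
slope, intercept = least_squares_line(time_sec, temp_c)
r_squared = r ** 2  # with one explanatory variable, R-squared equals r squared

print("Five-number summary:", summary)
print("Pearson r:", round(r, 4))
print("Regression line: y =", round(intercept, 4), "+", round(slope, 4), "* x")
print("R-squared:", round(r_squared, 4))
```

The plots for secs. 1.1 and 2.1 are left out of this sketch because they require a plotting tool; any spreadsheet or statistics package can produce them.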

Data set(s): The raw data presented in a fully labeled table. The source of the data must be fully cited in the caption.

Findings: The answers listed in the order given in the Statistical Tools Summary section.

Conclusion: A one paragraph summary of the big take-away(s) from the project. This must be based on your findings.

This is a professional paper, so proper spelling, grammar, punctuation, data (fully labeled tables and graphs), format (with the five subject headings above), etc. are expected. Be sure to write in third person!

A graded 2-4 minute oral presentation using PowerPoint is expected on the due date (the grading sheet is posted on Blackboard). In short, you should lead the audience through your paper, especially the findings.

A hard copy of the paper is expected at the start of class. I do not want you to turn in the PowerPoint. Presentation order will be determined by a random drawing on the due date. You need to be ready to give your PowerPoint presentation to the class on the due date or you will receive a zero for the project. To make the presentations go faster, please bring your presentation on a flash drive.

For the PowerPoint, you are going to need about seven slides:

Title slide

Statistical Tools Summary slide (do not use full sentences, just briefly list the tools)

The Findings (one slide for each of the following):

The graph from section 1.1

The five-number summary from section 1.2

The scatterplot from section 2.1

The Pearson correlation coefficient from section 2.2

The regression equation and R-squared from section 2.3

