STAT 700 Data Analysis Project
Due by Dec. 14 (or earlier)
This project can be individual or in groups of up to 3.
The groups do not need to be the same as your HW groups.
Having trouble finding a dataset? Here are some useful sources
Presentation of Project (see Blackboard Annoucement for Submission of Slide)
approximately 10 minute Presentation (on M or W, Dec. 12 or 14) should include:
- Description of your data.
- Description of the problem of interest.
- Description the fixed and random effects you are modeling.
- Results of the best model.
Objective:
The objective of the project is to demonstrate and present data analysis
concepts from the class or literature to a problem or question of interest in
a dataset of your choice.
The project involves data and a problem.
Choice of Data:
The data of your choice
should be suitable for fitting a linear mixed model or nonlinear
mixed model. Appropriate models are
models with random effects for clustered data and
models for longitudinal or repeated measures data. If you doubt the suitability
of your data, please consult me.
The data should not have been analyzed before concerning the aspects you will be
investigating.
Written Report:
- The link to Guidelines for the Written Report gives details.
From the above Guideliness your report should include:
- Abstract or Summary
- Introduction
- Methodology
- Results
- Conclusions, etc.
- Bibliolography
More details of the Sections of the written report are as follows:
- Introduction. This section should provide a background for the problem.
It should contain a description and objective of the problem. Are there any
expectations based on literature or scientific knowledge?
Remember to explain the problem you are trying to solve!
- Methodology. Explain how the data was collected. List the variables and
give a description of each variable. Explain what methods you used in your
analysis.
- Results. Here is where you present your summary statistics and
plots, giving an overview of your dataset. Next, give the results
of your statistical analysis (hypothesis tests, p-values, estimates, CI's, ect.).
Do not include R code in the body of the written report.
Your R code and R summaries should be in the Appendix. Include a few model diagnostics plots.
- Conclusions, etc. This is your discussion and interpretation of findings.
- Appendix. Your report should be understood independent of the
Appendix. For this project I would like you to include a print out of the
beginning of your dataset, so that I can see the structure of your dataset.
Include your R code, summaries, and plots used in your analysis.
- The report should be 3-5 pages, 12 point font, one inch margins, and
single-spaced.
The 3-5 pages does not include figures, tables, nor bibliography.
(Here is link to Professor Kafadar's complete guidelines )