I have included the dataset I was given in the attached Excel sheet. I only need the SPSS output document with the displays required to explain the items below. You don’t have to answer the questions; just provide the SPSS output I need. Text me if you have any questions. The assignment reads:
To complete questions 9 and 10, use the BP Study Dataset given in the resources.
Select three variables from the dataset. Two of the variables should be good candidates for a correlation, and the other variable should be a poor candidate for a correlation. Good candidates are variables that meet the assumptions (such as a normal distribution, reliable measurement, and an interval-ratio level of measurement). Poor candidates are variables that do not meet the assumptions or that have clear problems (such as restricted range, extreme outliers, or gross non-normality of distribution shape).
- Use the FREQUENCIES procedure to obtain a histogram and all univariate descriptive statistics for each of the three variables.
- Create a scatterplot for the two good candidate variables.
- Create a scatterplot that pairs the poor candidate variable with one of the two good candidate variables.
Properly embed SPSS output where appropriate in your answer to question 9 below.
- Explain which variables are good and poor candidates for a correlation analysis and give your rationale. Comment on empirical results from your data screening—both the histograms and scatterplots—as evidence that these variables meet or do not meet the basic assumptions necessary for correlation to be meaningful and honest. What other information would you want to have about the variables to make better informed judgments?
- Is there anything that could be done (data transformations or eliminating outliers, for instance) to make your poor candidate variable better? If so, what would you recommend?
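
In case it helps whoever picks this up, here is a rough Python sketch of the kind of screening the last two items describe. It is not SPSS and does not replace the FREQUENCIES output or the scatterplots; it is only a way to sanity-check the numbers. The function names (`sample_skewness`, `iqr_outliers`, `screen`) are made up for this example, the demo values are invented and are NOT from the BP Study Dataset, the 1.5 × IQR outlier rule is just one common convention, and the natural-log transform assumes every value is strictly positive.

```python
import math
import statistics


def sample_skewness(values):
    """Adjusted Fisher-Pearson sample skewness (needs at least 3 values)."""
    n = len(values)
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample SD (n - 1 denominator)
    cubed = sum(((v - mean) / sd) ** 3 for v in values)
    return n / ((n - 1) * (n - 2)) * cubed


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < low or v > high]


def screen(values):
    """Summarize one variable, before and after a natural-log transform."""
    logged = [math.log(v) for v in values]  # assumes strictly positive values
    return {
        "n": len(values),
        "mean": statistics.fmean(values),
        "median": statistics.median(values),
        "sd": statistics.stdev(values),
        "skew_raw": sample_skewness(values),
        "skew_log": sample_skewness(logged),
        "outliers": iqr_outliers(values),
    }


if __name__ == "__main__":
    # Invented, right-skewed numbers for illustration only.
    demo = [1, 2, 2, 3, 3, 3, 4, 4, 5, 40]
    for name, value in screen(demo).items():
        print(name, value)
```

A skewness far from zero or a non-empty outlier list would be one piece of evidence that a variable is a poor candidate. Comparing `skew_raw` with `skew_log` gives a rough idea of whether a log transform would help.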