# Statistic 7

## W7: Regression and Correlation

Look back to the raw data you collected in week 1. There are 7 variables listed:

Vehicle type/class

Year

Make

Model

Price

MPG (city)

MPG (highway)

Choose TWO variables that you feel are correlated and explain why you feel that they are correlated. Do you suspect the relation is positive or negative? Why? Which would be considered the independent variable, which the dependent variable? Why?

Run a regression analysis in Excel and provide the results in your post along with your raw data. Looking at the *R2* value, explain what this indicates about the strength of the relation. Then write out your Regression Equation, state if your p-value and conclusion.

I encourage you to review the * Week 7 Regression PDF *at the bottom of the discussions. This will give you a step by step example on how to calculate a correlation and run a Regression using Excel. I DO NOT recommend doing this by hand. Let Excel do the heavy lifting for you. You can also use this PDF in Quizzes section.

There are additional PDFs that were created to help you with the Homework, Lessons and Tests in Quizzes section. I encourage you to review these ASAP! These PDFs are also located at the bottom of the discussion.

“Before you post your initial discussion, you must submit it in the assignment area in a Word file, so its originality can be checked by Turnitin.com. I will take points off if you do not do this. Your score will appear in the same place you submit your file. It can take up to 24 hours for a score to return, but usually, it is less than 30 minutes. Before you post your discussion in the activity, make sure your originality index (%) is less than 15. If it is greater than 15%, rewrite your discussion, submit it again in the assignment area and check the %. Keep doing this until your % is less than 15%. Only post your discussion when the % is less than 15. Here are two hints to get your score below 15%: 1) leave your list of supporting material out of the file you submit for checking (don't forget to add these back when you post your discussion in the forum) and 2) use your own words, not quotes.

Once you have posted your initial discussion, you must reply to at least two other learner's post. Each post must be a different topic. So, you will have your initial post from one topic, your first follow-up post from a different topic, and your second follow-up post from one of the other topics. Of course, you are more than welcome to respond to more than two learners.”

**Instructions:** Make sure you include your data set in your initial post as well. You must also respond to at least 2 other students. Responses may include direct questions.

Peer response #1 - Looking at your peer's Excel output, and the Regression Equation they wrote out, interpret the slope of their Regression Equation. Use their Regression Equation to make a prediction and show the work for your predicted value based on your expression. For Example, if your peer used Year to predict Price, plug in a Year value into the regression equation and use it to predict the Price of a vehicle. Does this predicted Price value make sense with their data?

Peer response #2 - It is important to remember that typically a two-factor regression model cannot accurately describe the entire situation. Look at the dependent variable that your peer chose. Name at least 2 independent factors you would use to run a Multiple Linear Regression (MLR) and explain why you feel they are related. Then use those factors to run a Multiple Linear Regression (MLR) on your peer's data and see if the variables you chose are related to the dependent variable they chose. What is your MLR equation? Is your MLR significant? Are any of the Independent factors significant? What is the *R2* value? Explain and interpret this value and how it relates to the MLR. Make sure you include your MLR Excel output as an attachment in your response post.