In this case, it's 15.</p></li></ul></li></ul><h4id="985df62c−e784−4765−81f1−13a4c7abf494"data−toc−id="985df62c−e784−4765−81f1−13a4c7abf494"collapsed="false"seolevelmigrated="true">GeneralEquation</h4><ul><li><p>Generalequationofaline:y = b0 + b1x</p><ul><li><p>b_0isthey−intercept.</p></li><li><p>b_1istheslopecoefficient.</p></li></ul></li></ul><h4id="324b2442−31d2−421f−bd68−d9b3633a4ee9"data−toc−id="324b2442−31d2−421f−bd68−d9b3633a4ee9"collapsed="false"seolevelmigrated="true">Scattergram(ScatterPlot)</h4><ul><li><p>Plotofthedatapoints(xandyvalues).</p></li><li><p>Ascattergramisjustthedots,donotdrawalineofbestfit!</p></li><li><p>Givesaroughideaoftherelationshipbetweenxandy.</p></li><li><p>Helpsdetermineiftherelationshipispositive,negative,ornon−linear.</p></li></ul><h4id="8cc0d4ad−a28e−4fcf−844b−4d5d5fb9e282"data−toc−id="8cc0d4ad−a28e−4fcf−844b−4d5d5fb9e282"collapsed="false"seolevelmigrated="true">KeyPointsforScattergrams</h4><ul><li><p>Labeltheaxes(xandy).</p></li><li><p>Includethedotsrepresentingthedatapoints.</p></li><li><p>Ifthedotsmoveupward,there′sapositiverelationship.</p></li><li><p>Ifthedotsmovedownward,there′sanegativerelationship.</p></li><li><p>Ifthedotsarerandomlyscattered,there′snolinearrelationship.</p></li></ul><h4id="dc0bbab6−953b−4a97−94e5−2a6700768a19"data−toc−id="dc0bbab6−953b−4a97−94e5−2a6700768a19"collapsed="false"seolevelmigrated="true">LineofBestFit</h4><ul><li><p>Drawalinethatpassesthroughthemiddleofthedots.</p></li><li><p>Minimizetheerrorsbetweentheobservedvaluesandtheestimatedvaluesfromtheline.</p></li><li><p>Theseerrorsarecalledresidualerrors(e).</p></li></ul><h4id="bdafd94f−86f8−4208−8ad9−5dd77d563c1a"data−toc−id="bdafd94f−86f8−4208−8ad9−5dd77d563c1a"collapsed="false"seolevelmigrated="true">ResidualError</h4><ul><li><p>Residualerror(e)=y - \hat{y}</p><ul><li><p>yistheobservedvalue.</p></li><li><p>\hat{y}istheestimatedvalueofy(fromthelineofbestfit).</p></li></ul></li><li><p>Goal:minimizethesumoftheseerrors.</p></li><li><p>Problem:Someerrorsarepositive(abovetheline),andsomearenegative(belowtheline).</p></li><li><p>Positiveandnegativeerrorscancanceleachotherout.</p></li></ul><h4id="1afca1e2−98c9−4b77−8317−8a736289c8ed"data−toc−id="1afca1e2−98c9−4b77−8317−8a736289c8ed"collapsed="false"seolevelmigrated="true">MinimizingErrors</h4><ul><li><p>Toavoidcancellation,squaretheerrorsbeforesummingthemup.</p></li><li><p>Minimizethesumofthesquarederrorstofindthelineofbestfit.</p></li></ul><h4id="d3bee64a−1b64−4d57−b408−1acf71e854a3"data−toc−id="d3bee64a−1b64−4d57−b408−1acf71e854a3"collapsed="false"seolevelmigrated="true">SourcesofVariation(Errors)</h4><ul><li><p>Twomainerrors:</p><ol><li><p>ResidualError(SSE):</p><ul><li><p>Errorbetweentheobservedyandtheestimated\hat{y}.</p></li><li><p>Minimizethiserror.</p></li></ul></li><li><p>ErrorDuetoRegression(SSR):</p><ul><li><p>Errorbetweenthelineofbestfitandthemeanofy.</p></li></ul></li></ol></li><li><p>SST(TotalSumofSquares)=SSE+SSR</p></li></ul><h4id="313e4cf8−9f6a−4421−b01e−814460611761"data−toc−id="313e4cf8−9f6a−4421−b01e−814460611761"collapsed="false"seolevelmigrated="true">VisualRepresentationofErrors</h4><ul><li><p>SSE=distancefromobserveddatatothelineofbestfit.</p></li><li><p>SSR=distancefromthelineofbestfittothemeanofy.</p></li><li><p>SST=totalvariation.</p></li><li><p>SST = \sum(y - \bar{y})^2</p></li></ul><h4id="51434936−e16a−40b0−86d8−65ca15f34136"data−toc−id="51434936−e16a−40b0−86d8−65ca15f34136"collapsed="false"seolevelmigrated="true">OrdinaryLeastSquares(OLS)Regression</h4><ul><li><p>Goal:Findalineofbestfitthatbestrepresentsthelinearrelationshipbetweenxandy.</p></li><li><p>Choosetheslopeandy−intercepttominimizethesumofsquarederrors(SSE).</p></li><li><p>Ifthemodelhasalotoferrors,it′snotaccurateforpredictions.</p></li></ul><h4id="3f73cfc4−7891−4111−9b30−fea14d28bc55"data−toc−id="3f73cfc4−7891−4111−9b30−fea14d28bc55"collapsed="false"seolevelmigrated="true">RegressionLineFormula</h4><ul><li><p>\hat{y} = b0 + b1x</p></li><li><p>Needtofindb0(y−intercept)andb1(slopecoefficient).</p></li><li><p>First,calculateb1,thenuseb1tocalculateb_0.</p></li></ul><h5id="a074af28−67d5−4ced−b834−3c180d19746a"data−toc−id="a074af28−67d5−4ced−b834−3c180d19746a"collapsed="false"seolevelmigrated="true">FormulaeforfindingtheLineofBestFit</h5><p>b_1 = \frac{\sum{(x - \bar{x})(y - \bar{y})}}{\sum{(x - \bar{x})^2}}</p><p>Alternateformulaforcalculatingb_1is:</p><p>b_1 = \frac{n\sum{xy} - \sum{x}\sum{y}}{n\sum{x^2} - (\sum{x})^2}</p><p>b0 = \bar{y} - b1\bar{x}</p><h4id="2dfa72c7−c4b8−4037−818d−d74e5e06325e"data−toc−id="2dfa72c7−c4b8−4037−818d−d74e5e06325e"collapsed="false"seolevelmigrated="true">Example:Mary′sAnalysis</h4><ul><li><p>Marywantstofindtherelationshipbetweenyearsofexperience(x)andsalary(y)oftechnicians.</p></li><li><p>Needtofindthelineofbestfit.</p></li><li><p>Interprettheanswer.</p></li><li><p>Questionsmightaskforascattergram(plotthedotsonly).</p></li><li><p>DataGiven:</p></li></ul><tablestyle="min−width:50px"><colgroup><colstyle="min−width:25px"><colstyle="min−width:25px"></colgroup><tbody><tr><thcolspan="1"rowspan="1"style="text−align:left;"><p>Experience(x)</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>Salary(y)</p></th></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>12</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>29</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>16</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>34</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>20</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>33</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>9</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>27</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>6</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>23</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>20</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>34</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>3</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>19</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>4</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>20</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>7</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>23</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>8</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>24</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>4</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>22</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>13</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>27</p></td></tr></tbody></table><ul><li><p>Inordertosolvethisquestion,weneedtofollowthebelowmentionedstepstoformulatethetable:</p></li></ul><tablestyle="min−width:200px"><colgroup><colstyle="min−width:25px"><colstyle="min−width:25px"><colstyle="min−width:25px"><colstyle="min−width:25px"><colstyle="min−width:25px"><colstyle="min−width:25px"><colstyle="min−width:25px"><colstyle="min−width:25px"></colgroup><tbody><tr><thcolspan="1"rowspan="1"style="text−align:left;"><p>Experience</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>Salary</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>x - \bar{x}</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>y - \bar{y}</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>(x-\bar{x})^2</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>x*y</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>x^2</p></th><thcolspan="1"rowspan="1"style="text−align:left;"><p>y^2</p></th></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>12</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>29</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>16</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>34</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td></tr><tr><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Andsoon..</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td><tdcolspan="1"rowspan="1"style="text−align:left;"><p>Value</p></td></tr></tbody></table><h4id="2f5a486e−3f60−4435−989c−5b6022a87d8d"data−toc−id="2f5a486e−3f60−4435−989c−5b6022a87d8d"collapsed="false"seolevelmigrated="true">StepsToSolve</h4><ol><li><p>Calculatex_barusing:\frac{\sum{x}}{n}</p></li><li><p>Calculatey_barusing:\frac{\sum{y}}{n}</p></li><li><p>Subsequentlycalculatex - xbar,y - ybar & (x - x_bar)^2</p></li><li><p>Calculate b1 using:b1 = \frac{\sum{(x - \bar{x})(y - \bar{y})}}{\sum{(x - \bar{x})^2}}</p></li><li><p>Caluclate b0 using:\bar{y} - b1\bar{x}</p></li></ol><h4id="12a9908e−f524−40dd−8a5b−300524582e1b"data−toc−id="12a9908e−f524−40dd−8a5b−300524582e1b"collapsed="false"seolevelmigrated="true">LineofBestFit(Solved)</h4><ul><li><p>\hat{y} = b0 + b1x</p></li><li><p>\hat{y} = 19.37 + 0.7x</p></li><li><p>Usingtheaboveequation,wecanfindoutthepredictedsalariesfortechnicianswith15and30yearsofexperince.</p></li></ul><h4id="9c2117b1−d614−4ff8−b121−511d361da2fd"data−toc−id="9c2117b1−d614−4ff8−b121−511d361da2fd"collapsed="false"seolevelmigrated="true">AccuracyofPredictions</h4><ul><li><p>Predictingfor15yearsismoreaccurate,asthisvalueiswithinthesample.</p></li><li><p>Predictionof30yearsofexperienceisoutsidethesample.Suchpreductionsareknownas<em>outofsample</em>andlesslikelytobeaccurate.</p></li><li><p>In−samplepredictionsaregenerallymorereliablethanout−of−samplepredictions.</p></li></ul><h4id="b9d235de−5b1f−4418−b263−83e021a676b0"data−toc−id="b9d235de−5b1f−4418−b263−83e021a676b0"collapsed="false"seolevelmigrated="true">CorrelationCoefficient(r)</h4><ul><li><p>Measuresthestrengthanddirectionofalinearrelationshipbetweentwovariables.</p></li><li><p>Rangesfrom−1to+1(nounits).</p><ul><li><p>+1:Perfectpositivelinearrelationship.</p></li><li><p>−1:Perfectnegativelinearrelationship.</p></li><li><p>0:Nolinearrelationship.</p></li></ul></li><li><p>Valuescloseto+1or−1indicateastrongrelationship.</p></li></ul><h5id="93a44a05−2afd−41f1−9b86−9c24ba700351"data−toc−id="93a44a05−2afd−41f1−9b86−9c24ba700351"collapsed="false"seolevelmigrated="true">FormulaforFindingtheCorrelationCoeffiecient</h5><p>r = \frac{n\sum{xy} - \sum{x}\sum{y}}{\sqrt{[n\sum{x^2}-(\sum{x})^2][n\sum{y^2} - (\sum{y})^2]}}</p><ul><li><p>Forthisformulayouwillneedtoaugmenttheinitialdatatableusingthefollowingstepsandcolumns:</p></li></ul><ol><li><p>Implementthetableshownin</p></li></ol><p></p><p>Iapologize,buttheprovidedtextdoesnotcontainadefinitionorexplanationofwhatthecoefficientofdeterminationis.Thisstatisticalmeasure,oftendenotedasR^2,indicatestheproportionofthevarianceinthedependentvariablethatispredictablefromtheindependentvariable(s).Insimplerterms,itexplainshowwelltheregressionmodelfitstheobserveddata.AhigherR^2$$ suggests a better fit.