Lecture 9: Assessing correlations
-be able to explain why identifying correlations is useful for data wrangling/analysis
-understand what is correlation between a pair of features
-understand how correlation can be identified using visualisation
-understand the concept of a linear relation, versus a non linear relation for a pair of features
-understand why the concept of correlation is important, where it is used and understand why correlation is not the same as causation
A causes B
-understand the use of Euclidean distance for computing correlation between two features and its advantages/ disadvantages
-understand the use of Pearson correlation coefficient for computing correlation between two features and its advantages/ disadvantages
-understand the meaning of the variables in the Pearson correlation coefficient formula and how they can be calculated. Be able to compute this coefficient on a simple pair of features. The formula for this coefficient will be provided on the exam.
Example
-be able to interpret the meaning of a computed Pearson correlation coefficient
We will define a correlation measure rxy, assessing samples
from two features x and y
– Assess how close their scatter plot is to a straight line (a
linear relationship)
-understand the advantages and disadvantages of using the Pearson correlation coefficient for assessing the degree of relationship between two features
SAME AS PEARSON COORELATION
原创声明:本文系作者授权腾讯云开发者社区发表,未经许可,不得转载。
如有侵权,请联系 cloudcommunity@tencent.com 删除。
原创声明:本文系作者授权腾讯云开发者社区发表,未经许可,不得转载。
如有侵权,请联系 cloudcommunity@tencent.com 删除。