{"id":12451,"date":"2023-10-11T15:00:24","date_gmt":"2023-10-11T09:30:24","guid":{"rendered":"https:\/\/skillioma.com\/learn\/courses\/ai-ml-nascomm-fsp\/lesson\/central-tendency-and-variability\/"},"modified":"2024-02-02T16:15:55","modified_gmt":"2024-02-02T10:45:55","slug":"central-tendency-and-variability","status":"publish","type":"lesson","link":"https:\/\/skillioma.com\/learn\/courses\/ai-ml-and-data-science-foundation-nascomm-fsp\/lesson\/central-tendency-and-variability\/","title":{"rendered":"Central Tendency and Variability"},"content":{"rendered":"<p><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">Central tendency and variability are fundamental concepts in statistics that describe where the values of a dataset are centred and how widely they are spread.<\/span><\/p>\n<p><b>&nbsp;<\/b><\/p>\n<h4><b>Central Tendency:<\/b><\/h4>\n<p><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">This refers to measures that identify the centre or &#8220;middle&#8221; of a distribution. The primary measures of central tendency are the mean, median, and mode; skewness and kurtosis, which close this list, describe the shape of a distribution rather than its centre:<\/span><\/p>\n<p><b>&nbsp;<\/b><\/p>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Mean (or Average):<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> It is the sum of all the values in a dataset divided by the number of values. It gives the arithmetic centre of the distribution.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Median:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> It is the middle value in a dataset when the values are arranged in ascending or descending order. If the dataset has an odd number of observations, the median is the middle number. 
If there&#8217;s an even number of observations, the median is the average of the two middle numbers.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Mode:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> It is the value that appears most frequently in a dataset. A distribution can be unimodal (one mode), bimodal (two modes), or multimodal (more than two modes).<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Skewness:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> Skewness measures the asymmetry of a distribution about its mean.<\/span>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Positive Skewness:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> When the tail on the right side (i.e., larger values) of the distribution is longer than on the left side, the distribution is positively skewed. In such cases, the mean is typically greater than the median.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Negative Skewness:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> When the tail on the left side (i.e., smaller values) is longer than on the right side, the distribution is negatively skewed. 
Here, the mean is typically less than the median.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Zero Skewness:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> When the values are symmetrically distributed around the mean, skewness is zero, implying that the mean and median are equal.<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Kurtosis:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> Kurtosis measures the &#8220;tailedness&#8221; of a distribution, i.e., the relative concentration of values in the centre, shoulders, and tails of a distribution compared to a normal distribution. The thresholds below refer to excess kurtosis (kurtosis minus 3), for which a normal distribution scores zero.<\/span>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><strong>Leptokurtic (Kurtosis &gt; 0):<\/strong> A distribution with positive kurtosis indicates that it has heavier tails and a sharper peak than a normal distribution. Such distributions are termed &#8220;leptokurtic.&#8221; They tend to have more extreme values (outliers).<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><strong>Platykurtic (Kurtosis &lt; 0):<\/strong> A distribution with negative kurtosis suggests it has lighter tails and a flatter peak than a normal distribution. Such distributions are termed &#8220;platykurtic.&#8221; They tend to have fewer extreme values.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><strong>Mesokurtic (Kurtosis \u2248 0):<\/strong> A distribution with kurtosis close to zero is similar in shape to a normal distribution in terms of its tailedness. 
Such distributions are termed &#8220;mesokurtic.&#8221;<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><b>&nbsp;<\/b><\/p>\n<p><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">In summary, while skewness provides insights about the direction and degree of asymmetry of a distribution, kurtosis provides information about the thickness of the tails and the peakedness of a distribution relative to a normal distribution. Both measures can offer additional insights into the distribution and behaviour of a dataset beyond measures of central tendency and dispersion.<\/span><\/p>\n<p><b>&nbsp;<\/b><\/p>\n<h4><b>Variability (or Dispersion):<\/b><\/h4>\n<p><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">This refers to measures that identify how spread out or scattered the values in a dataset are. The most common measures of variability are:<\/span><\/p>\n<p><b>&nbsp;<\/b><\/p>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Range:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> It is the difference between the highest and lowest values in a dataset.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Range = Highest value &#8211; Lowest value<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Variance:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> Variance is the average of the squared differences from the mean. For a sample, the sum of squared differences is usually divided by n &#8211; 1 rather than n.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Standard Deviation:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> It is the square root of the variance and measures, in the same units as the data, the typical distance between the data points and the mean.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Percentiles:<\/b><span 
style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> Percentiles divide a dataset into 100 equal parts, representing the percentage of data points that fall below a given value.<\/span>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">For example, the 25th percentile (also known as the first quartile) represents the value below which 25% of the data points fall, and the 75th percentile (the third quartile) represents the value below which 75% of the data points fall.<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Quartiles:<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> Quartiles divide a dataset into four equal parts, with each part representing 25% of the data points. Quartiles are often used in conjunction with box plots and are particularly helpful for understanding the central tendency and spread of data.<\/span>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">First Quartile (Q1): This is the 25th percentile and represents the value below which 25% of the data points fall.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">Second Quartile (Q2): This is the 50th percentile and represents the median, below which 50% of the data points fall.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">Third Quartile (Q3): This is the 75th percentile and represents the value below which 75% of the data points fall.<\/span><\/li>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">Fourth 
Quartile: This represents the values above the third quartile.<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<ul>\n<li style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"><b>Interquartile Range (IQR):<\/b><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\"> It is the difference between the third quartile (Q3, or the 75th percentile) and the first quartile (Q1, or the 25th percentile), i.e., IQR = Q3 &#8211; Q1. It measures the spread of the middle 50% of the data.<\/span><\/li>\n<\/ul>\n<p><b>&nbsp;<\/b><\/p>\n<p><span style=\"font-weight: 400\" data-mce-style=\"font-weight: 400;\">Understanding both the central tendency and variability of a dataset provides a comprehensive picture of its distribution. For example, two datasets can have the same mean but different standard deviations, indicating that one is more spread out than the other.<\/span><\/p>\n","protected":false},"comment_status":"open","ping_status":"closed","template":"","class_list":["post-12451","lesson","type-lesson","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/skillioma.com\/learn\/wp-json\/wp\/v2\/lesson\/12451","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/skillioma.com\/learn\/wp-json\/wp\/v2\/lesson"}],"about":[{"href":"https:\/\/skillioma.com\/learn\/wp-json\/wp\/v2\/types\/lesson"}],"replies":[{"embeddable":true,"href":"https:\/\/skillioma.com\/learn\/wp-json\/wp\/v2\/comments?post=12451"}],"wp:attachment":[{"href":"https:\/\/skillioma.com\/learn\/wp-json\/wp\/v2\/media?parent=12451"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}