Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 293 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 27.6 KiB |
| Average record size in memory | 96.4 B |
Variable types
| NUM | 6 |
|---|---|
| BOOL | 4 |
| CAT | 2 |
Reproduction
| Analysis started | 2020-12-10 21:13:36.661704 |
|---|---|
| Analysis finished | 2020-12-10 21:13:48.656092 |
| Duration | 11.99 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 293 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 146.6518771 |
|---|---|
| Minimum | 0 |
| Maximum | 293 |
| Zeros | 1 |
| Zeros (%) | 0.3% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 14.6 |
| Q1 | 73 |
| median | 147 |
| Q3 | 220 |
| 95-th percentile | 278.4 |
| Maximum | 293 |
| Range | 293 |
| Interquartile range (IQR) | 147 |
Descriptive statistics
| Standard deviation | 85.12019084 |
|---|---|
| Coefficient of variation (CV) | 0.5804234661 |
| Kurtosis | -1.203043787 |
| Mean | 146.6518771 |
| Median Absolute Deviation (MAD) | 74 |
| Skewness | -0.004896904897 |
| Sum | 42969 |
| Variance | 7245.446889 |
| Monotocity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 293 | 1 | 0.3% | |
| 109 | 1 | 0.3% | |
| 92 | 1 | 0.3% | |
| 93 | 1 | 0.3% | |
| 94 | 1 | 0.3% | |
| 95 | 1 | 0.3% | |
| 96 | 1 | 0.3% | |
| 97 | 1 | 0.3% | |
| 98 | 1 | 0.3% | |
| 99 | 1 | 0.3% | |
| 100 | 1 | 0.3% | |
| 101 | 1 | 0.3% | |
| 103 | 1 | 0.3% | |
| 104 | 1 | 0.3% | |
| 105 | 1 | 0.3% | |
| 106 | 1 | 0.3% | |
| 107 | 1 | 0.3% | |
| 91 | 1 | 0.3% | |
| 90 | 1 | 0.3% | |
| 89 | 1 | 0.3% | |
| 80 | 1 | 0.3% | |
| 74 | 1 | 0.3% | |
| 75 | 1 | 0.3% | |
| 76 | 1 | 0.3% | |
| 77 | 1 | 0.3% | |
| Other values (268) | 268 | 91.5% |
| Value | Count | Frequency (%) | |
| 0 | 1 | 0.3% | |
| 1 | 1 | 0.3% | |
| 2 | 1 | 0.3% | |
| 3 | 1 | 0.3% | |
| 4 | 1 | 0.3% | |
| 5 | 1 | 0.3% | |
| 6 | 1 | 0.3% | |
| 7 | 1 | 0.3% | |
| 8 | 1 | 0.3% | |
| 9 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 293 | 1 | 0.3% | |
| 292 | 1 | 0.3% | |
| 291 | 1 | 0.3% | |
| 290 | 1 | 0.3% | |
| 289 | 1 | 0.3% | |
| 288 | 1 | 0.3% | |
| 287 | 1 | 0.3% | |
| 286 | 1 | 0.3% | |
| 285 | 1 | 0.3% | |
| 284 | 1 | 0.3% |
age
Real number (ℝ≥0)
| Distinct | 38 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.8225256 |
|---|---|
| Minimum | 28 |
| Maximum | 66 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 28 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 42 |
| median | 49 |
| Q3 | 54 |
| 95-th percentile | 59 |
| Maximum | 66 |
| Range | 38 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 7.824875011 |
|---|---|
| Coefficient of variation (CV) | 0.1636232071 |
| Kurtosis | -0.5112967113 |
| Mean | 47.8225256 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.2822808516 |
| Sum | 14012 |
| Variance | 61.22866894 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=38)
| Value | Count | Frequency (%) | |
| 54 | 25 | 8.5% | |
| 48 | 19 | 6.5% | |
| 52 | 17 | 5.8% | |
| 55 | 15 | 5.1% | |
| 49 | 14 | 4.8% | |
| 46 | 13 | 4.4% | |
| 53 | 12 | 4.1% | |
| 43 | 12 | 4.1% | |
| 50 | 12 | 4.1% | |
| 39 | 11 | 3.8% | |
| 41 | 11 | 3.8% | |
| 47 | 10 | 3.4% | |
| 56 | 10 | 3.4% | |
| 51 | 9 | 3.1% | |
| 58 | 9 | 3.1% | |
| 59 | 8 | 2.7% | |
| 37 | 8 | 2.7% | |
| 45 | 8 | 2.7% | |
| 44 | 7 | 2.4% | |
| 42 | 7 | 2.4% | |
| 40 | 7 | 2.4% | |
| 38 | 7 | 2.4% | |
| 35 | 5 | 1.7% | |
| 57 | 5 | 1.7% | |
| 36 | 5 | 1.7% | |
| Other values (13) | 27 | 9.2% |
| Value | Count | Frequency (%) | |
| 28 | 1 | 0.3% | |
| 29 | 2 | 0.7% | |
| 30 | 1 | 0.3% | |
| 31 | 2 | 0.7% | |
| 32 | 4 | 1.4% | |
| 33 | 2 | 0.7% | |
| 34 | 4 | 1.4% | |
| 35 | 5 | 1.7% | |
| 36 | 5 | 1.7% | |
| 37 | 8 | 2.7% |
| Value | Count | Frequency (%) | |
| 66 | 1 | 0.3% | |
| 65 | 3 | 1.0% | |
| 63 | 1 | 0.3% | |
| 62 | 2 | 0.7% | |
| 61 | 2 | 0.7% | |
| 60 | 2 | 0.7% | |
| 59 | 8 | 2.7% | |
| 58 | 9 | 3.1% | |
| 57 | 5 | 1.7% | |
| 56 | 10 | 3.4% |
sex
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 213 | 72.7% | |
| 0 | 80 | 27.3% |
cp
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 4 | |
|---|---|
| 2 | |
| 3 | |
| 1 | 11 |
| Value | Count | Frequency (%) | |
| 4 | 123 | 42.0% | |
| 2 | 105 | 35.8% | |
| 3 | 54 | 18.4% | |
| 1 | 11 | 3.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 4 | 123 | 42.0% | |
| 2 | 105 | 35.8% | |
| 3 | 54 | 18.4% | |
| 1 | 11 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 293 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 4 | 123 | 42.0% | |
| 2 | 105 | 35.8% | |
| 3 | 54 | 18.4% | |
| 1 | 11 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 293 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 4 | 123 | 42.0% | |
| 2 | 105 | 35.8% | |
| 3 | 54 | 18.4% | |
| 1 | 11 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 293 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 4 | 123 | 42.0% | |
| 2 | 105 | 35.8% | |
| 3 | 54 | 18.4% | |
| 1 | 11 | 3.8% |
trestbps
Real number (ℝ≥0)
| Distinct | 32 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 132.6606949 |
|---|---|
| Minimum | 92 |
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 92 |
|---|---|
| 5-th percentile | 110 |
| Q1 | 120 |
| median | 130 |
| Q3 | 140 |
| 95-th percentile | 160 |
| Maximum | 200 |
| Range | 108 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 17.57678272 |
|---|---|
| Coefficient of variation (CV) | 0.1324942759 |
| Kurtosis | 0.8312457912 |
| Mean | 132.6606949 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.7386292738 |
| Sum | 38869.58362 |
| Variance | 308.9432907 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=32)
| Value | Count | Frequency (%) | |
| 120 | 65 | 22.2% | |
| 130 | 54 | 18.4% | |
| 140 | 50 | 17.1% | |
| 150 | 23 | 7.8% | |
| 110 | 20 | 6.8% | |
| 160 | 20 | 6.8% | |
| 125 | 8 | 2.7% | |
| 100 | 6 | 2.0% | |
| 180 | 6 | 2.0% | |
| 145 | 5 | 1.7% | |
| 170 | 5 | 1.7% | |
| 135 | 5 | 1.7% | |
| 112 | 3 | 1.0% | |
| 118 | 2 | 0.7% | |
| 124 | 2 | 0.7% | |
| 115 | 2 | 0.7% | |
| 122 | 2 | 0.7% | |
| 105 | 1 | 0.3% | |
| 98 | 1 | 0.3% | |
| 132.5836177 | 1 | 0.3% | |
| 190 | 1 | 0.3% | |
| 155 | 1 | 0.3% | |
| 132 | 1 | 0.3% | |
| 108 | 1 | 0.3% | |
| 113 | 1 | 0.3% | |
| Other values (7) | 7 | 2.4% |
| Value | Count | Frequency (%) | |
| 92 | 1 | 0.3% | |
| 98 | 1 | 0.3% | |
| 100 | 6 | 2.0% | |
| 105 | 1 | 0.3% | |
| 106 | 1 | 0.3% | |
| 108 | 1 | 0.3% | |
| 110 | 20 | 6.8% | |
| 112 | 3 | 1.0% | |
| 113 | 1 | 0.3% | |
| 115 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 200 | 1 | 0.3% | |
| 190 | 1 | 0.3% | |
| 180 | 6 | 2.0% | |
| 170 | 5 | 1.7% | |
| 160 | 20 | 6.8% | |
| 155 | 1 | 0.3% | |
| 150 | 23 | 7.8% | |
| 145 | 5 | 1.7% | |
| 142 | 1 | 0.3% | |
| 140 | 50 | 17.1% |
chol
Real number (ℝ≥0)
| Distinct | 154 |
|---|---|
| Distinct (%) | 52.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 250.8487085 |
|---|---|
| Minimum | 85 |
| Maximum | 603 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 85 |
|---|---|
| 5-th percentile | 165.2 |
| Q1 | 211 |
| median | 248 |
| Q3 | 277 |
| 95-th percentile | 350.2 |
| Maximum | 603 |
| Range | 518 |
| Interquartile range (IQR) | 66 |
Descriptive statistics
| Standard deviation | 65.05905634 |
|---|---|
| Coefficient of variation (CV) | 0.2593557556 |
| Kurtosis | 5.187495865 |
| Mean | 250.8487085 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | 1.489808115 |
| Sum | 73498.67159 |
| Variance | 4232.680812 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 250.8487085 | 22 | 7.5% | |
| 275 | 5 | 1.7% | |
| 230 | 5 | 1.7% | |
| 246 | 5 | 1.7% | |
| 216 | 4 | 1.4% | |
| 211 | 4 | 1.4% | |
| 263 | 4 | 1.4% | |
| 260 | 4 | 1.4% | |
| 224 | 4 | 1.4% | |
| 238 | 4 | 1.4% | |
| 215 | 4 | 1.4% | |
| 196 | 4 | 1.4% | |
| 237 | 4 | 1.4% | |
| 207 | 3 | 1.0% | |
| 288 | 3 | 1.0% | |
| 248 | 3 | 1.0% | |
| 182 | 3 | 1.0% | |
| 292 | 3 | 1.0% | |
| 223 | 3 | 1.0% | |
| 193 | 3 | 1.0% | |
| 297 | 3 | 1.0% | |
| 213 | 3 | 1.0% | |
| 268 | 3 | 1.0% | |
| 184 | 3 | 1.0% | |
| 291 | 3 | 1.0% | |
| Other values (129) | 184 | 62.8% |
| Value | Count | Frequency (%) | |
| 85 | 1 | 0.3% | |
| 100 | 1 | 0.3% | |
| 117 | 1 | 0.3% | |
| 129 | 1 | 0.3% | |
| 132 | 1 | 0.3% | |
| 147 | 2 | 0.7% | |
| 156 | 1 | 0.3% | |
| 160 | 3 | 1.0% | |
| 161 | 1 | 0.3% | |
| 163 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 603 | 1 | 0.3% | |
| 529 | 1 | 0.3% | |
| 518 | 1 | 0.3% | |
| 491 | 1 | 0.3% | |
| 468 | 1 | 0.3% | |
| 466 | 1 | 0.3% | |
| 412 | 1 | 0.3% | |
| 404 | 1 | 0.3% | |
| 394 | 1 | 0.3% | |
| 393 | 1 | 0.3% |
fbs
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 20 |
| Value | Count | Frequency (%) | |
| 0 | 273 | 93.2% | |
| 1 | 20 | 6.8% |
restecg
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 6 |
| Value | Count | Frequency (%) | |
| 0 | 235 | 80.2% | |
| 1 | 52 | 17.7% | |
| 2 | 6 | 2.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 528 | 60.1% | |
| . | 293 | 33.3% | |
| 1 | 52 | 5.9% | |
| 2 | 6 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 586 | 66.7% | |
| Other Punctuation | 293 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 528 | 90.1% | |
| 1 | 52 | 8.9% | |
| 2 | 6 | 1.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 293 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 879 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 528 | 60.1% | |
| . | 293 | 33.3% | |
| 1 | 52 | 5.9% | |
| 2 | 6 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 879 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 528 | 60.1% | |
| . | 293 | 33.3% | |
| 1 | 52 | 5.9% | |
| 2 | 6 | 0.7% |
thalach
Real number (ℝ≥0)
| Distinct | 72 |
|---|---|
| Distinct (%) | 24.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 139.0584631 |
|---|---|
| Minimum | 82 |
| Maximum | 190 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 82 |
|---|---|
| 5-th percentile | 98 |
| Q1 | 122 |
| median | 140 |
| Q3 | 155 |
| 95-th percentile | 176.8 |
| Maximum | 190 |
| Range | 108 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 23.55800299 |
|---|---|
| Coefficient of variation (CV) | 0.169410782 |
| Kurtosis | -0.579829956 |
| Mean | 139.0584631 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | -0.08053560888 |
| Sum | 40744.12969 |
| Variance | 554.9795047 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 150 | 29 | 9.9% | |
| 140 | 21 | 7.2% | |
| 130 | 17 | 5.8% | |
| 170 | 14 | 4.8% | |
| 160 | 12 | 4.1% | |
| 120 | 11 | 3.8% | |
| 110 | 9 | 3.1% | |
| 142 | 8 | 2.7% | |
| 125 | 8 | 2.7% | |
| 135 | 7 | 2.4% | |
| 100 | 7 | 2.4% | |
| 155 | 7 | 2.4% | |
| 138 | 6 | 2.0% | |
| 115 | 6 | 2.0% | |
| 175 | 6 | 2.0% | |
| 180 | 6 | 2.0% | |
| 145 | 6 | 2.0% | |
| 118 | 5 | 1.7% | |
| 122 | 5 | 1.7% | |
| 124 | 4 | 1.4% | |
| 116 | 4 | 1.4% | |
| 137 | 4 | 1.4% | |
| 134 | 4 | 1.4% | |
| 98 | 4 | 1.4% | |
| 165 | 4 | 1.4% | |
| Other values (47) | 79 | 27.0% |
| Value | Count | Frequency (%) | |
| 82 | 1 | 0.3% | |
| 87 | 1 | 0.3% | |
| 90 | 1 | 0.3% | |
| 91 | 1 | 0.3% | |
| 92 | 3 | 1.0% | |
| 94 | 2 | 0.7% | |
| 96 | 3 | 1.0% | |
| 98 | 4 | 1.4% | |
| 99 | 2 | 0.7% | |
| 100 | 7 | 2.4% |
| Value | Count | Frequency (%) | |
| 190 | 1 | 0.3% | |
| 188 | 1 | 0.3% | |
| 185 | 3 | 1.0% | |
| 184 | 3 | 1.0% | |
| 180 | 6 | 2.0% | |
| 178 | 1 | 0.3% | |
| 176 | 1 | 0.3% | |
| 175 | 6 | 2.0% | |
| 174 | 2 | 0.7% | |
| 172 | 3 | 1.0% |
exang
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 204 | 69.6% | |
| 1 | 89 | 30.4% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5880546075 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 188 |
| Zeros (%) | 64.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9095539158 |
|---|---|
| Coefficient of variation (CV) | 1.546716758 |
| Kurtosis | 2.163645114 |
| Mean | 0.5880546075 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.543805087 |
| Sum | 172.3 |
| Variance | 0.8272883258 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 188 | 64.2% | |
| 1 | 41 | 14.0% | |
| 2 | 31 | 10.6% | |
| 1.5 | 16 | 5.5% | |
| 3 | 9 | 3.1% | |
| 2.5 | 3 | 1.0% | |
| 0.5 | 2 | 0.7% | |
| 0.8 | 1 | 0.3% | |
| 5 | 1 | 0.3% | |
| 4 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 0 | 188 | 64.2% | |
| 0.5 | 2 | 0.7% | |
| 0.8 | 1 | 0.3% | |
| 1 | 41 | 14.0% | |
| 1.5 | 16 | 5.5% | |
| 2 | 31 | 10.6% | |
| 2.5 | 3 | 1.0% | |
| 3 | 9 | 3.1% | |
| 4 | 1 | 0.3% | |
| 5 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 5 | 1 | 0.3% | |
| 4 | 1 | 0.3% | |
| 3 | 9 | 3.1% | |
| 2.5 | 3 | 1.0% | |
| 2 | 31 | 10.6% | |
| 1.5 | 16 | 5.5% | |
| 1 | 41 | 14.0% | |
| 0.8 | 1 | 0.3% | |
| 0.5 | 2 | 0.7% | |
| 0 | 188 | 64.2% |
num
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 187 | 63.8% | |
| 1 | 106 | 36.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | age | sex | cp | trestbps | chol | fbs | restecg | thalach | exang | oldpeak | num | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 28 | 1 | 2 | 130.0 | 132.000000 | 0.0 | 2.0 | 185.0 | 0.0 | 0.0 | 0 |
| 1 | 1 | 29 | 1 | 2 | 120.0 | 243.000000 | 0.0 | 0.0 | 160.0 | 0.0 | 0.0 | 0 |
| 2 | 2 | 29 | 1 | 2 | 140.0 | 250.848708 | 0.0 | 0.0 | 170.0 | 0.0 | 0.0 | 0 |
| 3 | 3 | 30 | 0 | 1 | 170.0 | 237.000000 | 0.0 | 1.0 | 170.0 | 0.0 | 0.0 | 0 |
| 4 | 4 | 31 | 0 | 2 | 100.0 | 219.000000 | 0.0 | 1.0 | 150.0 | 0.0 | 0.0 | 0 |
| 5 | 5 | 32 | 0 | 2 | 105.0 | 198.000000 | 0.0 | 0.0 | 165.0 | 0.0 | 0.0 | 0 |
| 6 | 6 | 32 | 1 | 2 | 110.0 | 225.000000 | 0.0 | 0.0 | 184.0 | 0.0 | 0.0 | 0 |
| 7 | 7 | 32 | 1 | 2 | 125.0 | 254.000000 | 0.0 | 0.0 | 155.0 | 0.0 | 0.0 | 0 |
| 8 | 8 | 33 | 1 | 3 | 120.0 | 298.000000 | 0.0 | 0.0 | 185.0 | 0.0 | 0.0 | 0 |
| 9 | 9 | 34 | 0 | 2 | 130.0 | 161.000000 | 0.0 | 0.0 | 190.0 | 0.0 | 0.0 | 0 |
Last rows
| df_index | age | sex | cp | trestbps | chol | fbs | restecg | thalach | exang | oldpeak | num | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 283 | 284 | 49 | 1 | 4 | 128.0 | 212.0 | 0.0 | 0.0 | 96.0 | 1.0 | 0.0 | 1 |
| 284 | 285 | 49 | 1 | 4 | 150.0 | 222.0 | 0.0 | 0.0 | 122.0 | 0.0 | 2.0 | 1 |
| 285 | 286 | 50 | 1 | 4 | 140.0 | 231.0 | 0.0 | 1.0 | 140.0 | 1.0 | 5.0 | 1 |
| 286 | 287 | 50 | 1 | 4 | 140.0 | 341.0 | 0.0 | 1.0 | 125.0 | 1.0 | 2.5 | 1 |
| 287 | 288 | 52 | 1 | 4 | 140.0 | 266.0 | 0.0 | 0.0 | 134.0 | 1.0 | 2.0 | 1 |
| 288 | 289 | 52 | 1 | 4 | 160.0 | 331.0 | 0.0 | 0.0 | 94.0 | 1.0 | 2.5 | 1 |
| 289 | 290 | 54 | 0 | 3 | 130.0 | 294.0 | 0.0 | 1.0 | 100.0 | 1.0 | 0.0 | 1 |
| 290 | 291 | 56 | 1 | 4 | 155.0 | 342.0 | 1.0 | 0.0 | 150.0 | 1.0 | 3.0 | 1 |
| 291 | 292 | 58 | 0 | 2 | 180.0 | 393.0 | 0.0 | 0.0 | 110.0 | 1.0 | 1.0 | 1 |
| 292 | 293 | 65 | 1 | 4 | 130.0 | 275.0 | 0.0 | 1.0 | 115.0 | 1.0 | 1.0 | 1 |