Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 7 |
| Number of observations | 12000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 65 |
| Duplicate rows (%) | 0.5% |
| Total size in memory | 656.4 KiB |
| Average record size in memory | 56.0 B |
Variable types
| Type | Count |
|---|---|
| Numeric | 6 |
| Categorical | 1 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 65 (0.5%) duplicate rows | Duplicates |
| fl is highly correlated with e | High correlation |
| ia is highly correlated with s and 1 other fields | High correlation |
| s is highly correlated with ia and 1 other fields | High correlation |
| t is highly correlated with ia and 1 other fields | High correlation |
| e is highly correlated with fl | High correlation |
| fl is highly correlated with e | High correlation |
| ia is highly correlated with s | High correlation |
| s is highly correlated with ia and 1 other fields | High correlation |
| t is highly correlated with s | High correlation |
| e is highly correlated with fl | High correlation |
| fl is highly correlated with e | High correlation |
| e is highly correlated with fl | High correlation |
| fl is highly correlated with e | High correlation |
| ia is highly correlated with s and 2 other fields | High correlation |
| s is highly correlated with ia and 1 other fields | High correlation |
| t is highly correlated with ia and 2 other fields | High correlation |
| e is highly correlated with fl | High correlation |
| et is highly correlated with ia and 1 other fields | High correlation |
Reproduction
| Field | Value |
|---|---|
| Analysis started | 2022-06-14 18:59:43.124639 |
| Analysis finished | 2022-06-14 18:59:49.642948 |
| Duration | 6.52 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
fl
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 11684 |
| Distinct (%) | 97.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4168125574 |
| Minimum | 0.029390577 |
| Maximum | 0.9492131 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.9 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.029390577 |
| 5-th percentile | 0.1360574175 |
| Q1 | 0.2850291325 |
| median | 0.40379098 |
| Q3 | 0.5011375575 |
| 95-th percentile | 0.786797873 |
| Maximum | 0.9492131 |
| Range | 0.919822523 |
| Interquartile range (IQR) | 0.216108425 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.1852367088 |
| Coefficient of variation (CV) | 0.4444124954 |
| Kurtosis | -0.09893912846 |
| Mean | 0.4168125574 |
| Median Absolute Deviation (MAD) | 0.10433418 |
| Skewness | 0.4737160247 |
| Sum | 5001.750689 |
| Variance | 0.03431263827 |
| Monotonicity | Not monotonic |
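Several rows in the quantile and descriptive tables are derived from others by simple arithmetic. As a quick consistency check, the sketch below recomputes the range, IQR, coefficient of variation, variance, and sum for the first numeric variable from the reported minimum, maximum, quartiles, mean, and standard deviation. The variable names are illustrative, and `n` is the number of observations from the dataset statistics.

```python
# Values copied from the report tables for the first numeric variable.
n = 12000
minimum, maximum = 0.029390577, 0.9492131
q1, q3 = 0.2850291325, 0.5011375575
mean, std = 0.4168125574, 0.1852367088

value_range = maximum - minimum   # reported Range: 0.919822523
iqr = q3 - q1                     # reported IQR: 0.216108425
cv = std / mean                   # reported CV: 0.4444124954
variance = std ** 2               # reported Variance: 0.03431263827
total = mean * n                  # reported Sum: 5001.750689

for name, value in [("range", value_range), ("iqr", iqr), ("cv", cv),
                    ("variance", variance), ("sum", total)]:
    print(f"{name}: {value:.10g}")
```

The recomputed values agree with the reported ones up to the rounding of the inputs.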
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.8437023 | 88 | 0.7% |
| 0.90289146 | 40 | 0.3% |
| 0.94800824 | 13 | 0.1% |
| 0.4149162 | 11 | 0.1% |
| 0.4143504 | 11 | 0.1% |
| 0.4311919 | 10 | 0.1% |
| 0.44937053 | 10 | 0.1% |
| 0.47760788 | 10 | 0.1% |
| 0.40379098 | 8 | 0.1% |
| 0.41839996 | 8 | 0.1% |
| Other values (11674) | 11791 | |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.029390577 | 1 | |
| 0.029955925 | 1 | |
| 0.033604544 | 1 | |
| 0.033606403 | 1 | |
| 0.034301028 | 1 | |
| 0.036454223 | 1 | |
| 0.038488213 | 1 | |
| 0.03961533 | 1 | |
| 0.040670637 | 1 | |
| 0.041285664 | 1 | |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.9492131 | 2 | < 0.1% |
| 0.94800824 | 13 | |
| 0.9440342 | 1 | < 0.1% |
| 0.94353414 | 1 | < 0.1% |
| 0.9394918 | 1 | < 0.1% |
| 0.92673665 | 1 | < 0.1% |
| 0.9119069 | 1 | < 0.1% |
| 0.90620583 | 1 | < 0.1% |
| 0.9056027 | 1 | < 0.1% |
| 0.90545267 | 1 | < 0.1% |
ia
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 10409 |
| Distinct (%) | 86.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4382163503 |
| Minimum | 0.037709847 |
| Maximum | 0.99387753 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.9 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.037709847 |
| 5-th percentile | 0.11280035 |
| Q1 | 0.23262316 |
| median | 0.35341949 |
| Q3 | 0.6069273 |
| 95-th percentile | 0.9490587 |
| Maximum | 0.99387753 |
| Range | 0.956167683 |
| Interquartile range (IQR) | 0.37430414 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2662644614 |
| Coefficient of variation (CV) | 0.6076096003 |
| Kurtosis | -0.7448420262 |
| Mean | 0.4382163503 |
| Median Absolute Deviation (MAD) | 0.17451961 |
| Skewness | 0.6597911337 |
| Sum | 5258.596203 |
| Variance | 0.07089676341 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.6069273 | 165 | 1.4% |
| 0.9490587 | 158 | 1.3% |
| 0.9149589 | 136 | 1.1% |
| 0.6342086 | 124 | 1.0% |
| 0.9340332 | 85 | 0.7% |
| 0.9586736 | 80 | 0.7% |
| 0.81136084 | 77 | 0.6% |
| 0.6894879 | 73 | 0.6% |
| 0.90638536 | 73 | 0.6% |
| 0.8772666 | 60 | 0.5% |
| Other values (10399) | 10969 | |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.037709847 | 1 | |
| 0.044970874 | 1 | |
| 0.04508064 | 1 | |
| 0.045482233 | 1 | |
| 0.049547765 | 1 | |
| 0.05189743 | 1 | |
| 0.052189957 | 1 | |
| 0.05362984 | 1 | |
| 0.054279275 | 1 | |
| 0.055933435 | 1 | |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.99387753 | 3 | |
| 0.9930588 | 1 | < 0.1% |
| 0.99256897 | 1 | < 0.1% |
| 0.9921237 | 1 | < 0.1% |
| 0.99065804 | 1 | < 0.1% |
| 0.99036723 | 1 | < 0.1% |
| 0.99019986 | 1 | < 0.1% |
| 0.9901501 | 1 | < 0.1% |
| 0.9900025 | 1 | < 0.1% |
| 0.98979497 | 1 | < 0.1% |
s
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 7420 |
| Distinct (%) | 61.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.801580311 |
| Minimum | 0.024430014 |
| Maximum | 0.9943364 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.9 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.024430014 |
| 5-th percentile | 0.4998674795 |
| Q1 | 0.699709325 |
| median | 0.8435214 |
| Q3 | 0.93682694 |
| 95-th percentile | 0.97533107 |
| Maximum | 0.9943364 |
| Range | 0.969906386 |
| Interquartile range (IQR) | 0.237117615 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.1620618911 |
| Coefficient of variation (CV) | 0.2021779838 |
| Kurtosis | 0.3933632595 |
| Mean | 0.801580311 |
| Median Absolute Deviation (MAD) | 0.1099997 |
| Skewness | -0.978641045 |
| Sum | 9618.963732 |
| Variance | 0.02626405655 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.9605577 | 349 | 2.9% |
| 0.9535211 | 341 | 2.8% |
| 0.9658004 | 317 | 2.6% |
| 0.9076751 | 311 | 2.6% |
| 0.93682694 | 283 | 2.4% |
| 0.9098734 | 278 | 2.3% |
| 0.97533107 | 278 | 2.3% |
| 0.9811452 | 263 | 2.2% |
| 0.72723687 | 191 | 1.6% |
| 0.59276253 | 187 | 1.6% |
| Other values (7410) | 9202 | |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.024430014 | 1 | |
| 0.06102065 | 1 | |
| 0.071603596 | 1 | |
| 0.07817141 | 1 | |
| 0.0788923 | 1 | |
| 0.08381921 | 1 | |
| 0.1061011 | 1 | |
| 0.11373025 | 1 | |
| 0.11440568 | 1 | |
| 0.11515171 | 1 | |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.9943364 | 1 | < 0.1% |
| 0.99291486 | 1 | < 0.1% |
| 0.9856039 | 1 | < 0.1% |
| 0.9854358 | 69 | |
| 0.9854041 | 1 | < 0.1% |
| 0.98533255 | 1 | < 0.1% |
| 0.98518544 | 1 | < 0.1% |
| 0.98511934 | 1 | < 0.1% |
| 0.98511195 | 1 | < 0.1% |
| 0.9850903 | 1 | < 0.1% |
t
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 7967 |
| Distinct (%) | 66.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8218832335 |
| Minimum | 0.024729094 |
| Maximum | 0.984462 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.9 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.024729094 |
| 5-th percentile | 0.70857113 |
| Q1 | 0.7473183 |
| median | 0.8214078 |
| Q3 | 0.89414346 |
| 95-th percentile | 0.95164716 |
| Maximum | 0.984462 |
| Range | 0.959732906 |
| Interquartile range (IQR) | 0.14682516 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.09560208289 |
| Coefficient of variation (CV) | 0.1163207607 |
| Kurtosis | 7.324972053 |
| Mean | 0.8218832335 |
| Median Absolute Deviation (MAD) | 0.0740895 |
| Skewness | -1.288609868 |
| Sum | 9862.598802 |
| Variance | 0.009139758252 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.7473183 | 782 | 6.5% |
| 0.8214078 | 521 | 4.3% |
| 0.95164716 | 455 | 3.8% |
| 0.9450092 | 453 | 3.8% |
| 0.8546382 | 410 | 3.4% |
| 0.86612374 | 292 | 2.4% |
| 0.9456558 | 280 | 2.3% |
| 0.88678163 | 256 | 2.1% |
| 0.91536796 | 232 | 1.9% |
| 0.89414346 | 180 | 1.5% |
| Other values (7957) | 8139 | |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.024729094 | 1 | |
| 0.037766483 | 1 | |
| 0.052537628 | 1 | |
| 0.06294791 | 1 | |
| 0.079592496 | 1 | |
| 0.08782799 | 1 | |
| 0.106605686 | 1 | |
| 0.12488287 | 1 | |
| 0.12789373 | 1 | |
| 0.13427275 | 1 | |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.984462 | 4 | |
| 0.9810698 | 1 | < 0.1% |
| 0.9798555 | 1 | < 0.1% |
| 0.97813374 | 1 | < 0.1% |
| 0.9753948 | 1 | < 0.1% |
| 0.9749705 | 1 | < 0.1% |
| 0.9745511 | 1 | < 0.1% |
| 0.9736628 | 1 | < 0.1% |
| 0.97301006 | 1 | < 0.1% |
| 0.97179496 | 1 | < 0.1% |
e
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 8215 |
| Distinct (%) | 68.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5565736624 |
| Minimum | 0.017585102 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.9 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.017585102 |
| 5-th percentile | 0.1259514565 |
| Q1 | 0.3056789975 |
| median | 0.5481356 |
| Q3 | 0.820784365 |
| 95-th percentile | 0.97691213 |
| Maximum | 1 |
| Range | 0.982414898 |
| Interquartile range (IQR) | 0.5151053675 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2865405723 |
| Coefficient of variation (CV) | 0.5148295574 |
| Kurtosis | -1.294983056 |
| Mean | 0.5565736624 |
| Median Absolute Deviation (MAD) | 0.2510637 |
| Skewness | -0.03276281222 |
| Sum | 6678.883949 |
| Variance | 0.08210549957 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.6522458 | 815 | 6.8% |
| 0.51694775 | 336 | 2.8% |
| 0.9770172 | 295 | 2.5% |
| 0.7991993 | 247 | 2.1% |
| 0.7722222 | 219 | 1.8% |
| 0.87872106 | 200 | 1.7% |
| 0.95597285 | 180 | 1.5% |
| 0.8688342 | 165 | 1.4% |
| 0.90230405 | 152 | 1.3% |
| 0.69036824 | 133 | 1.1% |
| Other values (8205) | 9258 | |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.017585102 | 1 | |
| 0.018334996 | 1 | |
| 0.018437412 | 1 | |
| 0.018739773 | 1 | |
| 0.018762376 | 1 | |
| 0.02262627 | 1 | |
| 0.025222166 | 1 | |
| 0.026389748 | 1 | |
| 0.027400458 | 1 | |
| 0.029437775 | 1 | |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2 | < 0.1% |
| 0.9996084 | 1 | < 0.1% |
| 0.99830574 | 1 | < 0.1% |
| 0.99696827 | 1 | < 0.1% |
| 0.9960866 | 1 | < 0.1% |
| 0.9951251 | 18 | |
| 0.9950257 | 1 | < 0.1% |
| 0.99496084 | 1 | < 0.1% |
| 0.99490833 | 1 | < 0.1% |
| 0.99471945 | 1 | < 0.1% |
ea
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 10554 |
| Distinct (%) | 87.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4052619229 |
| Minimum | 0.026123213 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.9 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.026123213 |
| 5-th percentile | 0.141569518 |
| Q1 | 0.2243731675 |
| median | 0.30714799 |
| Q3 | 0.4974262425 |
| 95-th percentile | 0.975200513 |
| Maximum | 1 |
| Range | 0.973876787 |
| Interquartile range (IQR) | 0.273053075 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2566860404 |
| Coefficient of variation (CV) | 0.6333830689 |
| Kurtosis | 0.0605287453 |
| Mean | 0.4052619229 |
| Median Absolute Deviation (MAD) | 0.0987772 |
| Skewness | 1.146051382 |
| Sum | 4863.143075 |
| Variance | 0.06588772336 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.9083624 | 159 | 1.3% |
| 0.9456398 | 146 | 1.2% |
| 0.9800452 | 143 | 1.2% |
| 0.9262451 | 125 | 1.0% |
| 0.80709213 | 110 | 0.9% |
| 0.8496349 | 104 | 0.9% |
| 0.7658759 | 97 | 0.8% |
| 0.9860803 | 68 | 0.6% |
| 0.9621058 | 62 | 0.5% |
| 0.9805333 | 60 | 0.5% |
| Other values (10544) | 10926 | |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.026123213 | 1 | |
| 0.04204471 | 1 | |
| 0.0421607 | 1 | |
| 0.04882782 | 1 | |
| 0.04949934 | 1 | |
| 0.050134588 | 1 | |
| 0.050295223 | 1 | |
| 0.05053083 | 1 | |
| 0.050905466 | 1 | |
| 0.052496407 | 1 | |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3 | < 0.1% |
| 0.99996513 | 1 | < 0.1% |
| 0.99962157 | 1 | < 0.1% |
| 0.9984293 | 1 | < 0.1% |
| 0.9980843 | 34 | |
| 0.99800116 | 1 | < 0.1% |
| 0.99767536 | 1 | < 0.1% |
| 0.9959575 | 1 | < 0.1% |
| 0.99550843 | 1 | < 0.1% |
| 0.99498713 | 1 | < 0.1% |
et
Categorical
| Statistic | Value |
|---|---|
| Distinct | 7 |
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.9 KiB |

| Value | Count |
|---|---|
| Offensive | 5966 |
| Profanity | 3430 |
| Very offensive | 1138 |
| Neutral | 814 |
| Extremely offensive | 426 |
| Other values (2) | 226 |
Length
| Statistic | Value |
|---|---|
| Max length | 19 |
| Median length | 9 |
| Mean length | 9.6845 |
| Min length | 7 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 116214 |
| Distinct characters | 28 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
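As an illustration of how such character properties can be tallied, the sketch below uses Python's standard `unicodedata` module to count general categories over the seven labels of this variable, weighted by the label counts reported in its Common Values table. This is only a sketch of the idea, not necessarily how the profiling tool computes its figures; `label_counts`, `category_totals`, and `char_totals` are illustrative names.

```python
import unicodedata
from collections import Counter

# Label counts copied from the Common Values table of this variable.
label_counts = {
    "Offensive": 5966,
    "Profanity": 3430,
    "Very offensive": 1138,
    "Neutral": 814,
    "Extremely offensive": 426,
    "Unknown": 140,
    "Hate speech": 86,
}

category_totals = Counter()  # Unicode general category -> character count
char_totals = Counter()      # individual character -> count

for label, count in label_counts.items():
    for ch in label:
        # Every occurrence of the label contributes each of its characters once.
        category_totals[unicodedata.category(ch)] += count
        char_totals[ch] += count

total_characters = sum(char_totals.values())
mean_length = total_characters / sum(label_counts.values())

print(dict(category_totals))
print(total_characters, len(char_totals), mean_length)
```

Here `Ll`, `Lu`, and `Zs` are the Unicode general categories for lowercase letters, uppercase letters, and space separators.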
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | Offensive |
| 2nd row | Offensive |
| 3rd row | Very offensive |
| 4th row | Neutral |
| 5th row | Profanity |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Offensive | 5966 | |
| Profanity | 3430 | |
| Very offensive | 1138 | 9.5% |
| Neutral | 814 | 6.8% |
| Extremely offensive | 426 | 3.5% |
| Unknown | 140 | 1.2% |
| Hate speech | 86 | 0.7% |
Length
Histogram of lengths of the category
Category Frequency Plot
Words
| Value | Count | Frequency (%) |
|---|---|---|
| offensive | 7530 | |
| profanity | 3430 | |
| very | 1138 | 8.3% |
| neutral | 814 | 6.0% |
| extremely | 426 | 3.1% |
| unknown | 140 | 1.0% |
| hate | 86 | 0.6% |
| speech | 86 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| f | 18490 | |
| e | 18122 | |
| n | 11380 | |
| i | 10960 | |
| s | 7616 | 6.6% |
| v | 7530 | 6.5% |
| O | 5966 | 5.1% |
| r | 5808 | 5.0% |
| o | 5134 | 4.4% |
| y | 4994 | 4.3% |
| Other values (18) | 20214 |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 102564 | |
| Uppercase Letter | 12000 | 10.3% |
| Space Separator | 1650 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| f | 18490 | |
| e | 18122 | |
| n | 11380 | |
| i | 10960 | |
| s | 7616 | |
| v | 7530 | |
| r | 5808 | 5.7% |
| o | 5134 | 5.0% |
| y | 4994 | 4.9% |
| t | 4756 | 4.6% |
| Other values (10) | 7774 |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| O | 5966 | |
| P | 3430 | |
| V | 1138 | 9.5% |
| N | 814 | 6.8% |
| E | 426 | 3.5% |
| U | 140 | 1.2% |
| H | 86 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1650 | |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 114564 | |
| Common | 1650 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| f | 18490 | |
| e | 18122 | |
| n | 11380 | |
| i | 10960 | |
| s | 7616 | |
| v | 7530 | |
| O | 5966 | 5.2% |
| r | 5808 | 5.1% |
| o | 5134 | 4.5% |
| y | 4994 | 4.4% |
| Other values (17) | 18564 |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1650 | |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 116214 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| f | 18490 | |
| e | 18122 | |
| n | 11380 | |
| i | 10960 | |
| s | 7616 | 6.6% |
| v | 7530 | 6.5% |
| O | 5966 | 5.1% |
| r | 5808 | 5.0% |
| o | 5134 | 4.4% |
| y | 4994 | 4.3% |
| Other values (18) | 20214 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
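The calculation described here can be sketched in plain Python: rank both variables (giving tied values the average of their positions, which is an assumption about tie handling), then apply the covariance-over-standard-deviations formula to the ranks. The function names and the sample data are made up for illustration.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Pearson correlation computed on the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but nonlinear relationship

print(spearman(x, y))  # essentially 1: the ranks agree perfectly
print(pearson(x, y))   # noticeably below 1: the relationship is not linear
```

On this toy data the rank-based coefficient reaches its maximum while the linear coefficient does not, which is the distinction drawn in the text.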
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
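The invariance property can be checked numerically. The sketch below computes r directly from its definition and then applies a separate shift and positive rescaling to each variable. The sample values and scale factors are made up, and note that a negative scale factor would flip the sign of r rather than leave it unchanged.

```python
from math import sqrt

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Made-up sample values for two variables.
x = [0.59, 0.21, 0.47, 0.50, 0.39]
y = [0.78, 0.46, 0.93, 0.96, 0.46]

r = pearson(x, y)

# Change location and scale of each variable independently.
x_transformed = [3.0 * v + 10.0 for v in x]
y_transformed = [0.5 * v - 2.0 for v in y]
r_transformed = pearson(x_transformed, y_transformed)

print(r, r_transformed)  # the two values agree up to floating-point error
```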
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
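The pair-counting definition translates almost literally into code. The sketch below implements exactly the formula stated here, which is the variant usually called tau-a. Libraries often default to a tie-adjusted variant instead (for example, SciPy's `kendalltau` computes tau-b by default), so results can differ when the data contain ties. The function name and sample data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables order the pair the same way
        elif product < 0:
            discordant += 1   # the variables order the pair oppositely
        # product == 0 means a tie in x or y: counted in neither group
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

print(kendall_tau_a(x, y))
```

With four observations there are six pairs; five are concordant and one is discordant, so the result is (5 - 1) / 6.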
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display which lets you quickly pick out patterns in data completion.
First rows
| | fl | ia | s | t | e | ea | et |
|---|---|---|---|---|---|---|---|
| 0 | 0.593828 | 0.563516 | 0.849090 | 0.864632 | 0.777347 | 0.602494 | Offensive |
| 1 | 0.213193 | 0.407253 | 0.925010 | 0.856451 | 0.456983 | 0.592931 | Offensive |
| 2 | 0.474532 | 0.323574 | 0.710831 | 0.747318 | 0.933715 | 0.208848 | Very offensive |
| 3 | 0.503426 | 0.407557 | 0.796685 | 0.854638 | 0.955973 | 0.343336 | Neutral |
| 4 | 0.394807 | 0.170078 | 0.561849 | 0.766563 | 0.459300 | 0.223698 | Profanity |
| 5 | 0.277215 | 0.250670 | 0.867602 | 0.883861 | 0.355724 | 0.228288 | Profanity |
| 6 | 0.197042 | 0.388147 | 0.853236 | 0.779098 | 0.147911 | 0.188907 | Profanity |
| 7 | 0.384475 | 0.988942 | 0.975331 | 0.951647 | 0.772222 | 0.366735 | Extremely offensive |
| 8 | 0.110473 | 0.086410 | 0.452698 | 0.700786 | 0.177249 | 0.108329 | Profanity |
| 9 | 0.388394 | 0.243946 | 0.679047 | 0.747318 | 0.591966 | 0.246288 | Profanity |
Last rows
| | fl | ia | s | t | e | ea | et |
|---|---|---|---|---|---|---|---|
| 11990 | 0.489507 | 0.964097 | 0.960558 | 0.945009 | 0.772222 | 0.472298 | Offensive |
| 11991 | 0.241716 | 0.521499 | 0.976476 | 0.945009 | 0.646652 | 0.380112 | Offensive |
| 11992 | 0.372119 | 0.606927 | 0.981145 | 0.942245 | 0.652246 | 0.363261 | Offensive |
| 11993 | 0.465192 | 0.634209 | 0.965800 | 0.945040 | 0.772222 | 0.648024 | Offensive |
| 11994 | 0.470747 | 0.536505 | 0.936827 | 0.945009 | 0.838113 | 0.496711 | Offensive |
| 11995 | 0.543366 | 0.684481 | 0.951176 | 0.945009 | 0.878651 | 0.765876 | Offensive |
| 11996 | 0.843702 | 0.877267 | 0.933437 | 0.945656 | 0.980018 | 0.476850 | Very offensive |
| 11997 | 0.372335 | 0.979884 | 0.963483 | 0.945009 | 0.685437 | 0.308390 | Very offensive |
| 11998 | 0.747068 | 0.906385 | 0.953521 | 0.945589 | 0.987091 | 0.926245 | Very offensive |
| 11999 | 0.640017 | 0.750959 | 0.934347 | 0.945009 | 0.902304 | 0.692056 | Offensive |
Most frequently occurring
| | fl | ia | s | t | e | ea | et | # duplicates |
|---|---|---|---|---|---|---|---|---|
| 51 | 0.477608 | 0.498115 | 0.824878 | 0.821408 | 0.868834 | 0.203815 | Profanity | 8 |
| 25 | 0.414350 | 0.492813 | 0.831883 | 0.822128 | 0.788562 | 0.199998 | Profanity | 6 |
| 26 | 0.414916 | 0.428946 | 0.824859 | 0.816635 | 0.772222 | 0.153293 | Offensive | 6 |
| 29 | 0.418400 | 0.532076 | 0.839415 | 0.828528 | 0.774779 | 0.202531 | Profanity | 6 |
| 33 | 0.431192 | 0.593916 | 0.896435 | 0.866037 | 0.799199 | 0.248875 | Profanity | 6 |
| 38 | 0.441925 | 0.606927 | 0.867102 | 0.854638 | 0.833085 | 0.214615 | Profanity | 6 |
| 40 | 0.449371 | 0.571334 | 0.853236 | 0.854638 | 0.837973 | 0.215436 | Profanity | 6 |
| 27 | 0.414916 | 0.428946 | 0.824859 | 0.816635 | 0.772222 | 0.153293 | Profanity | 5 |
| 22 | 0.403791 | 0.522775 | 0.853236 | 0.837239 | 0.767377 | 0.203527 | Offensive | 4 |
| 24 | 0.414350 | 0.492813 | 0.831883 | 0.822128 | 0.788562 | 0.199998 | Offensive | 4 |