Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 12000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 65 |
Duplicate rows (%) | 0.5% |
Total size in memory | 656.4 KiB |
Average record size in memory | 56.0 B |
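The two derived rows in the table above follow from the raw counts. A minimal sketch of the arithmetic, assuming the percentage is taken over all observations and that 1 KiB is 1024 bytes (the variable names are illustrative only):

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 12000
duplicate_rows = 65
total_size_kib = 656.4

# Share of duplicate rows among all observations.
duplicate_pct = 100 * duplicate_rows / n_rows

# Average record size in bytes (1 KiB = 1024 B).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the table (0.5% and 56.0 B).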
Variable types
Numeric | 6 |
---|---|
Categorical | 1 |
Dataset has 65 (0.5%) duplicate rows | Duplicates |
fl is highly correlated with e | High correlation |
ia is highly correlated with s and 1 other field | High correlation |
s is highly correlated with ia and 1 other field | High correlation |
t is highly correlated with ia and 1 other field | High correlation |
e is highly correlated with fl | High correlation |
ia is highly correlated with s | High correlation |
t is highly correlated with s | High correlation |
ia is highly correlated with s and 2 other fields | High correlation |
t is highly correlated with ia and 2 other fields | High correlation |
et is highly correlated with ia and 1 other field | High correlation |
Reproduction
Analysis started | 2022-06-14 18:59:43.124639 |
---|---|
Analysis finished | 2022-06-14 18:59:49.642948 |
Duration | 6.52 seconds |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
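The reported duration is the difference between the two timestamps above. A small sketch using only the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2022-06-14 18:59:43.124639")
finished = datetime.fromisoformat("2022-06-14 18:59:49.642948")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```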
fl
Real number (ℝ≥0)
Distinct | 11684 |
---|---|
Distinct (%) | 97.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.4168125574 |
Minimum | 0.029390577 |
---|---|
Maximum | 0.9492131 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 93.9 KiB |
Quantile statistics
Minimum | 0.029390577 |
---|---|
5-th percentile | 0.1360574175 |
Q1 | 0.2850291325 |
median | 0.40379098 |
Q3 | 0.5011375575 |
95-th percentile | 0.786797873 |
Maximum | 0.9492131 |
Range | 0.919822523 |
Interquartile range (IQR) | 0.216108425 |
Descriptive statistics
Standard deviation | 0.1852367088 |
---|---|
Coefficient of variation (CV) | 0.4444124954 |
Kurtosis | -0.09893912846 |
Mean | 0.4168125574 |
Median Absolute Deviation (MAD) | 0.10433418 |
Skewness | 0.4737160247 |
Sum | 5001.750689 |
Variance | 0.03431263827 |
Monotonicity | Not monotonic |
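Several rows in the quantile and descriptive tables above are derived from a few base statistics. The sketch below recomputes them from the reported mean, standard deviation, extremes and quartiles. The dictionary keys mirror the table labels, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Base statistics copied from the tables for this variable.
n = 12000
mean = 0.4168125574
std = 0.1852367088
minimum, maximum = 0.029390577, 0.9492131
q1, q3 = 0.2850291325, 0.5011375575

derived = {
    "Range": maximum - minimum,                # max - min
    "Interquartile range (IQR)": q3 - q1,      # Q3 - Q1
    "Coefficient of variation (CV)": std / mean,
    "Variance": std ** 2,                      # square of the standard deviation
    "Sum": mean * n,                           # mean times number of observations
}

for name, value in derived.items():
    print(f"{name}: {value:.9f}")
```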
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.8437023 | 88 | 0.7% |
0.90289146 | 40 | 0.3% |
0.94800824 | 13 | 0.1% |
0.4149162 | 11 | 0.1% |
0.4143504 | 11 | 0.1% |
0.4311919 | 10 | 0.1% |
0.44937053 | 10 | 0.1% |
0.47760788 | 10 | 0.1% |
0.40379098 | 8 | 0.1% |
0.41839996 | 8 | 0.1% |
Other values (11674) | 11791 |
Value | Count | Frequency (%) |
0.029390577 | 1 | |
0.029955925 | 1 | |
0.033604544 | 1 | |
0.033606403 | 1 | |
0.034301028 | 1 | |
0.036454223 | 1 | |
0.038488213 | 1 | |
0.03961533 | 1 | |
0.040670637 | 1 | |
0.041285664 | 1 |
Value | Count | Frequency (%) |
0.9492131 | 2 | < 0.1% |
0.94800824 | 13 | |
0.9440342 | 1 | < 0.1% |
0.94353414 | 1 | < 0.1% |
0.9394918 | 1 | < 0.1% |
0.92673665 | 1 | < 0.1% |
0.9119069 | 1 | < 0.1% |
0.90620583 | 1 | < 0.1% |
0.9056027 | 1 | < 0.1% |
0.90545267 | 1 | < 0.1% |
ia
Real number (ℝ≥0)
Distinct | 10409 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.4382163503 |
Minimum | 0.037709847 |
---|---|
Maximum | 0.99387753 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 93.9 KiB |
Quantile statistics
Minimum | 0.037709847 |
---|---|
5-th percentile | 0.11280035 |
Q1 | 0.23262316 |
median | 0.35341949 |
Q3 | 0.6069273 |
95-th percentile | 0.9490587 |
Maximum | 0.99387753 |
Range | 0.956167683 |
Interquartile range (IQR) | 0.37430414 |
Descriptive statistics
Standard deviation | 0.2662644614 |
---|---|
Coefficient of variation (CV) | 0.6076096003 |
Kurtosis | -0.7448420262 |
Mean | 0.4382163503 |
Median Absolute Deviation (MAD) | 0.17451961 |
Skewness | 0.6597911337 |
Sum | 5258.596203 |
Variance | 0.07089676341 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.6069273 | 165 | 1.4% |
0.9490587 | 158 | 1.3% |
0.9149589 | 136 | 1.1% |
0.6342086 | 124 | 1.0% |
0.9340332 | 85 | 0.7% |
0.9586736 | 80 | 0.7% |
0.81136084 | 77 | 0.6% |
0.6894879 | 73 | 0.6% |
0.90638536 | 73 | 0.6% |
0.8772666 | 60 | 0.5% |
Other values (10399) | 10969 |
Value | Count | Frequency (%) |
0.037709847 | 1 | |
0.044970874 | 1 | |
0.04508064 | 1 | |
0.045482233 | 1 | |
0.049547765 | 1 | |
0.05189743 | 1 | |
0.052189957 | 1 | |
0.05362984 | 1 | |
0.054279275 | 1 | |
0.055933435 | 1 |
Value | Count | Frequency (%) |
0.99387753 | 3 | |
0.9930588 | 1 | < 0.1% |
0.99256897 | 1 | < 0.1% |
0.9921237 | 1 | < 0.1% |
0.99065804 | 1 | < 0.1% |
0.99036723 | 1 | < 0.1% |
0.99019986 | 1 | < 0.1% |
0.9901501 | 1 | < 0.1% |
0.9900025 | 1 | < 0.1% |
0.98979497 | 1 | < 0.1% |
s
Real number (ℝ≥0)
Distinct | 7420 |
---|---|
Distinct (%) | 61.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.801580311 |
Minimum | 0.024430014 |
---|---|
Maximum | 0.9943364 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 93.9 KiB |
Quantile statistics
Minimum | 0.024430014 |
---|---|
5-th percentile | 0.4998674795 |
Q1 | 0.699709325 |
median | 0.8435214 |
Q3 | 0.93682694 |
95-th percentile | 0.97533107 |
Maximum | 0.9943364 |
Range | 0.969906386 |
Interquartile range (IQR) | 0.237117615 |
Descriptive statistics
Standard deviation | 0.1620618911 |
---|---|
Coefficient of variation (CV) | 0.2021779838 |
Kurtosis | 0.3933632595 |
Mean | 0.801580311 |
Median Absolute Deviation (MAD) | 0.1099997 |
Skewness | -0.978641045 |
Sum | 9618.963732 |
Variance | 0.02626405655 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.9605577 | 349 | 2.9% |
0.9535211 | 341 | 2.8% |
0.9658004 | 317 | 2.6% |
0.9076751 | 311 | 2.6% |
0.93682694 | 283 | 2.4% |
0.9098734 | 278 | 2.3% |
0.97533107 | 278 | 2.3% |
0.9811452 | 263 | 2.2% |
0.72723687 | 191 | 1.6% |
0.59276253 | 187 | 1.6% |
Other values (7410) | 9202 |
Value | Count | Frequency (%) |
0.024430014 | 1 | |
0.06102065 | 1 | |
0.071603596 | 1 | |
0.07817141 | 1 | |
0.0788923 | 1 | |
0.08381921 | 1 | |
0.1061011 | 1 | |
0.11373025 | 1 | |
0.11440568 | 1 | |
0.11515171 | 1 |
Value | Count | Frequency (%) |
0.9943364 | 1 | < 0.1% |
0.99291486 | 1 | < 0.1% |
0.9856039 | 1 | < 0.1% |
0.9854358 | 69 | |
0.9854041 | 1 | < 0.1% |
0.98533255 | 1 | < 0.1% |
0.98518544 | 1 | < 0.1% |
0.98511934 | 1 | < 0.1% |
0.98511195 | 1 | < 0.1% |
0.9850903 | 1 | < 0.1% |
t
Real number (ℝ≥0)
Distinct | 7967 |
---|---|
Distinct (%) | 66.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.8218832335 |
Minimum | 0.024729094 |
---|---|
Maximum | 0.984462 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 93.9 KiB |
Quantile statistics
Minimum | 0.024729094 |
---|---|
5-th percentile | 0.70857113 |
Q1 | 0.7473183 |
median | 0.8214078 |
Q3 | 0.89414346 |
95-th percentile | 0.95164716 |
Maximum | 0.984462 |
Range | 0.959732906 |
Interquartile range (IQR) | 0.14682516 |
Descriptive statistics
Standard deviation | 0.09560208289 |
---|---|
Coefficient of variation (CV) | 0.1163207607 |
Kurtosis | 7.324972053 |
Mean | 0.8218832335 |
Median Absolute Deviation (MAD) | 0.0740895 |
Skewness | -1.288609868 |
Sum | 9862.598802 |
Variance | 0.009139758252 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.7473183 | 782 | 6.5% |
0.8214078 | 521 | 4.3% |
0.95164716 | 455 | 3.8% |
0.9450092 | 453 | 3.8% |
0.8546382 | 410 | 3.4% |
0.86612374 | 292 | 2.4% |
0.9456558 | 280 | 2.3% |
0.88678163 | 256 | 2.1% |
0.91536796 | 232 | 1.9% |
0.89414346 | 180 | 1.5% |
Other values (7957) | 8139 |
Value | Count | Frequency (%) |
0.024729094 | 1 | |
0.037766483 | 1 | |
0.052537628 | 1 | |
0.06294791 | 1 | |
0.079592496 | 1 | |
0.08782799 | 1 | |
0.106605686 | 1 | |
0.12488287 | 1 | |
0.12789373 | 1 | |
0.13427275 | 1 |
Value | Count | Frequency (%) |
0.984462 | 4 | |
0.9810698 | 1 | < 0.1% |
0.9798555 | 1 | < 0.1% |
0.97813374 | 1 | < 0.1% |
0.9753948 | 1 | < 0.1% |
0.9749705 | 1 | < 0.1% |
0.9745511 | 1 | < 0.1% |
0.9736628 | 1 | < 0.1% |
0.97301006 | 1 | < 0.1% |
0.97179496 | 1 | < 0.1% |
e
Real number (ℝ≥0)
Distinct | 8215 |
---|---|
Distinct (%) | 68.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5565736624 |
Minimum | 0.017585102 |
---|---|
Maximum | 1 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 93.9 KiB |
Quantile statistics
Minimum | 0.017585102 |
---|---|
5-th percentile | 0.1259514565 |
Q1 | 0.3056789975 |
median | 0.5481356 |
Q3 | 0.820784365 |
95-th percentile | 0.97691213 |
Maximum | 1 |
Range | 0.982414898 |
Interquartile range (IQR) | 0.5151053675 |
Descriptive statistics
Standard deviation | 0.2865405723 |
---|---|
Coefficient of variation (CV) | 0.5148295574 |
Kurtosis | -1.294983056 |
Mean | 0.5565736624 |
Median Absolute Deviation (MAD) | 0.2510637 |
Skewness | -0.03276281222 |
Sum | 6678.883949 |
Variance | 0.08210549957 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.6522458 | 815 | 6.8% |
0.51694775 | 336 | 2.8% |
0.9770172 | 295 | 2.5% |
0.7991993 | 247 | 2.1% |
0.7722222 | 219 | 1.8% |
0.87872106 | 200 | 1.7% |
0.95597285 | 180 | 1.5% |
0.8688342 | 165 | 1.4% |
0.90230405 | 152 | 1.3% |
0.69036824 | 133 | 1.1% |
Other values (8205) | 9258 |
Value | Count | Frequency (%) |
0.017585102 | 1 | |
0.018334996 | 1 | |
0.018437412 | 1 | |
0.018739773 | 1 | |
0.018762376 | 1 | |
0.02262627 | 1 | |
0.025222166 | 1 | |
0.026389748 | 1 | |
0.027400458 | 1 | |
0.029437775 | 1 |
Value | Count | Frequency (%) |
1 | 2 | < 0.1% |
0.9996084 | 1 | < 0.1% |
0.99830574 | 1 | < 0.1% |
0.99696827 | 1 | < 0.1% |
0.9960866 | 1 | < 0.1% |
0.9951251 | 18 | |
0.9950257 | 1 | < 0.1% |
0.99496084 | 1 | < 0.1% |
0.99490833 | 1 | < 0.1% |
0.99471945 | 1 | < 0.1% |
ea
Real number (ℝ≥0)
Distinct | 10554 |
---|---|
Distinct (%) | 87.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.4052619229 |
Minimum | 0.026123213 |
---|---|
Maximum | 1 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 93.9 KiB |
Quantile statistics
Minimum | 0.026123213 |
---|---|
5-th percentile | 0.141569518 |
Q1 | 0.2243731675 |
median | 0.30714799 |
Q3 | 0.4974262425 |
95-th percentile | 0.975200513 |
Maximum | 1 |
Range | 0.973876787 |
Interquartile range (IQR) | 0.273053075 |
Descriptive statistics
Standard deviation | 0.2566860404 |
---|---|
Coefficient of variation (CV) | 0.6333830689 |
Kurtosis | 0.0605287453 |
Mean | 0.4052619229 |
Median Absolute Deviation (MAD) | 0.0987772 |
Skewness | 1.146051382 |
Sum | 4863.143075 |
Variance | 0.06588772336 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.9083624 | 159 | 1.3% |
0.9456398 | 146 | 1.2% |
0.9800452 | 143 | 1.2% |
0.9262451 | 125 | 1.0% |
0.80709213 | 110 | 0.9% |
0.8496349 | 104 | 0.9% |
0.7658759 | 97 | 0.8% |
0.9860803 | 68 | 0.6% |
0.9621058 | 62 | 0.5% |
0.9805333 | 60 | 0.5% |
Other values (10544) | 10926 |
Value | Count | Frequency (%) |
0.026123213 | 1 | |
0.04204471 | 1 | |
0.0421607 | 1 | |
0.04882782 | 1 | |
0.04949934 | 1 | |
0.050134588 | 1 | |
0.050295223 | 1 | |
0.05053083 | 1 | |
0.050905466 | 1 | |
0.052496407 | 1 |
Value | Count | Frequency (%) |
1 | 3 | < 0.1% |
0.99996513 | 1 | < 0.1% |
0.99962157 | 1 | < 0.1% |
0.9984293 | 1 | < 0.1% |
0.9980843 | 34 | |
0.99800116 | 1 | < 0.1% |
0.99767536 | 1 | < 0.1% |
0.9959575 | 1 | < 0.1% |
0.99550843 | 1 | < 0.1% |
0.99498713 | 1 | < 0.1% |
et
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 93.9 KiB |
Offensive | 5966 |
---|---|
Profanity | 3430 |
Very offensive | 1138 |
Neutral | 814 |
Extremely offensive | 426 |
Other values (2) | 226 |
Length
Max length | 19 |
---|---|
Median length | 9 |
Mean length | 9.6845 |
Min length | 7 |
Characters and Unicode
Total characters | 116214 |
---|---|
Distinct characters | 28 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Offensive |
---|---|
2nd row | Offensive |
3rd row | Very offensive |
4th row | Neutral |
5th row | Profanity |
Common Values
Value | Count | Frequency (%) |
Offensive | 5966 | 49.7% |
Profanity | 3430 | 28.6% |
Very offensive | 1138 | 9.5% |
Neutral | 814 | 6.8% |
Extremely offensive | 426 | 3.5% |
Unknown | 140 | 1.2% |
Hate speech | 86 | 0.7% |
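The length and character statistics reported for this variable can be cross-checked against the category counts in the table above. A minimal sketch, assuming the labels are stored exactly as printed (the dictionary below is transcribed from the table):

```python
from statistics import median

# Category counts transcribed from the "Common Values" table.
counts = {
    "Offensive": 5966,
    "Profanity": 3430,
    "Very offensive": 1138,
    "Neutral": 814,
    "Extremely offensive": 426,
    "Unknown": 140,
    "Hate speech": 86,
}

# One length per observation, so the statistics are weighted by frequency.
lengths = [len(label) for label, count in counts.items() for _ in range(count)]

total_characters = sum(lengths)
mean_length = total_characters / len(lengths)
spaces = sum(label.count(" ") * count for label, count in counts.items())
distinct_characters = len(set("".join(counts)))

print("Total characters:", total_characters)
print("Mean length:", mean_length)
print("Median length:", median(lengths))
print("Min / max length:", min(lengths), max(lengths))
print("Space separators:", spaces)
print("Distinct characters:", distinct_characters)
```

The totals agree with the report: 116214 characters, a mean length of 9.6845, 1650 spaces and 28 distinct characters.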
Length
Histogram of lengths of the category
Category Frequency Plot
Value | Count | Frequency (%) |
offensive | 7530 | 55.2% |
profanity | 3430 | 25.1% |
very | 1138 | 8.3% |
neutral | 814 | 6.0% |
extremely | 426 | 3.1% |
unknown | 140 | 1.0% |
hate | 86 | 0.6% |
speech | 86 | 0.6% |
Most occurring characters
Value | Count | Frequency (%) |
f | 18490 | 15.9% |
e | 18122 | 15.6% |
n | 11380 | 9.8% |
i | 10960 | 9.4% |
s | 7616 | 6.6% |
v | 7530 | 6.5% |
O | 5966 | 5.1% |
r | 5808 | 5.0% |
o | 5134 | 4.4% |
y | 4994 | 4.3% |
Other values (18) | 20214 | 17.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 102564 | 88.3% |
Uppercase Letter | 12000 | 10.3% |
Space Separator | 1650 | 1.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
f | 18490 | 18.0% |
e | 18122 | 17.7% |
n | 11380 | 11.1% |
i | 10960 | 10.7% |
s | 7616 | 7.4% |
v | 7530 | 7.3% |
r | 5808 | 5.7% |
o | 5134 | 5.0% |
y | 4994 | 4.9% |
t | 4756 | 4.6% |
Other values (10) | 7774 | 7.6% |
Uppercase Letter
Value | Count | Frequency (%) |
O | 5966 | 49.7% |
P | 3430 | 28.6% |
V | 1138 | 9.5% |
N | 814 | 6.8% |
E | 426 | 3.5% |
U | 140 | 1.2% |
H | 86 | 0.7% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1650 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 114564 | 98.6% |
Common | 1650 | 1.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
f | 18490 | 16.1% |
e | 18122 | 15.8% |
n | 11380 | 9.9% |
i | 10960 | 9.6% |
s | 7616 | 6.6% |
v | 7530 | 6.6% |
O | 5966 | 5.2% |
r | 5808 | 5.1% |
o | 5134 | 4.5% |
y | 4994 | 4.4% |
Other values (17) | 18564 | 16.2% |
Common
Value | Count | Frequency (%) |
(space) | 1650 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 116214 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
f | 18490 | 15.9% |
e | 18122 | 15.6% |
n | 11380 | 9.8% |
i | 10960 | 9.4% |
s | 7616 | 6.6% |
v | 7530 | 6.5% |
O | 5966 | 5.1% |
r | 5808 | 5.0% |
o | 5134 | 4.4% |
y | 4994 | 4.3% |
Other values (18) | 20214 | 17.4% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, so it catches nonlinear monotonic relationships better than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, divide the covariance of the rank variables of X and Y by the product of their standard deviations.
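The rank-based definition above can be sketched in a few lines of plain Python. This is an illustration, not the report's implementation: the helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, ties are assumed to receive the average of their rank positions, and the sample values are made up rather than taken from the dataset.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared_rank
        start = end + 1
    return ranks


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Made-up values: y is a nonlinear but strictly increasing function of x.
x = [0.2, 0.5, 0.1, 0.9, 0.7]
y = [v ** 3 for v in x]

print("Spearman:", spearman_rho(x, y))
print("Pearson: ", pearson_r(x, y))
```

Because y increases whenever x does, the ranks coincide and ρ comes out at 1 (up to floating-point error), while Pearson's r on the raw values stays below 1.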
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only the direction of the slope sets its sign. To calculate r for two variables X and Y, divide the covariance of X and Y by the product of their standard deviations.
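A minimal sketch of that calculation, with a check of the location and scale invariance. The function name `pearson_r` and the sample values are illustrative assumptions, not part of the report.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


# Made-up values, not rows from the dataset.
x = [0.2, 0.5, 0.1, 0.9, 0.7]
y = [0.3, 0.4, 0.2, 0.8, 0.9]

r_original = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_shifted = [10 * a - 4 for a in x]
y_shifted = [0.5 * b + 2 for b in y]
r_transformed = pearson_r(x_shifted, y_shifted)

print(r_original, r_transformed)
```

The two printed values agree up to floating-point error. Multiplying one variable by a negative factor would flip the sign of r without changing its magnitude.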
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, count the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
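The pair-counting definition above corresponds to the simplest form of the coefficient, often called τ-a. The sketch below implements exactly that definition. The report's own computation may use a tie-corrected variant, so treat the function `kendall_tau_a` as an illustration under that assumption.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Three observations: two concordant pairs and one discordant pair.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```

With two concordant pairs and one discordant pair out of three, τ is (2 - 1) / 3.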
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
index | fl | ia | s | t | e | ea | et |
---|---|---|---|---|---|---|---|
0 | 0.593828 | 0.563516 | 0.849090 | 0.864632 | 0.777347 | 0.602494 | Offensive |
1 | 0.213193 | 0.407253 | 0.925010 | 0.856451 | 0.456983 | 0.592931 | Offensive |
2 | 0.474532 | 0.323574 | 0.710831 | 0.747318 | 0.933715 | 0.208848 | Very offensive |
3 | 0.503426 | 0.407557 | 0.796685 | 0.854638 | 0.955973 | 0.343336 | Neutral |
4 | 0.394807 | 0.170078 | 0.561849 | 0.766563 | 0.459300 | 0.223698 | Profanity |
5 | 0.277215 | 0.250670 | 0.867602 | 0.883861 | 0.355724 | 0.228288 | Profanity |
6 | 0.197042 | 0.388147 | 0.853236 | 0.779098 | 0.147911 | 0.188907 | Profanity |
7 | 0.384475 | 0.988942 | 0.975331 | 0.951647 | 0.772222 | 0.366735 | Extremely offensive |
8 | 0.110473 | 0.086410 | 0.452698 | 0.700786 | 0.177249 | 0.108329 | Profanity |
9 | 0.388394 | 0.243946 | 0.679047 | 0.747318 | 0.591966 | 0.246288 | Profanity |
Last rows
index | fl | ia | s | t | e | ea | et |
---|---|---|---|---|---|---|---|
11990 | 0.489507 | 0.964097 | 0.960558 | 0.945009 | 0.772222 | 0.472298 | Offensive |
11991 | 0.241716 | 0.521499 | 0.976476 | 0.945009 | 0.646652 | 0.380112 | Offensive |
11992 | 0.372119 | 0.606927 | 0.981145 | 0.942245 | 0.652246 | 0.363261 | Offensive |
11993 | 0.465192 | 0.634209 | 0.965800 | 0.945040 | 0.772222 | 0.648024 | Offensive |
11994 | 0.470747 | 0.536505 | 0.936827 | 0.945009 | 0.838113 | 0.496711 | Offensive |
11995 | 0.543366 | 0.684481 | 0.951176 | 0.945009 | 0.878651 | 0.765876 | Offensive |
11996 | 0.843702 | 0.877267 | 0.933437 | 0.945656 | 0.980018 | 0.476850 | Very offensive |
11997 | 0.372335 | 0.979884 | 0.963483 | 0.945009 | 0.685437 | 0.308390 | Very offensive |
11998 | 0.747068 | 0.906385 | 0.953521 | 0.945589 | 0.987091 | 0.926245 | Very offensive |
11999 | 0.640017 | 0.750959 | 0.934347 | 0.945009 | 0.902304 | 0.692056 | Offensive |
Most frequently occurring
index | fl | ia | s | t | e | ea | et | # duplicates |
---|---|---|---|---|---|---|---|---|
51 | 0.477608 | 0.498115 | 0.824878 | 0.821408 | 0.868834 | 0.203815 | Profanity | 8 |
25 | 0.414350 | 0.492813 | 0.831883 | 0.822128 | 0.788562 | 0.199998 | Profanity | 6 |
26 | 0.414916 | 0.428946 | 0.824859 | 0.816635 | 0.772222 | 0.153293 | Offensive | 6 |
29 | 0.418400 | 0.532076 | 0.839415 | 0.828528 | 0.774779 | 0.202531 | Profanity | 6 |
33 | 0.431192 | 0.593916 | 0.896435 | 0.866037 | 0.799199 | 0.248875 | Profanity | 6 |
38 | 0.441925 | 0.606927 | 0.867102 | 0.854638 | 0.833085 | 0.214615 | Profanity | 6 |
40 | 0.449371 | 0.571334 | 0.853236 | 0.854638 | 0.837973 | 0.215436 | Profanity | 6 |
27 | 0.414916 | 0.428946 | 0.824859 | 0.816635 | 0.772222 | 0.153293 | Profanity | 5 |
22 | 0.403791 | 0.522775 | 0.853236 | 0.837239 | 0.767377 | 0.203527 | Offensive | 4 |
24 | 0.414350 | 0.492813 | 0.831883 | 0.822128 | 0.788562 | 0.199998 | Offensive | 4 |
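A table like the one above can be built by counting how often each complete row occurs. The sketch below does this with the standard library on a handful of made-up, shortened rows. It is not the report's own code, and how the report aggregates these counts into its headline duplicate figure is not stated here.

```python
from collections import Counter

# Made-up, shortened rows (two numeric fields plus the category).
rows = [
    (0.414916, 0.428946, "Offensive"),
    (0.414916, 0.428946, "Offensive"),
    (0.414916, 0.428946, "Profanity"),
    (0.477608, 0.498115, "Profanity"),
    (0.477608, 0.498115, "Profanity"),
    (0.477608, 0.498115, "Profanity"),
    (0.593828, 0.563516, "Offensive"),
]

# Count occurrences of each distinct row; rows must match in every column.
occurrences = Counter(rows)

# Keep only rows that appear more than once, most frequent first.
repeated = [(row, n) for row, n in occurrences.most_common() if n > 1]

for row, n in repeated:
    print(row, n)
```

Rows that share every numeric value but differ in the category are counted separately, which is why near-identical entries can appear on separate lines of the table with their own counts.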