Overview

Dataset statistics

Number of variables6
Number of observations865
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory190.1 KiB
Average record size in memory225.1 B

Variable types

Categorical3
Numeric3

Alerts

Code has a high cardinality: 865 distinct valuesHigh cardinality
Name has a high cardinality: 865 distinct valuesHigh cardinality
Hex has a high cardinality: 765 distinct valuesHigh cardinality
Code is uniformly distributedUniform
Name is uniformly distributedUniform
Hex is uniformly distributedUniform
Code has unique valuesUnique
Name has unique valuesUnique
R has 81 (9.4%) zerosZeros
G has 58 (6.7%) zerosZeros
B has 80 (9.2%) zerosZeros

Reproduction

Analysis started2023-01-25 14:29:41.000604
Analysis finished2023-01-25 14:29:42.658288
Duration1.66 second
Software versionpandas-profiling v0.0.dev0
Download configurationconfig.json

Variables

Code
Categorical

HIGH CARDINALITY  UNIFORM  UNIQUE 

Distinct865
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size57.9 KiB
air_force_blue_raf
 
1
pale_taupe
 
1
pale_gold
 
1
pale_goldenrod
 
1
pale_green
 
1
Other values (860)
860 

Length

Max length39
Median length26
Mean length11.375723
Min length3

Characters and Unicode

Total characters9840
Distinct characters31
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique865 ?
Unique (%)100.0%

Sample

1st rowair_force_blue_raf
2nd rowair_force_blue_usaf
3rd rowair_superiority_blue
4th rowalabama_crimson
5th rowalice_blue

Common Values

ValueCountFrequency (%)
air_force_blue_raf 1
 
0.1%
pale_taupe 1
 
0.1%
pale_gold 1
 
0.1%
pale_goldenrod 1
 
0.1%
pale_green 1
 
0.1%
pale_lavender 1
 
0.1%
pale_magenta 1
 
0.1%
pale_pink 1
 
0.1%
pale_plum 1
 
0.1%
pale_red_violet 1
 
0.1%
Other values (855) 855
98.8%

Length

2023-01-25T14:29:42.843182image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
air_force_blue_raf 1
 
0.1%
amethyst 1
 
0.1%
arsenic 1
 
0.1%
air_superiority_blue 1
 
0.1%
alabama_crimson 1
 
0.1%
alice_blue 1
 
0.1%
alizarin_crimson 1
 
0.1%
alloy_orange 1
 
0.1%
almond 1
 
0.1%
amaranth 1
 
0.1%
Other values (855) 855
98.8%

Most occurring characters

ValueCountFrequency (%)
e 1201
 
12.2%
_ 799
 
8.1%
r 796
 
8.1%
a 788
 
8.0%
l 695
 
7.1%
n 626
 
6.4%
i 558
 
5.7%
o 519
 
5.3%
t 396
 
4.0%
u 373
 
3.8%
Other values (21) 3089
31.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9025
91.7%
Connector Punctuation 799
 
8.1%
Decimal Number 16
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1201
13.3%
r 796
 
8.8%
a 788
 
8.7%
l 695
 
7.7%
n 626
 
6.9%
i 558
 
6.2%
o 519
 
5.8%
t 396
 
4.4%
u 373
 
4.1%
s 343
 
3.8%
Other values (16) 2730
30.2%
Decimal Number
ValueCountFrequency (%)
1 13
81.2%
7 1
 
6.2%
3 1
 
6.2%
9 1
 
6.2%
Connector Punctuation
ValueCountFrequency (%)
_ 799
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9025
91.7%
Common 815
 
8.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1201
13.3%
r 796
 
8.8%
a 788
 
8.7%
l 695
 
7.7%
n 626
 
6.9%
i 558
 
6.2%
o 519
 
5.8%
t 396
 
4.4%
u 373
 
4.1%
s 343
 
3.8%
Other values (16) 2730
30.2%
Common
ValueCountFrequency (%)
_ 799
98.0%
1 13
 
1.6%
7 1
 
0.1%
3 1
 
0.1%
9 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9840
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 1201
 
12.2%
_ 799
 
8.1%
r 796
 
8.1%
a 788
 
8.0%
l 695
 
7.1%
n 626
 
6.4%
i 558
 
5.7%
o 519
 
5.3%
t 396
 
4.0%
u 373
 
3.8%
Other values (21) 3089
31.4%

Name
Categorical

HIGH CARDINALITY  UNIFORM  UNIQUE 

Distinct865
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size58.2 KiB
Air Force Blue (Raf)
 
1
Pale Taupe
 
1
Pale Gold
 
1
Pale Goldenrod
 
1
Pale Green
 
1
Other values (860)
860 

Length

Max length41
Median length28
Mean length11.591908
Min length3

Characters and Unicode

Total characters10027
Distinct characters69
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique865 ?
Unique (%)100.0%

Sample

1st rowAir Force Blue (Raf)
2nd rowAir Force Blue (Usaf)
3rd rowAir Superiority Blue
4th rowAlabama Crimson
5th rowAlice Blue

Common Values

ValueCountFrequency (%)
Air Force Blue (Raf) 1
 
0.1%
Pale Taupe 1
 
0.1%
Pale Gold 1
 
0.1%
Pale Goldenrod 1
 
0.1%
Pale Green 1
 
0.1%
Pale Lavender 1
 
0.1%
Pale Magenta 1
 
0.1%
Pale Pink 1
 
0.1%
Pale Plum 1
 
0.1%
Pale Red-Violet 1
 
0.1%
Other values (855) 855
98.8%

Length

2023-01-25T14:29:42.999811image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
blue 98
 
6.0%
green 78
 
4.8%
pink 47
 
2.9%
dark 45
 
2.8%
red 42
 
2.6%
yellow 31
 
1.9%
rose 28
 
1.7%
light 25
 
1.5%
lavender 23
 
1.4%
orange 23
 
1.4%
Other values (606) 1190
73.0%

Most occurring characters

ValueCountFrequency (%)
e 1168
 
11.6%
765
 
7.6%
a 737
 
7.4%
r 661
 
6.6%
l 611
 
6.1%
n 609
 
6.1%
i 536
 
5.3%
o 463
 
4.6%
u 345
 
3.4%
t 328
 
3.3%
Other values (59) 3804
37.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7369
73.5%
Uppercase Letter 1661
 
16.6%
Space Separator 765
 
7.6%
Open Punctuation 89
 
0.9%
Close Punctuation 89
 
0.9%
Dash Punctuation 20
 
0.2%
Other Punctuation 17
 
0.2%
Decimal Number 16
 
0.2%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1168
15.9%
a 737
10.0%
r 661
 
9.0%
l 611
 
8.3%
n 609
 
8.3%
i 536
 
7.3%
o 463
 
6.3%
u 345
 
4.7%
t 328
 
4.5%
d 251
 
3.4%
Other values (19) 1660
22.5%
Uppercase Letter
ValueCountFrequency (%)
B 206
12.4%
P 174
10.5%
C 158
 
9.5%
G 140
 
8.4%
R 135
 
8.1%
M 95
 
5.7%
S 93
 
5.6%
D 90
 
5.4%
L 84
 
5.1%
T 68
 
4.1%
Other values (16) 418
25.2%
Other Punctuation
ValueCountFrequency (%)
/ 7
41.2%
' 6
35.3%
# 2
 
11.8%
. 1
 
5.9%
& 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 13
81.2%
3 1
 
6.2%
7 1
 
6.2%
9 1
 
6.2%
Space Separator
ValueCountFrequency (%)
765
100.0%
Open Punctuation
ValueCountFrequency (%)
( 89
100.0%
Close Punctuation
ValueCountFrequency (%)
) 89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9030
90.1%
Common 997
 
9.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1168
 
12.9%
a 737
 
8.2%
r 661
 
7.3%
l 611
 
6.8%
n 609
 
6.7%
i 536
 
5.9%
o 463
 
5.1%
u 345
 
3.8%
t 328
 
3.6%
d 251
 
2.8%
Other values (45) 3321
36.8%
Common
ValueCountFrequency (%)
765
76.7%
( 89
 
8.9%
) 89
 
8.9%
- 20
 
2.0%
1 13
 
1.3%
/ 7
 
0.7%
' 6
 
0.6%
# 2
 
0.2%
3 1
 
0.1%
7 1
 
0.1%
Other values (4) 4
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10021
99.9%
None 5
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 1168
 
11.7%
765
 
7.6%
a 737
 
7.4%
r 661
 
6.6%
l 611
 
6.1%
n 609
 
6.1%
i 536
 
5.3%
o 463
 
4.6%
u 345
 
3.4%
t 328
 
3.3%
Other values (55) 3798
37.9%
None
ValueCountFrequency (%)
é 3
60.0%
à 1
 
20.0%
ú 1
 
20.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Hex
Categorical

HIGH CARDINALITY  UNIFORM 

Distinct765
Distinct (%)88.4%
Missing0
Missing (%)0.0%
Memory size54.0 KiB
#c19a6b
 
5
#fada5e
 
4
#967117
 
4
#808080
 
3
#a52a2a
 
3
Other values (760)
846 

Length

Max length7
Median length7
Mean length6.7988439
Min length4

Characters and Unicode

Total characters5881
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique684 ?
Unique (%)79.1%

Sample

1st row#5d8aa8
2nd row#00308f
3rd row#72a0c1
4th row#a32638
5th row#f0f8ff

Common Values

ValueCountFrequency (%)
#c19a6b 5
 
0.6%
#fada5e 4
 
0.5%
#967117 4
 
0.5%
#808080 3
 
0.3%
#a52a2a 3
 
0.3%
#f88379 3
 
0.3%
#900 3
 
0.3%
#0ff 3
 
0.3%
#cf0 3
 
0.3%
#008000 3
 
0.3%
Other values (755) 831
96.1%

Length

2023-01-25T14:29:43.133507image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
c19a6b 5
 
0.6%
967117 4
 
0.5%
fada5e 4
 
0.5%
808080 3
 
0.3%
0f0 3
 
0.3%
a52a2a 3
 
0.3%
483c32 3
 
0.3%
d2691e 3
 
0.3%
fad6a5 3
 
0.3%
dda0dd 3
 
0.3%
Other values (755) 831
96.1%

Most occurring characters

ValueCountFrequency (%)
# 865
14.7%
0 665
 
11.3%
f 625
 
10.6%
8 317
 
5.4%
c 300
 
5.1%
a 292
 
5.0%
e 269
 
4.6%
4 268
 
4.6%
b 268
 
4.6%
3 267
 
4.5%
Other values (7) 1745
29.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2997
51.0%
Lowercase Letter 2019
34.3%
Other Punctuation 865
 
14.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 665
22.2%
8 317
10.6%
4 268
8.9%
3 267
8.9%
6 265
 
8.8%
7 252
 
8.4%
9 250
 
8.3%
5 248
 
8.3%
2 243
 
8.1%
1 222
 
7.4%
Lowercase Letter
ValueCountFrequency (%)
f 625
31.0%
c 300
14.9%
a 292
14.5%
e 269
13.3%
b 268
13.3%
d 265
13.1%
Other Punctuation
ValueCountFrequency (%)
# 865
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3862
65.7%
Latin 2019
34.3%

Most frequent character per script

Common
ValueCountFrequency (%)
# 865
22.4%
0 665
17.2%
8 317
 
8.2%
4 268
 
6.9%
3 267
 
6.9%
6 265
 
6.9%
7 252
 
6.5%
9 250
 
6.5%
5 248
 
6.4%
2 243
 
6.3%
Latin
ValueCountFrequency (%)
f 625
31.0%
c 300
14.9%
a 292
14.5%
e 269
13.3%
b 268
13.3%
d 265
13.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5881
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
# 865
14.7%
0 665
 
11.3%
f 625
 
10.6%
8 317
 
5.4%
c 300
 
5.1%
a 292
 
5.0%
e 269
 
4.6%
4 268
 
4.6%
b 268
 
4.6%
3 267
 
4.5%
Other values (7) 1745
29.7%

R
Real number (ℝ)

Distinct221
Distinct (%)25.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean158.59884
Minimum0
Maximum255
Zeros81
Zeros (%)9.4%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-01-25T14:29:43.263052image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1101
median178
Q3236
95-th percentile255
Maximum255
Range255
Interquartile range (IQR)135

Descriptive statistics

Standard deviation85.338432
Coefficient of variation (CV)0.53807726
Kurtosis-0.92645087
Mean158.59884
Median Absolute Deviation (MAD)66
Skewness-0.59367921
Sum137188
Variance7282.6479
MonotonicityNot monotonic
2023-01-25T14:29:43.414437image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
255 110
 
12.7%
0 81
 
9.4%
250 15
 
1.7%
204 13
 
1.5%
150 11
 
1.3%
128 11
 
1.3%
227 10
 
1.2%
153 10
 
1.2%
244 10
 
1.2%
240 9
 
1.0%
Other values (211) 585
67.6%
ValueCountFrequency (%)
0 81
9.4%
1 4
 
0.5%
2 1
 
0.1%
3 2
 
0.2%
5 1
 
0.1%
6 1
 
0.1%
8 4
 
0.5%
10 1
 
0.1%
11 1
 
0.1%
13 1
 
0.1%
ValueCountFrequency (%)
255 110
12.7%
254 7
 
0.8%
253 8
 
0.9%
252 6
 
0.7%
251 9
 
1.0%
250 15
 
1.7%
249 4
 
0.5%
248 8
 
0.9%
247 3
 
0.3%
246 2
 
0.2%

G
Real number (ℝ)

Distinct234
Distinct (%)27.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124.68324
Minimum0
Maximum255
Zeros58
Zeros (%)6.7%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-01-25T14:29:43.558149image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q164
median123
Q3190
95-th percentile250
Maximum255
Range255
Interquartile range (IQR)126

Descriptive statistics

Standard deviation76.270225
Coefficient of variation (CV)0.61171194
Kurtosis-1.0978467
Mean124.68324
Median Absolute Deviation (MAD)63
Skewness0.052233472
Sum107851
Variance5817.1472
MonotonicityNot monotonic
2023-01-25T14:29:43.707151image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 58
 
6.7%
255 35
 
4.0%
128 13
 
1.5%
105 12
 
1.4%
51 11
 
1.3%
204 11
 
1.3%
66 9
 
1.0%
160 9
 
1.0%
218 9
 
1.0%
102 9
 
1.0%
Other values (224) 689
79.7%
ValueCountFrequency (%)
0 58
6.7%
1 2
 
0.2%
2 2
 
0.2%
3 2
 
0.2%
6 2
 
0.2%
8 2
 
0.2%
10 3
 
0.3%
11 2
 
0.2%
12 3
 
0.3%
14 2
 
0.2%
ValueCountFrequency (%)
255 35
4.0%
254 3
 
0.3%
253 2
 
0.2%
252 2
 
0.2%
251 1
 
0.1%
250 5
 
0.6%
249 1
 
0.1%
248 4
 
0.5%
247 2
 
0.2%
246 1
 
0.1%

B
Real number (ℝ)

Distinct230
Distinct (%)26.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119.08786
Minimum0
Maximum255
Zeros80
Zeros (%)9.2%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-01-25T14:29:43.852643image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q153
median119
Q3186
95-th percentile253.6
Maximum255
Range255
Interquartile range (IQR)133

Descriptive statistics

Standard deviation78.343862
Coefficient of variation (CV)0.65786606
Kurtosis-1.13796
Mean119.08786
Median Absolute Deviation (MAD)66
Skewness0.10728769
Sum103011
Variance6137.7608
MonotonicityNot monotonic
2023-01-25T14:29:43.998855image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 80
 
9.2%
255 41
 
4.7%
107 15
 
1.7%
128 14
 
1.6%
204 10
 
1.2%
94 9
 
1.0%
120 9
 
1.0%
50 8
 
0.9%
51 8
 
0.9%
153 8
 
0.9%
Other values (220) 663
76.6%
ValueCountFrequency (%)
0 80
9.2%
2 3
 
0.3%
3 1
 
0.1%
5 2
 
0.2%
7 2
 
0.2%
8 3
 
0.3%
9 1
 
0.1%
10 2
 
0.2%
11 3
 
0.3%
12 3
 
0.3%
ValueCountFrequency (%)
255 41
4.7%
254 3
 
0.3%
252 1
 
0.1%
251 1
 
0.1%
250 7
 
0.8%
249 1
 
0.1%
245 3
 
0.3%
244 2
 
0.2%
241 1
 
0.1%
240 6
 
0.7%

Interactions

2023-01-25T14:29:42.068577image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
2023-01-25T14:29:41.358564image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
2023-01-25T14:29:41.713846image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
2023-01-25T14:29:42.183781image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
2023-01-25T14:29:41.479067image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
2023-01-25T14:29:41.833270image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/