Dataset statistics
Number of variables | 1 |
---|---|
Number of observations | 69240 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 53884 |
Duplicate rows (%) | 77.8% |
Total size in memory | 5.9 MiB |
Average record size in memory | 89.1 B |
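The percentages in this table follow from the raw counts. As a minimal sketch in plain Python (not how pandas-profiling computes them internally), the arithmetic can be re-checked as below. The variable names are illustrative, and the figures are copied from this report, including the distinct count reported further down for the single column.

```python
# Figures copied from the report; variable names are illustrative only.
n_rows = 69240          # Number of observations
n_duplicates = 53884    # Duplicate rows
n_distinct = 15356      # Distinct count of the single column

# Share of rows that repeat an earlier row, rounded as in the report.
duplicate_pct = round(100 * n_duplicates / n_rows, 1)

# With one column, every row is either a first occurrence or a repeat,
# so the non-duplicate rows should equal the distinct count.
first_occurrences = n_rows - n_duplicates

# Rough per-record size implied by the rounded total of 5.9 MiB.
approx_record_bytes = 5.9 * 1024 * 1024 / n_rows

print(duplicate_pct, first_occurrences, round(approx_record_bytes))
```

The per-record figure only approximates the reported 89.1 B because the 5.9 MiB total is itself rounded.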
Variable types
CAT | 1 |
---|
Reproduction
Analysis started | 2020-06-09 05:15:57.002462 |
---|---|
Analysis finished | 2020-06-09 05:15:58.473779 |
Duration | 1.47 seconds |
Version | pandas-profiling v2.8.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Distinct count | 15356 |
---|---|
Unique (%) | 22.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 541.1 KiB |
Value | Count |
---|---|
<td></td> | 17603 |
</tr> | 4573 |
<td>weekly</td> | 4179 |
<td>2018</td> | 862 |
<td>2019</td> | 832 |
Other values (15351) |
Value | Count | Frequency (%) | |
<td></td> | 17603 | 25.4% | |
</tr> | 4573 | 6.6% | |
<td>weekly</td> | 4179 | 6.0% | |
<td>2018</td> | 862 | 1.2% | |
<td>2019</td> | 832 | 1.2% | |
<td>2017</td> | 807 | 1.2% | |
<td>2016</td> | 695 | 1.0% | |
<td>3</td> | 651 | 0.9% | |
<td>4</td> | 639 | 0.9% | |
<td>2</td> | 605 | 0.9% | |
<td>United Kingdom</td> | 518 | 0.7% | |
<td>France</td> | 483 | 0.7% | |
<td>5</td> | 479 | 0.7% | |
<td>7</td> | 460 | 0.7% | |
<td>9</td> | 450 | 0.6% | |
<td>10</td> | 445 | 0.6% | |
<td>6</td> | 436 | 0.6% | |
<td>8</td> | 435 | 0.6% | |
<td>11</td> | 432 | 0.6% | |
<td>Brazil</td> | 425 | 0.6% | |
<td>12</td> | 409 | 0.6% | |
<td>monthly</td> | 393 | 0.6% | |
<td>2015</td> | 389 | 0.6% | |
<td>1</td> | 381 | 0.6% | |
<td>2020</td> | 380 | 0.5% | |
Other values (15331) | 31279 | 45.2% |
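A value/count/frequency table like the one above can be approximated with a simple counter over the rows. The sketch below uses only the standard library; `value_frequencies` is a hypothetical helper, the sample is toy data rather than the profiled file, and pandas-profiling may handle ties and rounding differently.

```python
from collections import Counter

def value_frequencies(lines, top=3):
    """Return (value, count, percent) tuples for the most common values."""
    counts = Counter(lines)
    total = sum(counts.values())
    return [
        (value, count, round(100 * count / total, 1))
        for value, count in counts.most_common(top)
    ]

# Toy input standing in for the single text column.
sample = ["<td></td>", "</tr>", "<td></td>", "<td>weekly</td>", "<td></td>"]
rows = value_frequencies(sample)
for value, count, pct in rows:
    print(f"{value} | {count} | {pct}%")
```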
Length
Max length | 765 |
---|---|
Median length | 27 |
Mean length | 32.03019931 |
Min length | 1 |
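The length statistics are summaries of the per-row string lengths. A minimal sketch, assuming each row is treated as a plain Python string (`length_summary` is an illustrative name, and the sample is toy data):

```python
from statistics import mean, median

def length_summary(values):
    """Summarize string lengths the way the Length table does."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# Toy input, not the profiled file.
sample = ["<td></td>", "</tr>", "<td>weekly</td>"]
summary = length_summary(sample)
print(summary)
```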
Most occurring characters
Value | Count | Frequency (%) | |
(space) | 951669 | 42.9% | |
t | 138236 | 6.2% | |
d | 136248 | 6.1% | |
< | 128869 | 5.8% | |
> | 128865 | 5.8% | |
/ | 64823 | 2.9% | |
- | 51909 | 2.3% | |
" | 47802 | 2.2% | |
e | 44492 | 2.0% | |
l | 40539 | 1.8% | |
1 | 36071 | 1.6% | |
n | 33420 | 1.5% | |
0 | 33015 | 1.5% | |
i | 32856 | 1.5% | |
2 | 32232 | 1.5% | |
s | 30976 | 1.4% | |
a | 25977 | 1.2% | |
= | 23942 | 1.1% | |
r | 23639 | 1.1% | |
b | 18871 | 0.9% | |
m | 16335 | 0.7% | |
u | 15685 | 0.7% | |
3 | 13695 | 0.6% | |
4 | 11785 | 0.5% | |
c | 11344 | 0.5% | |
Other values (63) | 124476 | 5.6% |
Most occurring categories
Value | Count | Frequency (%) | |
Space Separator | 951669 | 42.9% | |
Lowercase Letter | 616995 | 27.8% | |
Math Symbol | 281706 | 12.7% | |
Decimal Number | 179608 | 8.1% | |
Other Punctuation | 114171 | 5.1% | |
Dash Punctuation | 51909 | 2.3% | |
Uppercase Letter | 21606 | 1.0% | |
Connector Punctuation | 96 | < 0.1% | |
Other Symbol | 6 | < 0.1% | |
Modifier Symbol | 2 | < 0.1% | |
Open Punctuation | 1 | < 0.1% | |
Close Punctuation | 1 | < 0.1% | |
Final Punctuation | 1 | < 0.1% |
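The category names in this table match Unicode general categories, which Python exposes through `unicodedata.category`. The following is a sketch of how such counts could be reproduced. The `CATEGORY_NAMES` mapping is a hand-written subset and `category_counts` is a hypothetical helper, so it is not necessarily the approach the profiler takes.

```python
import unicodedata
from collections import Counter

# Hand-written subset of general-category codes and their long names.
CATEGORY_NAMES = {
    "Zs": "Space Separator",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
}

def category_counts(text):
    """Count characters per Unicode general category."""
    counts = Counter(unicodedata.category(ch) for ch in text)
    # Fall back to the two-letter code for categories not in the subset.
    return {CATEGORY_NAMES.get(code, code): n for code, n in counts.items()}

# Toy line of markup; '<', '>' and '=' fall under Math Symbol.
result = category_counts('<td class="a-1">2019</td>')
print(result)
```

This also explains why Math Symbol ranks so high in the table: the angle brackets of the markup are classified as math symbols.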
Most frequent Math Symbol characters
Value | Count | Frequency (%) | |
< | 128869 | 45.7% | |
> | 128865 | 45.7% | |
= | 23942 | 8.5% | |
+ | 30 | < 0.1% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
t | 138236 | 22.4% | |
d | 136248 | 22.1% | |
e | 44492 | 7.2% | |
l | 40539 | 6.6% | |
n | 33420 | 5.4% | |
i | 32856 | 5.3% | |
s | 30976 | 5.0% | |
a | 25977 | 4.2% | |
r | 23639 | 3.8% | |
b | 18871 | 3.1% | |
m | 16335 | 2.6% | |
u | 15685 | 2.5% | |
c | 11344 | 1.8% | |
j | 9462 | 1.5% | |
o | 8967 | 1.5% | |
y | 6054 | 1.0% | |
w | 5769 | 0.9% | |
k | 5328 | 0.9% | |
f | 5284 | 0.9% | |
g | 2129 | 0.3% | |
h | 1693 | 0.3% | |
p | 1211 | 0.2% | |
v | 1172 | 0.2% | |
z | 871 | 0.1% | |
x | 412 | 0.1% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 951669 | 100.0% | |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
/ | 64823 | 56.8% | |
" | 47802 | 41.9% | |
. | 1369 | 1.2% | |
: | 80 | 0.1% | |
% | 30 | < 0.1% | |
; | 24 | < 0.1% | |
& | 19 | < 0.1% | |
# | 6 | < 0.1% | |
? | 6 | < 0.1% | |
! | 5 | < 0.1% | |
· | 2 | < 0.1% | |
' | 2 | < 0.1% | |
… | 2 | < 0.1% | |
@ | 1 | < 0.1% |
Most frequent Dash Punctuation characters
Value | Count | Frequency (%) | |
- | 51909 | 100.0% |
Most frequent Decimal Number characters
Value | Count | Frequency (%) | |
1 | 36071 | 20.1% | |
0 | 33015 | 18.4% | |
2 | 32232 | 17.9% | |
3 | 13695 | 7.6% | |
4 | 11785 | 6.6% | |
9 | 11257 | 6.3% | |
7 | 10673 | 5.9% | |
8 | 10663 | 5.9% | |
6 | 10292 | 5.7% | |
5 | 9925 | 5.5% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
L | 9229 | 42.7% | |
C | 4858 | 22.5% | |
S | 966 | 4.5% | |
F | 786 | 3.6% | |
B | 741 | 3.4% | |
U | 717 | 3.3% | |
N | 678 | 3.1% | |
K | 542 | 2.5% | |
P | 435 | 2.0% | |
A | 375 | 1.7% | |
D | 300 | 1.4% | |
I | 278 | 1.3% | |
R | 266 | 1.2% | |
G | 262 | 1.2% | |
J | 238 | 1.1% | |
M | 236 | 1.1% | |
Y | 191 | 0.9% | |
H | 133 | 0.6% | |
V | 101 | 0.5% | |
T | 80 | 0.4% | |
E | 61 | 0.3% | |
Z | 33 | 0.2% | |
O | 27 | 0.1% | |
Q | 26 | 0.1% | |
W | 25 | 0.1% |
Most frequent Connector Punctuation characters
Value | Count | Frequency (%) | |
_ | 96 | 100.0% |
Most frequent Modifier Symbol characters
Value | Count | Frequency (%) | |
` | 2 | 100.0% |
Most frequent Other Symbol characters
Value | Count | Frequency (%) | |
↵ | 6 | 100.0% |
Most frequent Open Punctuation characters
Value | Count | Frequency (%) | |
( | 1 | 100.0% |
Most frequent Close Punctuation characters
Value | Count | Frequency (%) | |
) | 1 | 100.0% |
Most frequent Final Punctuation characters
Value | Count | Frequency (%) | |
’ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Common | 1579170 | 71.2% | |
Latin | 638601 | 28.8% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 951669 | 60.3% | |
< | 128869 | 8.2% | |
> | 128865 | 8.2% | |
/ | 64823 | 4.1% | |
- | 51909 | 3.3% | |
" | 47802 | 3.0% | |
1 | 36071 | 2.3% | |
0 | 33015 | 2.1% | |
2 | 32232 | 2.0% | |
= | 23942 | 1.5% | |
3 | 13695 | 0.9% | |
4 | 11785 | 0.7% | |
9 | 11257 | 0.7% | |
7 | 10673 | 0.7% | |
8 | 10663 | 0.7% | |
6 | 10292 | 0.7% | |
5 | 9925 | 0.6% | |
. | 1369 | 0.1% | |
_ | 96 | < 0.1% | |
: | 80 | < 0.1% | |
+ | 30 | < 0.1% | |
% | 30 | < 0.1% | |
; | 24 | < 0.1% | |
& | 19 | < 0.1% | |
# | 6 | < 0.1% | |
Other values (11) | 29 | < 0.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
t | 138236 | 21.6% | |
d | 136248 | 21.3% | |
e | 44492 | 7.0% | |
l | 40539 | 6.3% | |
n | 33420 | 5.2% | |
i | 32856 | 5.1% | |
s | 30976 | 4.9% | |
a | 25977 | 4.1% | |
r | 23639 | 3.7% | |
b | 18871 | 3.0% | |
m | 16335 | 2.6% | |
u | 15685 | 2.5% | |
c | 11344 | 1.8% | |
j | 9462 | 1.5% | |
L | 9229 | 1.4% | |
o | 8967 | 1.4% | |
y | 6054 | 0.9% | |
w | 5769 | 0.9% | |
k | 5328 | 0.8% | |
f | 5284 | 0.8% | |
C | 4858 | 0.8% | |
g | 2129 | 0.3% | |
h | 1693 | 0.3% | |
p | 1211 | 0.2% | |
v | 1172 | 0.2% | |
Other values (27) | 8827 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 2217760 | > 99.9% | |
Arrows | 6 | < 0.1% | |
Punctuation | 3 | < 0.1% | |
None | 2 | < 0.1% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
(space) | 951669 | 42.9% | |
t | 138236 | 6.2% | |
d | 136248 | 6.1% | |
< | 128869 | 5.8% | |
> | 128865 | 5.8% | |
/ | 64823 | 2.9% | |
- | 51909 | 2.3% | |
" | 47802 | 2.2% | |
e | 44492 | 2.0% | |
l | 40539 | 1.8% | |
1 | 36071 | 1.6% | |
n | 33420 | 1.5% | |
0 | 33015 | 1.5% | |
i | 32856 | 1.5% | |
2 | 32232 | 1.5% | |
s | 30976 | 1.4% | |
a | 25977 | 1.2% | |
= | 23942 | 1.1% | |
r | 23639 | 1.1% | |
b | 18871 | 0.9% | |
m | 16335 | 0.7% | |
u | 15685 | 0.7% | |
3 | 13695 | 0.6% | |
4 | 11785 | 0.5% | |
c | 11344 | 0.5% | |
Other values (59) | 124465 | 5.6% |
Most frequent None characters
Value | Count | Frequency (%) | |
· | 2 | 100.0% |
Most frequent Arrows characters
Value | Count | Frequency (%) | |
↵ | 6 | 100.0% |
Most frequent Punctuation characters
Value | Count | Frequency (%) | |
… | 2 | 66.7% | |
’ | 1 | 33.3% |
First rows
(index) | <!DOCTYPE html> |
---|---|
0 | <html lang="en"> |
1 | <head> |
2 | <meta charset="utf-8"> |
3 | <link rel="dns-prefetch" href="https://github.githubassets.com"> |
4 | <link rel="dns-prefetch" href="https://avatars0.githubusercontent.com"> |
5 | <link rel="dns-prefetch" href="https://avatars1.githubusercontent.com"> |
6 | <link rel="dns-prefetch" href="https://avatars2.githubusercontent.com"> |
7 | <link rel="dns-prefetch" href="https://avatars3.githubusercontent.com"> |
8 | <link rel="dns-prefetch" href="https://github-cloud.s3.amazonaws.com"> |
9 | <link rel="dns-prefetch" href="https://user-images.githubusercontent.com/"> |
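The first rows suggest that the profiled file was an HTML page read as a one-column CSV, with `<!DOCTYPE html>` taken as the column name. A small pre-check before profiling could catch this. The sketch below is a heuristic under that assumption, and `looks_like_html` is a hypothetical helper, not part of pandas-profiling.

```python
def looks_like_html(first_line: str) -> bool:
    """Heuristic: does the first line of a file look like HTML markup?"""
    # Drop a possible byte-order mark and leading whitespace, ignore case.
    head = first_line.lstrip("\ufeff \t").lower()
    return head.startswith(("<!doctype html", "<html"))

# Usage: test the header line before handing the file to a CSV reader.
print(looks_like_html("<!DOCTYPE html>"))         # HTML page
print(looks_like_html("year,country,frequency"))  # plausible CSV header
```

A check on the first line only is cheap, but it will miss HTML that starts with comments or other preamble.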
Last rows
(index) | <!DOCTYPE html> |
---|---|
69230 | <div class="octocat-spinner my-6 js-details-dialog-spinner"></div> |
69231 | </details-dialog> |
69232 | </details> |
69233 | </template> |
69234 | <div class="Popover js-hovercard-content position-absolute" style="display: none; outline: none;" tabindex="0"> |
69235 | <div class="Popover-message Popover-message--bottom-left Popover-message--large Box box-shadow-large" style="width:360px;"> |
69236 | </div> |
69237 | </div> |
69238 | </body> |
69239 | </html> |
Most frequent
(index) | <!DOCTYPE html> | count |
---|---|---|
1851 | <td></td> | 17603 |
1891 | </tr> | 4573 |
1886 | <td>weekly</td> | 4179 |
1185 | <td>2018</td> | 862 |
1336 | <td>2019</td> | 832 |
1032 | <td>2017</td> | 807 |
879 | <td>2016</td> | 695 |
1621 | <td>3</td> | 651 |
1636 | <td>4</td> | 639 |
1567 | <td>2</td> | 605 |