Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
501 2019 532
0.02%
0.0231% 231.2
502 waayo 531
0.02%
0.0231% 230.7
503 xaaladda 529
0.02%
0.0230% 229.9
504 booliska 529
0.02%
0.0230% 229.9
505 xiray 528
0.02%
0.0229% 229.4
506 waan 526
0.02%
0.0229% 228.6
507 dalkaasi 526
0.02%
0.0229% 228.6
508 GUUSHA 526
0.02%
0.0229% 228.6
509 qoraal 525
0.02%
0.0228% 228.1
510 waaweyn 524
0.02%
0.0228% 227.7

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508