Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
31 wax 6,625
0.29%
0.2879% 2,878.9
32 sida 5,890
0.26%
0.2560% 2,559.5
33 wuxuu 5,740
0.25%
0.2494% 2,494.3
34 tahay 5,718
0.25%
0.2485% 2,484.8
35 kala 5,593
0.24%
0.2430% 2,430.4
36 yahay 5,485
0.24%
0.2384% 2,383.5
37 Soomaaliya 4,980
0.22%
0.2164% 2,164.1
38 inuu 4,949
0.22%
0.2151% 2,150.6
39 dadka 4,886
0.21%
0.2123% 2,123.2
40 markii 4,855
0.21%
0.2110% 2,109.7

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508