Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
71 hor 2,392
0.10%
0.1039% 1,039.4
72 is 2,379
0.10%
0.1034% 1,033.8
73 Waa 2,366
0.10%
0.1028% 1,028.1
74 jiray 2,363
0.10%
0.1027% 1,026.8
75 looga 2,357
0.10%
0.1024% 1,024.2
76 da 2,333
0.10%
0.1014% 1,013.8
77 Mareykanka 2,293
0.10%
0.0996% 996.4
78 sheegtay 2,257
0.10%
0.0981% 980.8
79 isla 2,254
0.10%
0.0979% 979.5
80 Maxamed 2,229
0.10%
0.0969% 968.6

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508