Frequency Analysis

Total Tokens: 2,301,223
Unique Types: 143,976
Type-Token Ratio: 6.26%
Corpus Entries: 3,067
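
The type-token ratio shown above appears to be the unique types divided by the total tokens, expressed as a percentage. The page does not state its formula, so the sketch below is an assumption that happens to reproduce the displayed figure; the function name `type_token_ratio` is hypothetical.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return unique types as a percentage of total tokens.

    Assumed formula: the page does not document how it computes this.
    """
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary statistics above.
ttr = type_token_ratio(143_976, 2_301_223)
print(f"{ttr:.2f}%")
```

A ratio this low is typical of a large corpus, because the token count keeps growing while new word types appear more and more slowly.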

Word Frequency List

143,976 words
Rank  Word       Freq  Distribution  % Tokens  Per 1M
271   adda       846   0.04%         0.0368%   367.6
272   Ra         844   0.04%         0.0367%   366.8
273   Cabdi      843   0.04%         0.0366%   366.3
274   Sidoo      843   0.04%         0.0366%   366.3
275   kara       841   0.04%         0.0365%   365.5
276   kooxda     840   0.04%         0.0365%   365.0
277   kooban     837   0.04%         0.0364%   363.7
278   Caasimada  836   0.04%         0.0363%   363.3
279   Shabaab    834   0.04%         0.0362%   362.4
280   dhammaan   832   0.04%         0.0362%   361.5
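
The "% Tokens" and "Per 1M" columns look like the raw frequency normalized against the total token count of 2,301,223. The sketch below is an assumption about how those columns are derived, not the tool's documented method; `percent_of_tokens` and `per_million` are hypothetical helper names.

```python
TOTAL_TOKENS = 2_301_223  # "Total Tokens" from the summary statistics


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, as a percentage."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000 * freq / total


# Rank 271 ("adda") and rank 280 ("dhammaan") from the list above.
for word, freq in [("adda", 846), ("dhammaan", 832)]:
    print(word, f"{percent_of_tokens(freq):.4f}%", f"{per_million(freq):.1f}")
```

Normalizing per million tokens makes the figures comparable with frequency lists built from corpora of a different size.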

Top 10 Words

oo    61848
ka    52762
ku    46277
ah    42618
ay    42444
ee    37386
iyo   37001
in    34780
ayaa  29623
uu    29508