Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
171 kasoo 1,192
0.05%
0.0518% 518.0
172 In 1,192
0.05%
0.0518% 518.0
173 Soomaaliga 1,192
0.05%
0.0518% 518.0
174 walba 1,186
0.05%
0.0515% 515.4
175 nin 1,176
0.05%
0.0511% 511.0
176 baxay 1,171
0.05%
0.0509% 508.9
177 Ruushka 1,165
0.05%
0.0506% 506.3
178 ahayn 1,155
0.05%
0.0502% 501.9
179 iyada 1,154
0.05%
0.0501% 501.5
180 Soomaalida 1,150
0.05%
0.0500% 499.7

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508