Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
1271 isticmaalka 211
0.01%
0.0091% 91.4
1272 Buugga 211
0.01%
0.0091% 91.4
1273 Mogadishu 210
0.01%
0.0091% 91.0
1274 hadlayo 210
0.01%
0.0091% 91.0
1275 Ilaahay 210
0.01%
0.0091% 91.0
1276 dhinacyada 210
0.01%
0.0091% 91.0
1277 Sharciga 210
0.01%
0.0091% 91.0
1278 caddeeyay 210
0.01%
0.0091% 91.0
1279 Mansuur 210
0.01%
0.0091% 91.0
1280 codsaday 209
0.01%
0.0091% 90.5

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539