Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
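
The type-token ratio above appears to be the number of unique types divided by the total token count, expressed as a percentage. A minimal sketch of that arithmetic follows; the function name `type_token_ratio` is hypothetical and is not taken from the tool itself.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (assumed definition: types / tokens)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary above.
ttr = type_token_ratio(144_351, 2_308_541)
print(f"{ttr:.2f}%")  # 6.25%
```

A low ratio like this is expected for a large corpus, since the token count grows much faster than the number of distinct word forms.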

Word Frequency List

144,351 words
Rank  Word          Freq  Distribution  % Tokens  Per 1M
2361  qaada         109   0.00%         0.0047%   47.2
2362  weerarro      109   0.00%         0.0047%   47.2
2363  qoyskiisa     109   0.00%         0.0047%   47.2
2364  William       109   0.00%         0.0047%   47.2
2365  iibiyo        109   0.00%         0.0047%   47.2
2366  gebi          109   0.00%         0.0047%   47.2
2367  islamarkana   109   0.00%         0.0047%   47.2
2368  dhacdooyinka  108   0.00%         0.0047%   46.8
2369  rabay         108   0.00%         0.0047%   46.8
2370  agaasimaha    108   0.00%         0.0047%   46.8
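
The "% Tokens" and "Per 1M" columns are consistent with simple normalisation of the raw frequency by the corpus size of 2,308,541 tokens. The sketch below reproduces the listed values under that assumption; `relative_frequency` and `TOTAL_TOKENS` are illustrative names, not part of the tool.

```python
TOTAL_TOKENS = 2_308_541  # total token count reported for the corpus


def relative_frequency(freq: int, total_tokens: int = TOTAL_TOKENS) -> tuple[float, float]:
    """Return (percentage of all tokens, occurrences per million tokens)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    share = freq / total_tokens
    return 100.0 * share, 1_000_000 * share


# Two rows from the list above: one with frequency 109, one with 108.
for word, freq in [("qaada", 109), ("agaasimaha", 108)]:
    pct, per_million = relative_frequency(freq)
    print(f"{word}: {pct:.4f}% of tokens, {per_million:.1f} per 1M")
```

Per-million figures make frequencies comparable across corpora of different sizes, which raw counts are not.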

Top 10 Words

Word  Freq
oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539