Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
971 inkastoo 281
0.01%
0.0122% 121.7
972 dhaawac 280
0.01%
0.0121% 121.3
973 socota 280
0.01%
0.0121% 121.3
974 waxbarasho 280
0.01%
0.0121% 121.3
975 ahaanba 280
0.01%
0.0121% 121.3
976 xabsiga 279
0.01%
0.0121% 120.9
977 xeer 279
0.01%
0.0121% 120.9
978 Göteborg 279
0.01%
0.0121% 120.9
979 Galmudug 278
0.01%
0.0120% 120.4
980 doorasho 278
0.01%
0.0120% 120.4

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539