Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
1041 halkii 261
0.01%
0.0113% 113.1
1042 shirkad 261
0.01%
0.0113% 113.1
1043 Qaxootiga 261
0.01%
0.0113% 113.1
1044 guusha 260
0.01%
0.0113% 112.6
1045 awgeed 260
0.01%
0.0113% 112.6
1046 fadhiya 259
0.01%
0.0112% 112.2
1047 halkan 259
0.01%
0.0112% 112.2
1048 xadka 258
0.01%
0.0112% 111.8
1049 xanuun 258
0.01%
0.0112% 111.8
1050 Jubbaland 258
0.01%
0.0112% 111.8

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539