Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
1541 kalana 175
0.01%
0.0076% 75.8
1542 Dibadda 175
0.01%
0.0076% 75.8
1543 casharka 175
0.01%
0.0076% 75.8
1544 bari 175
0.01%
0.0076% 75.8
1545 isticmaali 175
0.01%
0.0076% 75.8
1546 gabdhaha 175
0.01%
0.0076% 75.8
1547 xaaskiisa 175
0.01%
0.0076% 75.8
1548 shaqeeyo 174
0.01%
0.0075% 75.4
1549 xidhiidha 174
0.01%
0.0075% 75.4
1550 warramay 174
0.01%
0.0075% 75.4

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539