Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
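
The Type-Token Ratio above is consistent with dividing unique types by total tokens and expressing the result as a percentage. The following is a minimal sketch of that arithmetic; the function name `type_token_ratio` is hypothetical and is not taken from the tool itself, which may compute the figure differently.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (hypothetical helper)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures copied from the summary above.
ttr = type_token_ratio(unique_types=144_351, total_tokens=2_308_541)
print(f"Type-Token Ratio: {ttr:.2f}%")
```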

Word Frequency List

144,351 words
Rank  Word        Freq  Distribution  % Tokens  Per 1M
1001  cunto       271   0.01%         0.0117%   117.4
1002  Pakistan    270   0.01%         0.0117%   117.0
1003  Waxaad      270   0.01%         0.0117%   117.0
1004  xeerarka    270   0.01%         0.0117%   117.0
1005  boqolkiiba  270   0.01%         0.0117%   117.0
1006  kaasoo      269   0.01%         0.0117%   116.5
1007  hawlaha     268   0.01%         0.0116%   116.1
1008  tala        267   0.01%         0.0116%   115.7
1009  qaabka      267   0.01%         0.0116%   115.7
1010  2025        267   0.01%         0.0116%   115.7
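
The "% Tokens" and "Per 1M" columns match a simple normalization of each raw frequency against the corpus total of 2,308,541 tokens. The sketch below shows that calculation under this assumption; `TOTAL_TOKENS`, `percent_of_tokens`, and `per_million` are illustrative names, not part of the tool.

```python
TOTAL_TOKENS = 2_308_541  # total token count reported in the summary


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, as a percentage."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000 * freq / total


# A few rows copied from the frequency list above.
rows = [("cunto", 271), ("kaasoo", 269), ("hawlaha", 268), ("tala", 267)]
for word, freq in rows:
    print(f"{word:<10} {freq:>4} {percent_of_tokens(freq):.4f}% {per_million(freq):.1f}")
```

Per-million figures make counts comparable across corpora of different sizes, which raw frequencies alone do not allow.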

Top 10 Words

Word  Freq
oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539