Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
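
The Type-Token Ratio above appears to be the unique types divided by the total tokens, expressed as a percentage. The sketch below reproduces that figure from the two counts on this page; the function name `type_token_ratio` is only an illustrative assumption, not part of the tool.

```python
# Counts copied from the summary figures on this page.
total_tokens = 2_308_541
unique_types = 144_351


def type_token_ratio(types: int, tokens: int) -> float:
    """Return types / tokens as a percentage (hypothetical helper)."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return 100.0 * types / tokens


ttr = type_token_ratio(unique_types, total_tokens)
print(f"{ttr:.2f}%")  # prints 6.25%
```

Note that a raw type-token ratio shrinks as a corpus grows, so it is most useful for comparing corpora of similar size.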

Word Frequency List

144,351 words
Rank  Word         Freq  Distribution  % Tokens  Per 1M
1661  saldhig      162   0.01%         0.0070%   70.2
1662  nukliyeerka  162   0.01%         0.0070%   70.2
1663  aashan       162   0.01%         0.0070%   70.2
1664  Iswiidan     162   0.01%         0.0070%   70.2
1665  warar        161   0.01%         0.0070%   69.7
1666  Shariif      161   0.01%         0.0070%   69.7
1667  bannaan      161   0.01%         0.0070%   69.7
1668  bahda        161   0.01%         0.0070%   69.7
1669  rasaas       161   0.01%         0.0070%   69.7
1670  tani         161   0.01%         0.0070%   69.7
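
The "% Tokens" and "Per 1M" columns look like the raw frequency normalized by the total token count (2,308,541). The following is a minimal sketch under that assumption; `relative_frequency` is a hypothetical helper name, and the rounding is chosen to match the precision shown in the list.

```python
TOTAL_TOKENS = 2_308_541  # total tokens reported for this corpus


def relative_frequency(freq: int, total: int = TOTAL_TOKENS) -> tuple[float, float]:
    """Return (percent of all tokens, occurrences per million tokens)."""
    share = freq / total
    return share * 100, share * 1_000_000


# Two rows taken from the frequency list: (word, raw frequency).
for word, freq in [("saldhig", 162), ("warar", 161)]:
    pct, per_million = relative_frequency(freq)
    print(f"{word}: {pct:.4f}% of tokens, {per_million:.1f} per 1M")
# prints:
# saldhig: 0.0070% of tokens, 70.2 per 1M
# warar: 0.0070% of tokens, 69.7 per 1M
```

Per-million values make frequencies comparable across corpora of different sizes, which the raw counts alone do not.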

Top 10 Words

Word  Freq
oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539