Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
1751 saarka 151
0.01%
0.0065% 65.4
1752 dhiig 151
0.01%
0.0065% 65.4
1753 mida 151
0.01%
0.0065% 65.4
1754 liiska 151
0.01%
0.0065% 65.4
1755 noqdeen 151
0.01%
0.0065% 65.4
1756 Cabdinaasir 150
0.01%
0.0065% 65.0
1757 codka 150
0.01%
0.0065% 65.0
1758 qaybaha 150
0.01%
0.0065% 65.0
1759 dileen 150
0.01%
0.0065% 65.0
1760 xiriirta 150
0.01%
0.0065% 65.0

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539