Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
851 aanta 321
0.01%
0.0139% 139.0
852 galeen 321
0.01%
0.0139% 139.0
853 Denmark 320
0.01%
0.0139% 138.6
854 soco 320
0.01%
0.0139% 138.6
855 wiil 320
0.01%
0.0139% 138.6
856 wayn 319
0.01%
0.0138% 138.2
857 socdo 319
0.01%
0.0138% 138.2
858 Dawladda 319
0.01%
0.0138% 138.2
859 waayay 319
0.01%
0.0138% 138.2
860 jeeda 319
0.01%
0.0138% 138.2

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539