Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
881 nolol 310
0.01%
0.0134% 134.3
882 Tigray 309
0.01%
0.0134% 133.9
883 qaadi 308
0.01%
0.0133% 133.4
884 baaqay 308
0.01%
0.0133% 133.4
885 culus 308
0.01%
0.0133% 133.4
886 Ciidanka 307
0.01%
0.0133% 133.0
887 mudan 307
0.01%
0.0133% 133.0
888 Aadan 306
0.01%
0.0133% 132.6
889 dadweynaha 306
0.01%
0.0133% 132.6
890 Soomaalidu 306
0.01%
0.0133% 132.6

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539