Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
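
The type-token ratio above follows from the two counts reported with it (unique types divided by total tokens, as a percentage). A minimal sketch of that arithmetic, where the function name `type_token_ratio` is hypothetical and not part of the tool:

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Counts taken from the summary statistics above.
ttr = type_token_ratio(unique_types=144_351, total_tokens=2_308_541)
print(f"{ttr:.2f}%")  # prints "6.25%"
```

A ratio this low is typical of a large corpus, because high-frequency function words are repeated many times while the vocabulary grows slowly.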

Word Frequency List

144,351 words
Rank  Word      Freq  Distribution  % Tokens  Per 1M
771   dhawaad   355   0.02%         0.0154%   153.8
772   wasiirka  355   0.02%         0.0154%   153.8
773   khatar    355   0.02%         0.0154%   153.8
774   doontaa   355   0.02%         0.0154%   153.8
775   2024      353   0.02%         0.0153%   152.9
776   uuna      352   0.02%         0.0152%   152.5
777   gaari     352   0.02%         0.0152%   152.5
778   saaran    352   0.02%         0.0152%   152.5
779   Dhexe     351   0.02%         0.0152%   152.0
780   goob      351   0.02%         0.0152%   152.0
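
The "% Tokens" and "Per 1M" columns are consistent with dividing each raw frequency by the total token count (2,308,541). The sketch below reproduces them under that assumption; the helper names `percent_of_tokens` and `per_million` are hypothetical, and the coarser "Distribution" column is not modeled here.

```python
TOTAL_TOKENS = 2_308_541  # total token count reported for the corpus


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, as a percentage."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000.0 * freq / total


# Raw frequencies for a few rows of the list above.
for word, freq in [("dhawaad", 355), ("2024", 353), ("uuna", 352), ("goob", 351)]:
    print(f"{word:10s} {freq:4d} {percent_of_tokens(freq):.4f}% {per_million(freq):.1f}")
```

Per-million normalization makes these figures comparable with frequency lists built from corpora of a different size.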

Top 10 Words

Word  Freq
oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539