Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
901 lix 302
0.01%
0.0131% 130.8
902 taageero 302
0.01%
0.0131% 130.8
903 mucaaradka 302
0.01%
0.0131% 130.8
904 galaan 301
0.01%
0.0130% 130.4
905 Waxaana 300
0.01%
0.0130% 130.0
906 dugsiga 300
0.01%
0.0130% 130.0
907 ammaanka 300
0.01%
0.0130% 130.0
908 cabsi 300
0.01%
0.0130% 130.0
909 sabab 300
0.01%
0.0130% 130.0
910 qoro 300
0.01%
0.0130% 130.0

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539