Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
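
The type-token ratio shown here is consistent with dividing unique types by total tokens and expressing the result as a percentage. A minimal sketch of that arithmetic follows; the function name `type_token_ratio` is hypothetical and is not taken from the tool itself.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return unique types as a percentage of total tokens.

    Hypothetical helper: assumes the ratio is types / tokens * 100.
    """
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary statistics for this corpus.
ttr = type_token_ratio(unique_types=144_351, total_tokens=2_308_541)
print(f"{ttr:.2f}%")  # prints 6.25%
```

Because the ratio falls as a corpus grows, it is most useful for comparing corpora of similar size.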

Word Frequency List

144,351 words
Rank  Word       Freq  Distribution  % Tokens  Per 1M
1981  sababtay   132   0.01%         0.0057%   57.2
1982  yaab       132   0.01%         0.0057%   57.2
1983  inuusan    132   0.01%         0.0057%   57.2
1984  Qofkasta   132   0.01%         0.0057%   57.2
1985  1951       131   0.01%         0.0057%   56.7
1986  shabaab    131   0.01%         0.0057%   56.7
1987  dheeri     131   0.01%         0.0057%   56.7
1988  Cismaan    131   0.01%         0.0057%   56.7
1989  dhiso      131   0.01%         0.0057%   56.7
1990  horseeday  131   0.01%         0.0057%   56.7
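
The "% Tokens" and "Per 1M" values in the list match a raw count normalized by the corpus size of 2,308,541 tokens. The sketch below reproduces that arithmetic under that assumption; the helper names `percent_of_tokens` and `per_million` are hypothetical.

```python
TOTAL_TOKENS = 2_308_541  # total tokens reported for this corpus


def percent_of_tokens(freq: int, total_tokens: int = TOTAL_TOKENS) -> float:
    """Raw frequency as a percentage of all tokens (assumed formula)."""
    return 100.0 * freq / total_tokens


def per_million(freq: int, total_tokens: int = TOTAL_TOKENS) -> float:
    """Raw frequency scaled to occurrences per one million tokens."""
    return freq * 1_000_000 / total_tokens


# Check against the two raw frequencies that appear in the list.
for freq in (132, 131):
    print(freq, f"{percent_of_tokens(freq):.4f}%", f"{per_million(freq):.1f}")
# 132 0.0057% 57.2
# 131 0.0057% 56.7
```

Per-million figures make counts comparable across corpora of different sizes, which raw frequencies are not.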

Top 10 Words

oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539