Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
1921 khilaafka 137
0.01%
0.0059% 59.3
1922 ciyaaraha 136
0.01%
0.0059% 58.9
1923 jiri 136
0.01%
0.0059% 58.9
1924 shaki 136
0.01%
0.0059% 58.9
1925 beeraha 136
0.01%
0.0059% 58.9
1926 doorashadii 136
0.01%
0.0059% 58.9
1927 salaysan 136
0.01%
0.0059% 58.9
1928 geerida 136
0.01%
0.0059% 58.9
1929 arimaha 136
0.01%
0.0059% 58.9
1930 haystay 136
0.01%
0.0059% 58.9

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539