Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
1021 Xildhibaan 264
0.01%
0.0114% 114.4
1022 dhibaatada 264
0.01%
0.0114% 114.4
1023 bixiyay 264
0.01%
0.0114% 114.4
1024 Akademiya 264
0.01%
0.0114% 114.4
1025 Hargeysa 263
0.01%
0.0114% 113.9
1026 Maanta 263
0.01%
0.0114% 113.9
1027 31 263
0.01%
0.0114% 113.9
1028 Sidaas 263
0.01%
0.0114% 113.9
1029 hub 263
0.01%
0.0114% 113.9
1030 arko 263
0.01%
0.0114% 113.9

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539