Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
861 50 318
0.01%
0.0138% 137.7
862 difaaca 318
0.01%
0.0138% 137.7
863 carruur 318
0.01%
0.0138% 137.7
864 dhamaan 317
0.01%
0.0137% 137.3
865 aheyd 317
0.01%
0.0137% 137.3
866 degdeg 317
0.01%
0.0137% 137.3
867 cirka 317
0.01%
0.0137% 137.3
868 xil 316
0.01%
0.0137% 136.9
869 shisheeye 316
0.01%
0.0137% 136.9
870 Mar 316
0.01%
0.0137% 136.9

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539