Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
121 com 1,576
0.07%
0.0685% 684.9
122 hadda 1,571
0.07%
0.0683% 682.7
123 kadib 1,567
0.07%
0.0681% 680.9
124 qaar 1,558
0.07%
0.0677% 677.0
125 tirsan 1,557
0.07%
0.0677% 676.6
126 Soomaaliyeed 1,551
0.07%
0.0674% 674.0
127 maanta 1,551
0.07%
0.0674% 674.0
128 Muqdisho 1,539
0.07%
0.0669% 668.8
129 doono 1,494
0.06%
0.0649% 649.2
130 Somaliska 1,488
0.06%
0.0647% 646.6

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508