Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
311 dagaal 763
0.03%
0.0332% 331.6
312 socday 761
0.03%
0.0331% 330.7
313 shaqada 761
0.03%
0.0331% 330.7
314 Markii 760
0.03%
0.0330% 330.3
315 helo 759
0.03%
0.0330% 329.8
316 shirkadda 759
0.03%
0.0330% 329.8
317 Madaxweyne 758
0.03%
0.0329% 329.4
318 dhacday 756
0.03%
0.0329% 328.5
319 Maraykanka 755
0.03%
0.0328% 328.1
320 Si 753
0.03%
0.0327% 327.2

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508