Frequency Analysis

Total Tokens: 2,301,223
Unique Types: 143,976
Type-Token Ratio: 6.26%
Corpus Entries: 3,067
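The type-token ratio above is consistent with unique types divided by total tokens, expressed as a percentage. A minimal sketch of that arithmetic, assuming this is the definition in use; the function name `type_token_ratio` is hypothetical and not taken from the tool:

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (assumed definition)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary above.
ttr = type_token_ratio(143_976, 2_301_223)
print(f"{ttr:.2f}%")  # 6.26%
```

A ratio this low is typical of a large corpus, because frequent function words are counted again on every occurrence while each type is counted only once.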

Word Frequency List

143,976 words
Rank  Word      Freq  Distribution  % Tokens  Per 1M
461   lahayd    574   0.02%         0.0249%   249.4
462   erayadan  573   0.02%         0.0249%   249.0
463   Afka      568   0.02%         0.0247%   246.8
464   Somalia   565   0.02%         0.0246%   245.5
465   sababta   564   0.02%         0.0245%   245.1
466   taasoo    564   0.02%         0.0245%   245.1
467   mas       562   0.02%         0.0244%   244.2
468   toos      562   0.02%         0.0244%   244.2
469   Reuters   561   0.02%         0.0244%   243.8
470   taasi     561   0.02%         0.0244%   243.8
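The "% Tokens" and "Per 1M" columns appear to be the raw frequency normalized by the total token count of 2,301,223. A minimal sketch under that assumption; `relative_frequency` is a hypothetical helper, not part of the tool:

```python
def relative_frequency(freq: int, total_tokens: int) -> tuple[float, float]:
    """Return (percent of all tokens, occurrences per million tokens)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    share = freq / total_tokens
    return share * 100.0, share * 1_000_000.0


# Rank 461 ("lahayd", 574 occurrences) from the table above.
pct, per_million = relative_frequency(574, 2_301_223)
print(f"{pct:.4f}% {per_million:.1f}")  # 0.0249% 249.4
```

Per-million values make it possible to compare a word's frequency across corpora of different sizes, which raw counts do not allow.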

Top 10 Words

oo    61848
ka    52762
ku    46277
ah    42618
ay    42444
ee    37386
iyo   37001
in    34780
ayaa  29623
uu    29508