Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
421 lasoo 629
0.03%
0.0273% 273.3
422 shacabka 628
0.03%
0.0273% 272.9
423 qoray 628
0.03%
0.0273% 272.9
424 dhaqan 627
0.03%
0.0272% 272.5
425 maadaama 627
0.03%
0.0272% 272.5
426 19 626
0.03%
0.0272% 272.0
427 dhintay 626
0.03%
0.0272% 272.0
428 warbaahinta 623
0.03%
0.0271% 270.7
429 aanu 622
0.03%
0.0270% 270.3
430 taagan 622
0.03%
0.0270% 270.3

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508