Frequency Analysis

Reset
2,301,223
Total Tokens
143,976
Unique Types
6.26%
Type-Token Ratio
3,067
Corpus Entries

Word Frequency List

143,976 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
51 sii 3,474
0.15%
0.1510% 1,509.6
52 leh 3,453
0.15%
0.1501% 1,500.5
53 magaalada 3,395
0.15%
0.1475% 1,475.3
54 yihiin 3,328
0.14%
0.1446% 1,446.2
55 ma 3,205
0.14%
0.1393% 1,392.7
56 Qoraalka 3,131
0.14%
0.1361% 1,360.6
57 marka 3,102
0.13%
0.1348% 1,348.0
58 ahaan 3,096
0.13%
0.1345% 1,345.4
59 file 3,021
0.13%
0.1313% 1,312.8
60 imported 3,014
0.13%
0.1310% 1,309.7

Top 10 Words

oo
61848
ka
52762
ku
46277
ah
42618
ay
42444
ee
37386
iyo
37001
in
34780
ayaa
29623
uu
29508