Frequency Analysis

Total Tokens: 2,301,223
Unique Types: 143,976
Type-Token Ratio: 6.26%
Corpus Entries: 3,067
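
The Type-Token Ratio figure is consistent with dividing Unique Types by Total Tokens and expressing the result as a percentage. The sketch below reproduces that arithmetic; the function name `type_token_ratio` is hypothetical and is not taken from the tool itself, whose actual implementation is not shown on this page.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (hypothetical helper)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures reported in the summary: 143,976 unique types, 2,301,223 total tokens.
ttr = type_token_ratio(143_976, 2_301_223)
print(f"{ttr:.2f}%")  # prints "6.26%"
```

A ratio this low is expected for a corpus of this size, since the ratio falls as the token count grows and common function words repeat.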

Word Frequency List

143,976 words
Rank  Word      Freq   Distribution  % Tokens  Per 1M
111   duwan     1,747  0.08%         0.0759%   759.2
112   iska      1,691  0.07%         0.0735%   734.8
113   Waxa      1,661  0.07%         0.0722%   721.8
114   leeyahay  1,627  0.07%         0.0707%   707.0
115   mar       1,623  0.07%         0.0705%   705.3
116   Al        1,619  0.07%         0.0704%   703.5
117   Itoobiya  1,613  0.07%         0.0701%   700.9
118   inaad     1,606  0.07%         0.0698%   697.9
119   Ku        1,597  0.07%         0.0694%   694.0
120   hal       1,578  0.07%         0.0686%   685.7
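
The % Tokens and Per 1M values are consistent with normalizing each raw frequency by the corpus total of 2,301,223 tokens. A minimal sketch of that normalization follows, assuming this is how the columns are derived; the helper names `percent_of_tokens` and `per_million` are hypothetical.

```python
TOTAL_TOKENS = 2_301_223  # total token count reported for the corpus


def percent_of_tokens(freq: int, total_tokens: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, as a percentage."""
    return freq / total_tokens * 100


def per_million(freq: int, total_tokens: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return freq / total_tokens * 1_000_000


# Check two rows of the frequency list (ranks 111 and 120).
for word, freq in [("duwan", 1_747), ("hal", 1_578)]:
    print(word, f"{percent_of_tokens(freq):.4f}%", f"{per_million(freq):.1f}")
# duwan 0.0759% 759.2
# hal 0.0686% 685.7
```

Per-million values make frequencies comparable across corpora of different sizes, which raw counts are not.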

Top 10 Words

Word  Freq
oo    61,848
ka    52,762
ku    46,277
ah    42,618
ay    42,444
ee    37,386
iyo   37,001
in    34,780
ayaa  29,623
uu    29,508