Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
141931 Dhinacyadad 1
0.00%
0.0000% 0.4
141932 2120 1
0.00%
0.0000% 0.4
141933 ugandha 1
0.00%
0.0000% 0.4
141934 shegtay 1
0.00%
0.0000% 0.4
141935 urdun 1
0.00%
0.0000% 0.4
141936 liibiyaan 1
0.00%
0.0000% 0.4
141937 Venster 1
0.00%
0.0000% 0.4
141938 yaren 1
0.00%
0.0000% 0.4
141939 Somaliyda 1
0.00%
0.0000% 0.4
141940 Arimo 1
0.00%
0.0000% 0.4

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539