Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
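
The type-token ratio above is consistent with dividing unique types by total tokens. A minimal sketch of that arithmetic, assuming the tool uses the plain ratio with no length normalization (the function name is illustrative, not taken from the tool):

```python
def type_token_ratio(types: int, tokens: int) -> float:
    """Return the type-token ratio as a percentage."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return 100.0 * types / tokens


# Figures taken from the summary above.
ttr = type_token_ratio(types=144_351, tokens=2_308_541)
print(f"{ttr:.2f}%")  # 6.25%
```

A raw ratio like this falls as a corpus grows, so it is only comparable between corpora of similar size.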

Word Frequency List

144,351 words
Rank  Word      Freq  Distribution  % Tokens  Per 1M
961   kuwaasi   283   0.01%         0.0123%   122.6
962   London    283   0.01%         0.0123%   122.6
963   bilowday  283   0.01%         0.0123%   122.6
964   tallaabo  283   0.01%         0.0123%   122.6
965   dowlad    282   0.01%         0.0122%   122.2
966   Dib       282   0.01%         0.0122%   122.2
967   ciidan    282   0.01%         0.0122%   122.2
968   xagga     282   0.01%         0.0122%   122.2
969   markaa    281   0.01%         0.0122%   121.7
970   waxana    281   0.01%         0.0122%   121.7
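
The "% Tokens" and "Per 1M" columns can be reproduced from the raw frequency and the corpus total of 2,308,541 tokens. The sketch below assumes both are simple proportions of the total token count; the function names are illustrative and not part of the tool:

```python
TOTAL_TOKENS = 2_308_541  # corpus size reported in the summary


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, in percent."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000 * freq / total


# Raw frequencies that appear in ranks 961 to 970.
for freq in (283, 282, 281):
    print(freq, f"{percent_of_tokens(freq):.4f}%", f"{per_million(freq):.1f}")
```

Normalizing to occurrences per million makes the frequencies comparable with corpora of a different size.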

Top 10 Words

Rank  Word  Freq
1     oo    62,045
2     ka    52,893
3     ku    46,423
4     ah    42,737
5     ay    42,498
6     ee    37,436
7     iyo   37,181
8     in    34,858
9     ayaa  29,666
10    uu    29,539
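
The top 10 counts can also be read as corpus coverage, that is, the share of all tokens these ten words account for. A short sketch using only the figures listed above (the variable names are illustrative):

```python
TOTAL_TOKENS = 2_308_541  # corpus size reported in the summary

# Top 10 words and their frequencies, as listed above.
top_10 = {
    "oo": 62_045, "ka": 52_893, "ku": 46_423, "ah": 42_737, "ay": 42_498,
    "ee": 37_436, "iyo": 37_181, "in": 34_858, "ayaa": 29_666, "uu": 29_539,
}

top_10_tokens = sum(top_10.values())
coverage = 100.0 * top_10_tokens / TOTAL_TOKENS
print(f"{top_10_tokens:,} tokens, {coverage:.2f}% of the corpus")
```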