Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
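
The type-token ratio shown above is consistent with dividing the number of unique types by the total number of tokens. A minimal sketch of that arithmetic, assuming this is how the page derives the figure (the function name `type_token_ratio` is hypothetical and not taken from the tool):

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return unique types divided by total tokens, as a fraction."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return unique_types / total_tokens


# Figures copied from the summary statistics above.
ttr = type_token_ratio(unique_types=144_351, total_tokens=2_308_541)
print(f"{ttr:.2%}")  # 6.25%
```

A ratio this low is typical of a large corpus, because a small set of function words accounts for much of the token count.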

Word Frequency List

144,351 words
Rank  Word          Freq  Distribution  % Tokens  Per 1M
621   Faransiiska   437   0.02%         0.0189%   189.3
622   ahna          436   0.02%         0.0189%   188.9
623   baarlamaanka  435   0.02%         0.0188%   188.4
624   xiriirka      431   0.02%         0.0187%   186.7
625   darro         430   0.02%         0.0186%   186.3
626   intaas        428   0.02%         0.0185%   185.4
627   heshiis       428   0.02%         0.0185%   185.4
628   sanno         428   0.02%         0.0185%   185.4
629   Norway        427   0.02%         0.0185%   185.0
630   Maxay         427   0.02%         0.0185%   185.0
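
The "% Tokens" and "Per 1M" columns both appear to be the raw frequency normalized by the corpus size of 2,308,541 tokens. The sketch below reproduces two rows under that assumption; `relative_frequency` is a hypothetical helper, not part of the tool. The Distribution column looks like the same share rounded to two decimal places, but that is an inference from the values shown.

```python
TOTAL_TOKENS = 2_308_541  # "Total Tokens" from the summary statistics


def relative_frequency(freq: int, total: int = TOTAL_TOKENS) -> tuple[float, float]:
    """Return (percent of all tokens, occurrences per million tokens)."""
    if total <= 0:
        raise ValueError("total must be positive")
    share = freq / total
    return share * 100, share * 1_000_000


# Rank 621, "Faransiiska", raw frequency 437.
pct, per_million = relative_frequency(437)
print(f"{pct:.4f}%  {per_million:.1f}")  # 0.0189%  189.3

# Rank 630, "Maxay", raw frequency 427.
pct_630, per_million_630 = relative_frequency(427)
print(f"{pct_630:.4f}%  {per_million_630:.1f}")  # 0.0185%  185.0
```

Per-million values make frequencies comparable across corpora of different sizes, which raw counts are not.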

Top 10 Words

oo    62,045
ka    52,893
ku    46,423
ah    42,737
ay    42,498
ee    37,436
iyo   37,181
in    34,858
ayaa  29,666
uu    29,539