Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
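
The Type-Token Ratio above is consistent with dividing Unique Types by Total Tokens and expressing the result as a percentage. The following is a minimal sketch of that arithmetic, assuming this standard definition; the function name type_token_ratio is hypothetical and not taken from the tool that produced these figures.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (assumed definition)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary above.
ttr = type_token_ratio(144_351, 2_308_541)
print(f"{ttr:.2f}%")  # prints 6.25%
```

A ratio this low is expected for a corpus of this size, because function words repeat far more often than new word types appear.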

Word Frequency List

144,351 words
Rank  Word         Freq  Distribution  % Tokens  Per 1M
571   1771873968   479   0.02%         0.0207%   207.5
572   Wax          478   0.02%         0.0207%   207.1
573   xigeenka     477   0.02%         0.0207%   206.6
574   biyaha       472   0.02%         0.0204%   204.5
575   aqoon        471   0.02%         0.0204%   204.0
576   adduunka     469   0.02%         0.0203%   203.2
577   Xuseen       468   0.02%         0.0203%   202.7
578   afar         468   0.02%         0.0203%   202.7
579   bilaabay     468   0.02%         0.0203%   202.7
580   Cabdullaahi  467   0.02%         0.0202%   202.3
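
The "% Tokens" and "Per 1M" columns agree with normalising each raw frequency by the corpus total of 2,308,541 tokens. The sketch below reproduces three rows under that assumption; the helper names per_million and percent_of_tokens are hypothetical and are used here only for illustration.

```python
TOTAL_TOKENS = 2_308_541  # Total Tokens from the summary above


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Occurrences per one million tokens (assumed normalisation)."""
    return freq * 1_000_000 / total


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens, as a percentage."""
    return 100.0 * freq / total


# Three rows copied from the frequency list.
for word, freq in [("Wax", 478), ("biyaha", 472), ("Cabdullaahi", 467)]:
    print(word, f"{percent_of_tokens(freq):.4f}%", f"{per_million(freq):.1f}")
# Wax 0.0207% 207.1
# biyaha 0.0204% 204.5
# Cabdullaahi 0.0202% 202.3
```

Per-million values make frequencies comparable across corpora of different sizes, which raw counts do not.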

Top 10 Words

Word  Freq
oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539