List of final codas in the ORAL2013 spoken corpus, together with their frequency characteristics.
type | coda | abs. freq. (tokens) | rel. freq. (tokens) | IPM (tokens) | abs. freq. (types) | rel. freq. (types) |
---|---|---|---|---|---|---|
C | 17 codas | 643 254 | 100,00% | 195 785,25 | 3 802 | 100,00% |
C | m | 174 899 | 27,19% | 53 233,47 | 615 | 16,18% |
C | k | 112 610 | 17,51% | 34 274,76 | 321 | 8,44% |
C | š | 78 482 | 12,20% | 23 887,33 | 289 | 7,60% |
C | t | 70 761 | 11,00% | 21 537,31 | 857 | 22,54% |
C | l | 50 355 | 7,83% | 15 326,40 | 567 | 14,91% |
C | n | 39 390 | 6,12% | 11 989,01 | 118 | 3,10% |
C | j | 29 927 | 4,65% | 9 108,79 | 325 | 8,55% |
C | x | 19 073 | 2,97% | 5 805,19 | 179 | 4,71% |
C | c | 18 662 | 2,90% | 5 680,10 | 80 | 2,10% |
C | ť | 17 183 | 2,67% | 5 229,94 | 50 | 1,32% |
C | s | 16 663 | 2,59% | 5 071,67 | 117 | 3,08% |
C | r | 4 532 | 0,70% | 1 379,39 | 99 | 2,60% |
C | č | 3 186 | 0,50% | 969,71 | 27 | 0,71% |
C | f | 2 655 | 0,41% | 808,09 | 54 | 1,42% |
C | ň | 2 526 | 0,39% | 768,83 | 30 | 0,79% |
C | p | 1 843 | 0,29% | 560,95 | 51 | 1,34% |
C | ř | 507 | 0,08% | 154,31 | 23 | 0,60% |
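
The relative frequencies in the table can be recomputed from the absolute counts. The following is a minimal sketch, assuming the Czech number formatting used here (space as thousands separator, comma as decimal mark); the helper names `parse_cz_number` and `relative_frequency` are hypothetical and not part of the source.

```python
def parse_cz_number(text: str) -> float:
    """Parse a Czech-formatted number such as '174 899', '53 233,47' or '27,19%'."""
    cleaned = text.strip().rstrip("%")
    # Drop thousands separators (plain or non-breaking spaces), then swap the decimal comma.
    cleaned = cleaned.replace("\u00a0", "").replace(" ", "").replace(",", ".")
    return float(cleaned)


def relative_frequency(count: float, total: float) -> float:
    """Share of `count` in `total`, as a percentage rounded to two decimals."""
    return round(100 * count / total, 2)


# Values copied from the single-consonant (C) table: the summary row and the row for "m".
total_tokens = parse_cz_number("643 254")
m_tokens = parse_cz_number("174 899")
total_types = parse_cz_number("3 802")
m_types = parse_cz_number("615")

print(relative_frequency(m_tokens, total_tokens))  # 27.19
print(relative_frequency(m_types, total_types))    # 16.18
```

Both results agree with the percentages listed for the coda "m".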

type | coda | abs. freq. (tokens) | rel. freq. (tokens) | IPM (tokens) | abs. freq. (types) | rel. freq. (types) |
---|---|---|---|---|---|---|
CC | 37 codas | 21 363 | 100,00% | 6 502,19 | 235 | 100,00% |
CC | st | 4 948 | 23,16% | 1 506,01 | 78 | 33,19% |
CC | kt | 4 853 | 22,72% | 1 477,09 | 8 | 3,40% |
CC | rt | 3 478 | 16,28% | 1 058,59 | 13 | 5,53% |
CC | ct | 2 093 | 9,80% | 637,04 | 19 | 8,09% |
CC | jt | 1 879 | 8,80% | 571,91 | 8 | 3,40% |
CC | jx | 1 089 | 5,10% | 331,46 | 31 | 13,19% |
CC | nt | 743 | 3,48% | 226,14 | 21 | 8,94% |
CC | tř | 307 | 1,44% | 93,44 | 4 | 1,70% |
CC | lm | 297 | 1,39% | 90,40 | 1 | 0,43% |
CC | jn | 275 | 1,29% | 83,70 | 1 | 0,43% |
CC | šť | 212 | 0,99% | 64,53 | 4 | 1,70% |
CC | lt | 187 | 0,88% | 56,92 | 4 | 1,70% |
CC | jc | 121 | 0,57% | 36,83 | 6 | 2,55% |
CC | ks | 85 | 0,40% | 25,87 | 5 | 2,13% |
CC | sk | 77 | 0,36% | 23,44 | 4 | 1,70% |
CC | jl | 68 | 0,32% | 20,70 | 1 | 0,43% |
CC | nk | 66 | 0,31% | 20,09 | 2 | 0,85% |
CC | rs | 61 | 0,29% | 18,57 | 1 | 0,43% |
CC | jš | 58 | 0,27% | 17,65 | 2 | 0,85% |
CC | kš | 56 | 0,26% | 17,04 | 2 | 0,85% |
CC | pt | 53 | 0,25% | 16,13 | 1 | 0,43% |
CC | rk | 52 | 0,24% | 15,83 | 2 | 0,85% |
CC | jť | 41 | 0,19% | 12,48 | 2 | 0,85% |
CC | nc | 38 | 0,18% | 11,57 | 1 | 0,43% |
CC | jk | 30 | 0,14% | 9,13 | 2 | 0,85% |
CC | ft | 30 | 0,14% | 9,13 | 1 | 0,43% |
CC | xt | 27 | 0,13% | 8,22 | 1 | 0,43% |
CC | mp | 23 | 0,11% | 7,00 | 1 | 0,43% |
CC | ns | 17 | 0,08% | 5,17 | 1 | 0,43% |
CC | mš | 16 | 0,07% | 4,87 | 1 | 0,43% |
CC | nč | 15 | 0,07% | 4,57 | 1 | 0,43% |
CC | rf | 13 | 0,06% | 3,96 | 1 | 0,43% |
CC | jm | 12 | 0,06% | 3,65 | 1 | 0,43% |
CC | js | 11 | 0,05% | 3,35 | 1 | 0,43% |
CC | lf | 11 | 0,05% | 3,35 | 1 | 0,43% |
CC | př | 11 | 0,05% | 3,35 | 1 | 0,43% |
CC | ls | 10 | 0,05% | 3,04 | 1 | 0,43% |

type | coda | abs. freq. (tokens) | rel. freq. (tokens) | IPM (tokens) | abs. freq. (types) | rel. freq. (types) |
---|---|---|---|---|---|---|
CCC | 2 codas | 62 | 100,00% | 18,87 | 2 | 100,00% |
CCC | kst | 51 | 82,26% | 15,52 | 1 | 50,00% |
CCC | rkt | 11 | 17,74% | 3,35 | 1 | 50,00% |
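
The IPM column appears to give token counts normalized per million units of a fixed base (IPM is commonly read as "instances per million"). The source does not state that base, so the sketch below infers it from the summary row of the single-consonant table and then recomputes the IPM values of the two CCC codas. The function names and the inferred base are assumptions made for illustration only.

```python
def infer_base(count: float, ipm: float) -> float:
    """Infer the normalization base from one (count, IPM) pair: base = count / IPM * 1e6."""
    return count / ipm * 1_000_000


def ipm(count: float, base: float) -> float:
    """Instances per million: count scaled to a base of one million units."""
    return count / base * 1_000_000


# Summary row of the C table: 643 254 tokens, IPM 195 785,25 (assumed to share
# the same base as every other row in the three tables).
base = infer_base(643_254, 195_785.25)

print(round(ipm(51, base), 2))  # 15.52, the value listed for "kst"
print(round(ipm(11, base), 2))  # 3.35, the value listed for "rkt"
```

Under this assumption the recomputed values match the table, which suggests that all three tables were normalized against the same base.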

© 2016 Filozofická Fakulta Univerzity Karlovy v Praze / Pavel Šturm, David Lukeš