(base) juku@jaagup:~/oma/23/03/kolmikud/v2$ wc *ees.txt 122 390 2343 A2_harv_osakaal_ees.txt 177 681 4259 B1_harv_osakaal_ees.txt 324 1762 11623 B2_harv_osakaal_ees.txt 470 2451 17540 C1_harv_osakaal_ees.txt 0 12752 131017 osakaalud_ees.txt 11855953 59586911 493539919 refcorp_harv_osakaal_ees.txt (base) juku@jaagup:~/oma/23/03/kolmikud/v2$ wc *taga.txt 206 648 3890 A2_harv_osakaal_taga.txt 213 794 4936 B1_harv_osakaal_taga.txt 341 1886 12189 B2_harv_osakaal_taga.txt 313 1615 11344 C1_harv_osakaal_taga.txt 0 12752 131131 osakaalud_taga.txt 11027967 55877895 462125296 refcorp_harv_osakaal_taga.txt 11029040 55895590 462288786 total (base) juku@jaagup:~/oma/23/03/kolmikud/v2$ wc /mnt/c/jaagup/23/04/keel/tasemetekstid/*.txt 495 3522 18316 /mnt/c/jaagup/23/04/keel/tasemetekstid/A2.txt 504 4727 26143 /mnt/c/jaagup/23/04/keel/tasemetekstid/B1.txt 535 6881 39833 /mnt/c/jaagup/23/04/keel/tasemetekstid/B2.txt 495 7456 49797 /mnt/c/jaagup/23/04/keel/tasemetekstid/C1.txt 2029 22586 134089 total (base) juku@jaagup:~/oma/23/03/kolmikud/v2$ wc /mnt/c/jaagup/22/korpused/etnc19_reference_corpus_clean.txt 13173122 180944778 1409832880 /mnt/c/jaagup/22/korpused/etnc19_reference_corpus_clean.txt refcorp: ees haruldasi: 0,90 rida korpuse rea kohta, ca 22,5% ridadest haruldase D-eeskontekstiga taga haruldasi 0,837 -> 20,9% A2 ees: 122/495=0,24 -> 6,2% taga: 206/495=0,42 -> 10,4% B1 ees: 177/504=0,35 -> 8,8% taga: 213/504=0,40 -> 10,0% B2 ees: 324/535=0,61 -> 15,1% taga: 341/535=0,64 -> 15,9% C1 ees: 470/495=0,95 -> 23,7% taga: 313/495=0,63 -> 15,8%