加拿大國歌的歌詞最初是用法文寫的。 - Çînkîya Mandarînî cumleya nimûne

keyboard_arrow_left

verên

raştameye

dimayên

keyboard_arrow_right

Ziwan seba cumleya verêne, dimayêne yan zî raştameyîye

warning

Cumleya şima nêamê îlawekerdene çunke cêrênî xora estê.

star Na cumle aîdê qiseykerdoxêko/a ziwanê dayîke ya.

warning

Na cumle muteber nîya.

content_copy

Cumle kopya bike

info

Şo rîpelê cumle

subdirectory_arrow_right

warning

Açarnayîşî

Lînkê nê açarnayîşî wedarne

link

Bike açarnayîşo raşteraşt

chevron_right

Cumleya mewcûde #{{::translation.id}} sey açarnayîşêk amê îlawekerdene.

edit

Nê açarnayîşî pergal bike

warning

Na cumle muteber nîya.

content_copy

Cumle kopya bike

info

Şo rîpelê cumle

subdirectory_arrow_right

warning

Açarnayîşanê açarnayîşan

Lînkê nê açarnayîşî wedarne

link

Bike açarnayîşo raşteraşt

chevron_right

Cumleya mewcûde #{{::translation.id}} sey açarnayîşêk amê îlawekerdene.

edit

Nê açarnayîşî pergal bike

warning

Na cumle muteber nîya.

content_copy

Cumle kopya bike

info

Şo rîpelê cumle

subdirectory_arrow_right

warning

{{vm.sentence.expandLabel}} Hîna tay açarnayîşî

sysko September 6, 2011 September 6, 2011 at 1:42:52 PM UTC

flag

Report

link

Lînko payîdar

Note to myself: here the system has cut in 用法文 instead of 用法文

nickyeow September 6, 2011 September 6, 2011 at 1:57:40 PM UTC

flag

Report

link

Lînko payîdar

Just out of curiosity, will it be possible to manually fix this kind of misinterpretation in the future?

sysko September 6, 2011 September 6, 2011 at 9:36:13 PM UTC

flag

Report

link

Lînko payîdar

yup, it will be, I already know how to code it and so, just a question of time now:)

Actually here the problem is that I choose a quick and dirty way to split Chinese sentences into words, the software read from left to right and try to find the longest string it knows, and then continue etc. etc.

When I will have some free time I will replace that by something smarter, based on statistics, so that it will know that 用法 + 文 is far less probable than 用 + 法文

and eventually one day (we're working on that) have something even smarter based on sentence pattern and grammatical class of words. (and still stat)

Anyway tatoeba here is already great "real world" test for this kind of software :)