Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on nickyeow's sentences
Wall messages
Logs
Audio
Transcriptions
Translate nickyeow's sentences
I see. Thanks for the detailed explanation!
Am I missing something, or are Cantonese transliterations currently uneditable? Also, what can I do to change the default transliterations of certain words or characters? Some of them are wrong (or highly unusual)...
Agreed!
On a side note, it would be nice if pronunciations of compounds could be displayed with spaces in between the individual syllables. 'zung6 jiu3' looks much better than 'zung6jiu3' in my opinion.
Wow, thank you so much for taking on this issue!
I think the autogenerated transcriptions for Cantonese are doing okay. There are mistakes here and there, but most of them are caused by a small set of characters with multiple pronunciations. Many of these can be solved by adding more pronunciations for compounds. I'd say the percentage of sentences with completely correct transliterations is around 90% to 95%.
Some characters can be quite tricky though. For instance, the final particle 喎 is pronounced wo3 when it indicates a casual remark, wo4 when it indicates a sort of playful scolding, and wo5 when it is used to quote something undesirable. Apparently, it can also be pronounced kwaa1, waa1, and wo1, although these are extremely rare. To be fair, I only found out about these (I blush to confess) when I looked them up in a dictionary. :p
I think the review system you suggested would be the best way to solve the problem. Perhaps autogenerated transcriptions could still be displayed—they are correct most of the time after all—but it would be nice if reviewed transcriptions could be given a little green tick or something.
Thanks for the explanation! It would be great if the name could be changed back to 'Hong Kong'.
Why was 'Hong Kong' on the profile page renamed 'Hong Kong SAR China'...?
I think this has been discussed before, but is it possible to make the romanisation of Cantonese sentences editable?
It is simply impossible for computer-generated romanisation to be 100% accurate. To illustrate the problem, the final particle 呀 can be pronounced as aa1, aa3, aa4 and aa6 depending on the tone you want to convey...
Worked like magic. Thanks! ;-)
Hi,
This might be a rather selfish request, but I'm wondering if it would be possible to add in an option to turn off the furigana in Japanese sentences? I'm not saying they're not useful (they are!) but sometimes they do clutter up the page a bit.
Thanks! :-)
Perhaps we should replace the underscores in these tags with spaces?
[verb_of_motion]
[day_of_the_week]
That's great :-) !
I’ve noticed that when you hover your cursor onto a tag, the ID number of the tagger is displayed — I’m not sure if this has been discussed before, but wouldn’t it make more sense to display the tagger’s username instead?
Perfect. Thanks!
I see, I’ll start building the list now. ;-)
By the way sysko, have you taken care of the duplicate words? It seems that 公 is still gung4, 度 is still dok6, 生日 is still sang1jat6, etc.
Actually, what I was trying to do with the list was to only include compound words that have pronunciations different from the “default” ones. In the case of 英國人 as you mentioned above, I didn’t include it in the list because its pronunciation is exactly the same as the “default” pronunciation:
Default pronunciation of 英 = jing1
Default pronunciation of 國 = gwok3
Default pronunciation of 人 = jan4
Pronunciation of 英國人 = jing1 gwok3 jan4
However, when the pronunciation of a compound word is different from the “default” pronunciation, it is included in the list. For example:
Default pronunciation of 英 = jing1
Default pronunciation of 文 = man4
Pronunciation of 英文 = jing1 *man2*
If you think it’s better to also include words like 英國人, I’ll go through the Cantonese sentences again and make another list of pronunciations. ^^
Just to clarify one point: most of those pronunciations are not wrong, but uncommon. For example, both dou6 and dok6 are actually possible pronunciations of 度, but in about 95% of the cases the pronunciation dou6 is used, so I figured it might be more convenient if we set the “default” pronunciation to be dou6.
And the same applies to all the other words, so yes, please replace the old pronunciations with the ones on my list :-)
Hmm... it looks like only a part of the words has been imported successfully — for example, 公 is still romanized as gung4 instead of gung1, 度 is still romanized as dok6 instead of dou6, 生日 is still romanized as sang1jat6 instead of saang1jat6, etc.
Thanks!
@sysko, I’ve finished proofreading all the Cantonese sentences and here is the pronunciation list:
http://db.tt/k3Im2Yc
Obviously, it doesn’t contain every single word in the Cantonese language, but it should be able to fix most of the incorrect romanizations on Tatoeba :-)
你好!我也會說中文 :-)