Menu
erm, why is it that whenever I write something in Punjabi (gurmukhi) the punctuation mark shifts to the beginning of the sentence? Is it because Tatoeba thinks the text is in Shahmukhi, which is a RTL script? If so, can Admins kindly look into it?
Ta!
Welcome back, Radium. I and others noticed this problem. The solution was to split Punjabi into Eastern and Western versions (which is how ISO 639-3 does it). Eastern, or Indian Punjabi will be the one you'll likely use, with the Gurmukhi script. This change has been pending for a long while now (most probably due to a lack of regular native Punjabi contributors).
You can still add sentences, but if you do, add your sentences to either the Western Punjabi or Eastern Punjabi lists after adding them. These lists will be used when splitting the language later.
I looked quickly through the sentences, and since I do not know the language, I would need to know in which case you put the punctuation at the wrong place to make it display at the right place.
Here are 2 sentences where the question mark is not at the same place.
http://tatoeba.org/eng/sentences/show/3788613
--> http://prntscr.com/5vvhv8
http://tatoeba.org/eng/sentences/show/3786275
--> http://prntscr.com/5vviny
Anyway I'd suggest you write your sentences as they normally should, without trying to hack the position of the punctuation. We'll see how we can fix it on our side.
TRANG: The first sentence is the one without the hack i.e it's written the way it's supposed to be written, with punctuation mark at the end of the sentence (at the right side).
Someone else on here also advised that I keep on translating without manually fixing the punctuation, and that is what I'm doing now.
CK : I've tried manual select multiple times but that doesn't seem to have any effect on the output. I strongly suspect tatoeba code is designed to treat Punjabi text inputs as Shahmukhi by default, which is a RTL script while Gurmukhi is a LTR script, hence the mix-up. It would be nice if a fix can be implemented whereby one gets to choose which particular script to use.. If not, That's OK too so long as it can treat Gurmukhi as LTR :)
Okay so I had misunderstood your problem. I thought you were writing with a script that is supposed to be RTL, but some weird bug caused the punctuation to be misplaced. But in fact it's just that the script you are using is a LTR script.
Tommy_san reminded me of this discussion:
http://tatoeba.org/eng/wall/show_message/17381
If most of the Punjabi sentences in Tatoeba are written with Gurmukhi, then we will change the language code to "pan" and specify the language name to be "Punjabi (Eastern)". The sentences added to this language should then use the Gurmukhi, and they will be displayed left to right.
We will add "Punjabi (Western)" later on, if there are contributors for it.
Does this sound okay?
Note that the direction of the language is not influenced by the way you choose the language. We have a list of language code for which the sentence text should be displayed right to left, and "pnb" was in that list.
https://github.com/Tatoeba/tato...uages.php#L518
I believe that RadiumCat can also contribute in Western Punjabi. If that's the case, I'll add Western Punjabi at the same time, and reassign the sentence languages according to the lists.
Trang: Sounds great to me! :)
I primarily use Gurmukhi script for working with Punjabi sentences but as AlanF_US has rightly guessed in the comment below, I'm also familiar with Shahmukhi, along with major Western (and Eastern) Punjabi dialects so I think I do have the capacity to contribute using Shahmukhi every once in a while. I don't know if you feel that is good enough to add another language code though!
Eastern Punjabi has been added :)
Please check the sentences to fix the punctuation where needed:
http://tatoeba.org/eng/sentence...ne/indifferent
Thanks for the fix, Trang! :)
I've made all the necessary changes, I think :)
See http://tatoeba.org/wall/show_message/17381 for earlier discussions.
It seems that #1754557 is an example of a sentence written with the RTL script Shahmukhi.
Ah right, there was this discussion. Thanks lot for refreshing my memory.