menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
gillux {{ icon }} keyboard_arrow_right

Profile

keyboard_arrow_right

Sentences

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Favorites

keyboard_arrow_right

Comments

keyboard_arrow_right

Comments on gillux's sentences

keyboard_arrow_right

Wall messages

keyboard_arrow_right

Logs

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate gillux's sentences

gillux's messages on the Wall (total 595)

gillux gillux June 23, 2015 June 23, 2015 at 10:25:44 AM UTC link Permalink

We should probably update the wiki, but in short:
• break → matches break and other words that have the same stem in each language that support stemming, and just break for languages that doesn’t support stemming. For instance, in English sentences it will match break, breaks, breaking, in French it will match break, breaks (it’s a car type), in Esperanto it will only match break because Esperanto doesn’t have a stemmer.
• =break → matches any sentence that have the word break exactly, regardless of stemming
• break* → matches any word starting with break (including break), like breakfast, regardless of stemming

gillux gillux June 23, 2015 June 23, 2015 at 12:38:27 AM UTC link Permalink

Note that you need to provide at least three characters before or after the star. Which means th*, *th or *th* won’t work but the*, *the or *the* will.

gillux gillux June 22, 2015 June 22, 2015 at 3:13:59 AM UTC link Permalink

Use minus, just like in Google.

https://tatoeba.org/sentences/s...rom=eng&to=und

gillux gillux June 21, 2015 June 21, 2015 at 10:16:46 AM UTC link Permalink

This delay will also allow us to translate the interface. I updated the resource on Transifex, there are a few new strings to translate.

gillux gillux June 21, 2015, edited June 21, 2015 June 21, 2015 at 4:14:44 AM UTC, edited June 21, 2015 at 4:15:09 AM UTC link Permalink

Yes, and I think it’s probably good enough. All recent dates are reported in a “x ago” form, while only the more older ones (older than 30 days) are displayed as a date, in which case the precise hour and minutes are likely to be unimportant.

gillux gillux June 21, 2015 June 21, 2015 at 2:22:03 AM UTC link Permalink

I added an “empty trash” function to remove all the messages from the trash.

gillux gillux June 20, 2015 June 20, 2015 at 2:43:14 PM UTC link Permalink

What do you mean by “search history”? If you mean the previously submitted queries, this is a browser-specific thing I don’t think we can mess with in any way.

gillux gillux June 20, 2015 June 20, 2015 at 2:39:54 PM UTC link Permalink

One way to solve the slowness problem is to exclusively rely on Sphinx to sort out sentences. My recent progresses in that field make be believe it’s a workable solution.

gillux gillux June 20, 2015 June 20, 2015 at 9:11:37 AM UTC link Permalink

How about that? http://dev.tatoeba.org/

gillux gillux June 20, 2015 June 20, 2015 at 8:25:20 AM UTC link Permalink

Do you think this button should reset the “from” and “to” lists to “Any” as well?

gillux gillux June 20, 2015 June 20, 2015 at 7:03:31 AM UTC link Permalink

Go to https://dev.tatoeba.org/user/settings and try changing the new “Number of sentences per page” setting.

gillux gillux June 20, 2015 June 20, 2015 at 5:14:25 AM UTC link Permalink

I’m working on it (ticket: https://github.com/Tatoeba/tatoeba2/issues/30).

gillux gillux June 14, 2015 June 14, 2015 at 4:39:09 PM UTC link Permalink

The wiki is going to be shut down for maintenance during a few hours. Sorry for the inconvenience!

gillux gillux June 14, 2015 June 14, 2015 at 3:24:20 PM UTC link Permalink

> After going through your post I installed a plugin in Chrome and since then it has been showing the beautiful Nastaleeq script. So thank you for that.
> However I personally believe that websites should render the Nastaleeq script automatically as not every Urdu speaker is technically sound enough to know how they can get the script.

Of course. Actually, the problem comes from your OS. It should ask and allow you to install an Urdu font so that Urdu could be displayed correctly, not only in browsers but in any graphical interface of your system, like text editors, menus, file names etc. By installing a plugin, you’re fixing Chrome but the rest of your system is still Urdu-unaware. We Tatoeba can try to help members with broken OSes by providing web fonts, but the root of problem is the OS. That’s why the author of the blog post you mentioned first asked Microsoft, Apple and Google.

gillux gillux June 14, 2015 June 14, 2015 at 3:09:02 PM UTC link Permalink

I updated the ticket.

What you can do is to advice us the best web font for Urdu. By “best”, I mean one that is free to use, that most people won’t find hard to read and that is accurate. I insist on the accuracy because Urdu seems quite complex to render, and fonts like Google Noto still has problems [1]. It would be best to get a native’s acknowledgement (like you ☺) regarding accuracy (and readability) before deploying a font. Also, note that not any font can be used as a webfont. It needs to be size-optimized and available in multiple formats. In other words, look for web fonts, not just fonts.

[1] https://github.com/googlei18n/n...Aopen+Nastaliq

gillux gillux June 13, 2015, edited June 13, 2015 June 13, 2015 at 8:29:40 PM UTC, edited June 13, 2015 at 8:30:02 PM UTC link Permalink

I don’t think there is much Tatoeba can do to “implement” Nasta'liq script. In fact, it seems to already work. Technically speaking, Nasta'liq and Naskh use the same Unicode codepoints but are represented differently. Which means the existing Urdu sentences are not wrong, they are just displayed in Naskh instead of Nasta'liq on most systems as a result of a lack of support for Nasta'liq.

It’s like you’d need a different font for French and Italian just because French should be written in cursive and Italian in block letters (of course this is not true, it’s just an example). So every time you’re on a French website, your browser would need to automatically select a cursive font, and switch to a block letters one when displaying Italian. This is only possible if the browser is aware of the language of the content. Fortunately, we already provide language information on Tatoeba (using the lang attribute) for sentences in every language, including Urdu. So, using browsers that handle this correctly, on systems that have Nasta'liq fonts installed and are configured to use them when displaying Urdu, Urdu sentences are automatically displayed in Nasta'liq.

I installed Nasta'liq fonts on my system and configured it appropriately, the result seems good: https://i.imgur.com/oQActxn.png

gillux gillux June 11, 2015 June 11, 2015 at 7:37:48 PM UTC link Permalink

Merci !

gillux gillux June 10, 2015 June 10, 2015 at 8:45:20 AM UTC link Permalink

I’m not a corpus maintainer, still I want to have access to such feature, and I’m pretty sure I’m not the only one.

gillux gillux June 9, 2015 June 9, 2015 at 7:48:24 PM UTC link Permalink

This makes me think we probably need some sort of highlighting for invisible characters like zero-width spaces or non-breaking spaces, so that we at least know where they are being used. We don’t want them to be highlighted normally, otherwise it would defeat their purpose, but we may have something that triggers highlighting or some visual information. I have no idea about how that could fit in the UI though, I’m sure nobody wants yet another button above sentences.

gillux gillux June 9, 2015 June 9, 2015 at 7:24:24 PM UTC link Permalink

I think we should restore the complete list of languages and put the profile languages on the top, the same way personal lists appears on the top when you click on the “Add to list” icon of a sentence.