menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
sharptoothed {{ icon }} keyboard_arrow_right

Profile

keyboard_arrow_right

Sentences

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Favorites

keyboard_arrow_right

Comments

keyboard_arrow_right

Comments on sharptoothed's sentences

keyboard_arrow_right

Wall messages

keyboard_arrow_right

Logs

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate sharptoothed's sentences

sharptoothed's messages on the Wall (total 669)

sharptoothed sharptoothed February 15, 2013 February 15, 2013 at 1:52:25 PM UTC link Permalink

Usually, a sentence with lesser number is the original with the exception of sentences with numbers up to 329712 approximately that originate from Tanaka Corpus (English - Japanese pairs).

sharptoothed sharptoothed February 15, 2013 February 15, 2013 at 7:52:41 AM UTC link Permalink

Hi, Ramin!
Welcome to the Tatoeba Project!

sharptoothed sharptoothed February 14, 2013 February 14, 2013 at 5:22:11 PM UTC link Permalink

I think, it's not a good idea to keep such things here. At least, not as "translation" for original sentences. I hope that metadata mechanism will be implemented eventually and then we'll be able to add transliterations, transcriptions, etc. without poisoning the main database.

sharptoothed sharptoothed February 7, 2013 February 7, 2013 at 6:21:40 PM UTC link Permalink

As far as I know, the sentences have to be indexed by the search engine before it becomes possible to find them via Tatoeba search tool. I suspect that indexing process just wasn't being ran for some time. Maybe due to performance issues Tatoeba experiencing last time.

sharptoothed sharptoothed February 2, 2013 February 2, 2013 at 5:36:42 PM UTC link Permalink

Try to open audio links in separate tab/window.

sharptoothed sharptoothed February 1, 2013 February 1, 2013 at 9:40:00 PM UTC link Permalink

お誕生日おめでとう!^^

sharptoothed sharptoothed February 1, 2013 February 1, 2013 at 6:50:53 AM UTC link Permalink

Добро пожаловать на Татоэбу!
Welcome to Tatoeba!

You've chosen the right place, indeed! Tatoeba is, maybe, not the best site to learn languages but definitely one of the very best to find natural and actual sentences in many languages.
Good luck! :-)

sharptoothed sharptoothed January 16, 2013 January 16, 2013 at 5:31:44 PM UTC link Permalink

Добро пожаловать на Татоэбу! :-)
Новичкам здесь всегда рады, а что касается нового - по части иностранных языков тут этого хоть отбавляй. :-)

sharptoothed sharptoothed January 11, 2013 January 11, 2013 at 7:14:29 AM UTC link Permalink

おめでとうございます!その調子で続けてください。^^

sharptoothed sharptoothed January 10, 2013 January 10, 2013 at 2:34:59 PM UTC link Permalink

That would be really nice. The era of new Tatoeba looks promising, indeed. :-)

sharptoothed sharptoothed January 10, 2013 January 10, 2013 at 12:18:02 PM UTC link Permalink

> Isn't information on each word the task of word dictionaries ?

Essentially, yes. So Japanese indices on Tatoeba provide just the dictionary form of words and the sense number in JMDict/EDICT but not the meanings. For beginners it often hard to determine the dictionary form of a conjugated verb or adjective and, thus, to find its meaning in dictionary. Furigana doesn't help much with this issue, it only solves the Kanji reading and word tokenizing problems.

sharptoothed sharptoothed January 10, 2013 January 10, 2013 at 11:03:27 AM UTC link Permalink

The main purpose of Japanese indices (as I imagine it for myself) is to provide user with comprehensive information on each word in the sentence, including its reading, dictionary form and a reference to its meaning (sense) in current context. This is truly invaluable thing for learners.

sharptoothed sharptoothed January 10, 2013 January 10, 2013 at 7:18:33 AM UTC link Permalink

At the moment, it seems that Japanese indices are not maintained at all. The reason is obvious: this job not only requires good command in Japanese but also very time consuming. Additionally, there are no any tools to do it on Tatoeba. Sysko said he's going to implement furigana editing features, though. I think, it could be a good start to implement Japanese indices tools, too.

sharptoothed sharptoothed January 10, 2013 January 10, 2013 at 6:54:50 AM UTC link Permalink

> It seems that per-language font specifications need not be implemented wholesale, but could be instituted one language at a time.

That's correct. CSS provides enough means to implement per-language styles including custom fonts that will we downloaded on the fly (via '@font-face') and all modern (this is the keyword) browsers seem to provide adequate support for these features.

sharptoothed sharptoothed January 9, 2013 January 9, 2013 at 6:01:45 PM UTC link Permalink

I've heard that font substitution add-ons exist for both Firefox and Chrome browsers though I've never used them.

sharptoothed sharptoothed January 9, 2013 January 9, 2013 at 5:40:23 PM UTC link Permalink

> Could you please confirm that if I filter the viewed languages, the download time would be affected only by the viewed languages ?

I haven't investigated this question that close, but, according to some reports (sorry, I haven't an URL at hand), Firefox and Chrome browsers won't download any font defined in CSS via '@font-face' unless they found a reference to it (explicitly via 'style' attribute or implicitly or via 'class' attribute) inside the page HTML code. That is, if you filter all unnecessary languages, the fonts for them won't be downloaded either. On the other hand, according to the same reports, other browsers, IE in particular, seem to download all fonts defined in CSS no matter if they used on the page or not.

sharptoothed sharptoothed January 9, 2013 January 9, 2013 at 4:05:03 PM UTC link Permalink

> How can the CSS "guess" what language it's processing ?

This is possible via 'lang' attribute. Tatoeba already has information about the sentence language so each sentence may be provided with appropriate 'lang' attribute. There's also 'lang' selector in CSS so it's possible to select and set different styles for different languages.

sharptoothed sharptoothed January 9, 2013 January 9, 2013 at 2:49:41 PM UTC link Permalink

---- About fonts on Tatoeba ----

From my personal experience, there's still no ultimate solution for multilingual sites that would provide an acceptable level of usability while staying lightweight and effective. That is, any solution we can invent at the moment will be a compromise solution somehow or other. I see several approaches and each of them has its own pros and contras.
1) Remove all 'font-family' specifications from Tatoeba CSS files.
Extremely straightforward solution. The whole site will be displayed using default proportional font specified in browser configuration. This may cause a disruption of Tatoeba visual design depending on particular font selection. To decrease the impact of this it's possible to work out a set of recommendations for Tatoeba users about their browsers customization, font getting and installing processes, etc. We also can count that many users already have their browsers and OSes optimized for different languages though it may seem like catching at a straw.
2) Use only 'serif' and 'sans-serif' in 'font-family' specifications.
This solution is almost the same as the one above but allows to use different typesets and potentially more design style friendly.
3) Use the most “Web safe” fonts, such as Arial / Helvetica for sans-serif and Times New Roman / Times for sans providing fallbacks to sans-serif and serif typesets accordingly.
This solution allows to count on particular font's look and feel but the result will be highly dependent on user OS type and localization since different versions of fonts may have different number of character sets and glyphs supported. But, apparently, for many localizations this solution will degenerate to the one above.
4) Implement 'lang' property support in CSS and provide different font-family for different languages. For the most complex/exotic cases provide optimized fonts via '@font-face' CSS directive.
Maybe the most elegant solution but requires a good deal of work that will include reworking Tatoeba engine, collecting information about best practices in font selection for different languages, investigating the font availability in different OSes and localizations, getting and optimizing free national fonts, etc. Another drawback is that usage of custom fonts may dramatically increase page download time.

I suspect that the truth is somewhere in the middle but hope my considerations will help. :-)

sharptoothed sharptoothed January 9, 2013 January 9, 2013 at 6:58:27 AM UTC link Permalink

Thanks for pointing out, sysko! I suspected that something is wrong :-)
Unfortunately, it seems that there's no separate HTML entity for Narrow No-Break Space so we have to use its HTML code instead ( ) and pray that browser display it right.
I've updated the test page.
http://jplangtools.com/tatoeba/thinspace.html

sharptoothed sharptoothed January 8, 2013 January 8, 2013 at 12:27:18 PM UTC link Permalink

> You are using justified text, aren't you?

ummm, nope, no justified alignment is used. English example just contains different type of spaces. :-)