Wall (6,756 threads)

Tips

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Nuel2, March 15, 2023 at 1:18:42 PM UTC

Hello, it's me Nuel.

I had to change my e-mail address and create a new account.

How can I get my sentences and my user name back?

I'd also like to ask an admin if they can make me "Advanced contributor" again.

I need your help!

Thanks.

Pfirsichbaeumchen, March 16, 2023 at 2:44:26 AM UTC

The matter is being taken care of.

Aiji, March 15, 2023 at 6:02:34 AM UTC (edited March 15, 2023 at 6:03:11 AM UTC)

✨ Sentences with indirect translations and no direct translations ✨
Original thread: https://tatoeba.org/fr/wall/sho...#message_39021

Excerpt from the original message:
> I’ve made lists of sentences in language A having indirect translations but no
> direct translations in language B. I think several people might find these lists
> quite useful.
> For ease of use, the names of all lists follow the same pattern:
> Indirect translations ISO1 → ISO2
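
As an illustration of what such a list contains, here is a minimal sketch that selects sentences in language A whose only route to language B goes through an intermediate sentence. The data layout (a dict of sentence IDs to language codes, plus a list of link pairs) and the function name `indirect_only` are assumptions made for this example, not the way the lists were actually built.

```python
from collections import defaultdict

def indirect_only(sentences, links, lang_a, lang_b):
    """Return IDs of lang_a sentences that reach lang_b only via an intermediate sentence.

    sentences: dict mapping sentence ID -> language code (hypothetical layout)
    links: iterable of (id1, id2) pairs, treated as undirected direct translations
    """
    neighbours = defaultdict(set)
    for a, b in links:
        neighbours[a].add(b)
        neighbours[b].add(a)

    result = []
    for sid, lang in sentences.items():
        if lang != lang_a:
            continue
        direct = neighbours[sid]
        if any(sentences[n] == lang_b for n in direct):
            continue  # already has a direct translation in lang_b
        # Collect translations of translations (two hops away).
        two_hops = set()
        for n in direct:
            two_hops |= neighbours[n]
        two_hops.discard(sid)
        if any(sentences[n] == lang_b for n in two_hops):
            result.append(sid)
    return sorted(result)

# Made-up data: 1 (fra) -> 2 (eng) -> 3 (jpn) is indirect; 4 (fra) -> 5 (jpn) is direct.
sentences = {1: "fra", 2: "eng", 3: "jpn", 4: "fra", 5: "jpn"}
links = [(1, 2), (2, 3), (4, 5)]
print(indirect_only(sentences, links, "fra", "jpn"))  # [1]
```

Sentence 4 is left out because it already has a direct Japanese translation, which is exactly the case the lists are meant to exclude.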

After some months of experimenting, I modified the content of those lists so that:
1. The lists contain only sentences by native speakers of language A.
2. The lists contain a block of consecutive sentences, taken from the middle of the corpus (so neither too old nor too recent).

These small modifications allowed me to be much more efficient at "fast-linking".
The first point lets me spend very little time checking the validity of the original sentence.
The second point avoids the "This sentence, again?!" effect when facing sentences that differ only in number, gender or abbreviation (I have been, I've been, etc.). These sentences are often added at the same time (often as a result of translating), so having them displayed together in the search results makes it possible to quickly link translations to all of them, just by copy-pasting translations that already exist.

My personal way of using these lists is the following: https://tatoeba.org/fr/sentence...roved=no&user=
I order the results by "Last created first" and go from the last page to the first page. That way, sentences I have ignored do not get in my way again afterwards.

You can see all the available pairs of languages in my profile: https://tatoeba.org/fr/sentence...s/of_user/Aiji
If you'd like me to add a language pair, please let me know.

I hope some people will make good use of them!

CK, March 15, 2023 at 2:12:58 AM UTC

New Audio Contributors

audio - rus - by retr0ra1n
https://tatoeba.org/en/sentence...how/171207/und

audio - fra - by Adrien_FR
https://tatoeba.org/en/sentence...how/171243/und

sharptoothed, March 12, 2023 at 8:10:56 AM UTC

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

DostKaplan, February 23, 2023 at 11:03:50 PM UTC

Either there are no English sentences containing "culturally-rich", or the pseudo-regexp "*-rich" (double quotes included) is unable to extract such a text pattern.

AlanF_US, February 24, 2023 at 2:58:30 AM UTC

There is one sentence that contains the phrase "culturally-rich". You can find it by searching for "culturally rich" (double quotes included):

https://tatoeba.org/en/sentence...ly+rich%22&to=

All punctuation contained in sentences (including hyphens) is thrown away when the sentences are indexed, so there's no way to specifically find sentences that include that punctuation. Furthermore, some punctuation marks have special significance for the search engine, so including punctuation in your search terms can actually prevent you from finding what you're looking for. Punctuation marks with special significance for the search engine are described on the following page:

https://en.wiki.tatoeba.org/art...w/text-search#
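
To make the effect of indexing concrete, here is a toy sketch. It is not Tatoeba's actual indexer: the tokenizer `index_tokens` and the matcher `phrase_match` are hypothetical stand-ins that simply keep runs of word characters. Under that assumption, the hyphenated and unhyphenated spellings produce identical token lists, so the index alone cannot tell them apart.

```python
import re

def index_tokens(sentence):
    """Toy indexer: lowercase the text and keep only runs of word characters."""
    return re.findall(r"\w+", sentence.lower())

def phrase_match(tokens, phrase):
    """Return True if the phrase's tokens occur consecutively in `tokens`."""
    wanted = index_tokens(phrase)
    if not wanted:
        return False
    n = len(wanted)
    return any(tokens[i:i + n] == wanted for i in range(len(tokens) - n + 1))

with_hyphen = index_tokens("This is a culturally-rich city.")
without_hyphen = index_tokens("This is a culturally rich city.")

print(with_hyphen)                                   # ['this', 'is', 'a', 'culturally', 'rich', 'city']
print(with_hyphen == without_hyphen)                 # True
print(phrase_match(with_hyphen, "culturally rich"))  # True
```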

DostKaplan, February 24, 2023 at 1:32:04 PM UTC (edited February 24, 2023 at 1:34:32 PM UTC)

Even if all punctuation is thrown away before indexing, it would be nice if an escaped hyphen in the search pattern were recognized as a literal hyphen and the returned results were further filtered, based on the search pattern ("*\-rich"), to return only those that DO contain a hyphen immediately before "rich".
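
The two-step idea described here could look roughly like the sketch below. It assumes the original sentence text is still stored alongside the index, and it uses a plain list scan as a stand-in for the index lookup; the function name `search_with_literal_hyphen` and the sample sentences are made up for illustration.

```python
import re

def search_with_literal_hyphen(sentences, word):
    """Find sentences in which `word` is immediately preceded by a hyphen."""
    # Step 1: stand-in for the index lookup, which cannot see punctuation.
    candidates = [s for s in sentences if word in re.findall(r"\w+", s.lower())]
    # Step 2: re-check the stored original text for the literal hyphen.
    pattern = re.compile(r"\w-" + re.escape(word) + r"\b", re.IGNORECASE)
    return [s for s in candidates if pattern.search(s)]

corpus = [
    "This is a culturally-rich city.",
    "He is culturally rich.",
    "The super-rich pay little tax.",
    "Enrich your vocabulary.",
]
for hit in search_with_literal_hyphen(corpus, "rich"):
    print(hit)
```

The second step works on the raw text rather than the index, so its cost grows with the number of candidates returned by the first step.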

AlanF_US, February 24, 2023 at 7:25:39 PM UTC

The fact that the punctuation is thrown away when the words in a sentence are indexed means there's no longer any sign that the punctuation was there. So no search can distinguish between sentences that have punctuation and those that don't. It would be like storing all the content in uppercase when the sentence is indexed, then trying to find a particular lowercase letter. It can't be done.

Objectivesea, March 11, 2023 at 3:25:11 AM UTC

This is probably a little off the point under discussion, but the form "culturally rich" and similar adverb-adjective combinations should not include a hyphen in English. On the other hand, an adjective-adjective combination like "super-rich" is correctly hyphenated.

AlanF_US, March 11, 2023 at 5:27:33 PM UTC

I agree.

gillux, February 26, 2023 at 10:24:13 AM UTC (edited February 27, 2023 at 6:46:40 AM UTC)

As Alan explained, it is not possible to find words specifically containing hyphens. However, there is a way to tune the search engine to allow that. Anybody is welcome to open an issue on GitHub to ask for it.

We've done such tuning in the past to allow searching for question marks, because a lot of users were confused by what happens when you search for a question (https://github.com/Tatoeba/tatoeba2/pull/2399). However, because the hyphen is also a metacharacter that means "exclude sentences containing that word", we'll have to carefully check how such tuning affects the use of the hyphen as a metacharacter.

DostKaplan, February 26, 2023 at 11:31:11 AM UTC

What about the use of '\' as an escape character just like in true regexp?

gillux, February 26, 2023 at 11:46:08 AM UTC

Yeah, Manticore allows that, and apparently the query parser can also guess not to interpret the hyphen as a metacharacter in certain contexts.
https://manual.manticoresearch....on#blend_chars

My concerns are about usability rather than feasibility.
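
The double role of the hyphen discussed in this thread can be sketched with a toy query parser. The rules below are invented for illustration and are not Manticore's actual parsing logic: a leading "-" excludes a word, while a leading "\-" keeps the hyphen as a literal character. The function name `parse_query` is hypothetical.

```python
def parse_query(query):
    """Split a query into (required, excluded) term lists.

    Toy rules, not Manticore's: a leading "-" excludes the word,
    and a leading backslash-hyphen keeps the hyphen as a literal character.
    """
    required, excluded = [], []
    for term in query.split():
        if term.startswith("\\-"):
            required.append(term[1:])   # escaped: literal "-word"
        elif term.startswith("-") and len(term) > 1:
            excluded.append(term[1:])   # operator: exclude "word"
        else:
            required.append(term)
    return required, excluded

print(parse_query(r"culturally -rich"))   # (['culturally'], ['rich'])
print(parse_query(r"culturally \-rich"))  # (['culturally', '-rich'], [])
```

The usability concern shows up even in this small example: a user who types `-rich` hoping to find a hyphen gets an exclusion instead, and has to know about the escape to get the other meaning.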

bicolino34, March 10, 2023 at 10:43:00 AM UTC

How can I release all my sentences as CC0?

Pineapple, March 10, 2023 at 1:10:41 PM UTC

Here are instructions along with some info about limitations you may want to consider:
https://en.wiki.tatoeba.org/art...-contributions

bicolino34, March 10, 2023 at 2:15:12 PM UTC

Thank you

CK, March 4, 2023 at 3:01:52 AM UTC

Tatoeba's Stats Page - 2023-03-04 Top 5, back to 2020-03-04

https://picallow.com/tatoebas-s...to-2020-03-04/

I've taken screenshots on March 4th for the past few years. I've merged these into one image.

hab638, February 27, 2023 at 4:29:22 PM UTC (edited February 27, 2023 at 4:30:42 PM UTC)

When I encounter a sentence that I consider flawed, but not strictly speaking ungrammatical, should I do something (report it, comment, complain)? Here's an example: the Italian sentence "Mio figlio poteva morire" means "My son could have died." It is not, however, in the conditional tense, as in "My son could have died if he hadn't been so quick to react." It is, rather, in the imperfect tense, as in "Because he was mortal, my son could have died," which is something no one says, ever, even though it's technically grammatical. To say a specific person is capable of dying means nothing. It's as if a very poor computer algorithm translated the sentence. I come across these from time to time, and I'm trying to understand whether sentences like this are compatible with the Tatoeba mission statement, which includes the goal of making sure the data is "of good quality." So, is a grammatical sentence that means nothing of good quality or not?

brauchinet, February 27, 2023 at 6:01:20 PM UTC

I’d say it’s best to leave a comment.
There are many reasons why a sentence can be “less than natural”.
- It could have been added by a non-native.
- Native speakers who aren’t used to translating tend to adhere too closely to the source sentence (tenses in different languages are often similar, but there is no one-to-one relationship).
- Native speakers (of different regions) have different ways of speaking.
- “You” are not a native speaker, and your opinion is based on what you learned in school.

In the case you mentioned, the sentence was added by a native speaker, in fact the most active Italian one. Chances are good that he will read your comment and even answer.

Many contributors have left though and won’t see a comment or won’t answer.
In case a sentence is undoubtedly wrong, a corpus maintainer of Italian who happens to see your comment could change it. If the situation is not so unequivocal, the rules state that the decision should be left to the owner of the sentence. If there's no decision, it will stay as it is.

At least other users of Tatoeba will see your comment and may question or further analyse the sentence.

There is another way to communicate your opinion on the sentence: the rating system: ✓ ? !
It means: I (as a native speaker) think this sentence is ok / doubtful, I wouldn’t use it myself / wrong.

AlanF_US, February 28, 2023 at 1:27:03 AM UTC (edited February 28, 2023 at 1:27:42 AM UTC)

I agree with brauchinet's analysis.

Regarding the specific sentence you mentioned: Although I'm not an expert in Italian, I can imagine situations in which one would use the imperfect "poteva" in conjunction with "morire". How about an Italian parent having just talked about how their son was in active combat for years, then going on to say that during that time, death was always a possibility?

Guybrush88, February 28, 2023 at 2:51:08 PM UTC

> How about an Italian parent having just talked about how their son was in active combat for years, then going on to say that during that time, death was always a possibility?

As an Italian speaker, I'd use this tense in this context, and also in other contexts where someone faces risks (diseases, accidents, extreme sports, and so on).

Cangarejo, March 1, 2023 at 2:15:12 PM UTC (edited March 1, 2023 at 5:05:30 PM UTC)

This reminds me of a sentence I once posted.

#11005383 I was told right at the beginning that running a solar cooker company and making it feasible and financially stable was impossible.

Normally, a sentence like this would end with “... would be impossible.” It’s much more common for Portuguese speakers to say “was” instead of “would be”. I imagine that’s also true for Italian speakers. It’s grammatically correct in Portuguese, but maybe not so in English.

AlanF_US, March 3, 2023 at 12:33:38 AM UTC (edited March 3, 2023 at 12:35:07 AM UTC)

Using "was" instead of "would be" here is not grammatically incorrect in English, as long as you think of "running a solar cooker company and making it feasible and financially stable" as a single action. Otherwise, you'd need to say "were" instead of "was" (or reword things a little: "I was told right at the beginning that running a feasible, financially stable solar cooker company was impossible"). However, I think that English speakers may be more likely to use the conditional rather than the indicative when discussing a situation like this (which was hypothetical at the time that the discussion took place).

mrbeef12, March 2, 2023 at 11:40:39 AM UTC

After a lot of work I finally managed to release 39 Anki flash card frequency decks based on Tatoeba sentences. They contain not only the sentences, but also audio for each sentence (taken from Tatoeba or high-quality text-to-speech) and individual word translations. For most languages they contain 9000 cards (except where Tatoeba had fewer sentences).

Code, screenshots and deck downloads are here: https://github.com/Vuizur/tatoeba-to-anki

I recently used one of the decks to learn Czech, and it was much more effective in my opinion than for example Duolingo. Most of all I recommend those decks to language learners who know the script of their language, but still don't know enough to understand natural input.

DJ_Saidez, March 2, 2023 at 5:32:42 PM UTC

Very helpful, thank you!

Thanuir, March 2, 2023 at 5:48:57 PM UTC

Can it also create other decks besides those with English as one side?

mrbeef12, March 2, 2023 at 9:09:04 PM UTC (edited March 2, 2023 at 9:15:17 PM UTC)

Yes, it supports all language combinations, but currently only English Wiktionary definitions. (I'm looking into possibilities for other languages, but it's not that easy.)

Yorwba, March 2, 2023 at 7:46:51 PM UTC

Good job! Using <details><summary> for expandable dictionary definitions is a nice solution.

However, I'm noticing a distinct lack of source attribution links on the cards. Not only are you required to provide attribution to comply with the CC BY license (CC BY-SA in the case of Wiktionary), having links would also make it easier for users of the decks to contribute back by e.g. fixing mistakes they notice while studying.

You might also want to upload shared decks to ankiweb.net to help people discover your work.
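
For the attribution point, a card footer could be generated along these lines. This is only a sketch: the helper `attribution_html`, the sentence ID and the author name are made up, and the base URL is an assumption about how Tatoeba sentence pages are addressed. The exact licence wording would also need checking against the licence of each sentence.

```python
from html import escape

def attribution_html(sentence_id, author,
                     base_url="https://tatoeba.org/en/sentences/show/"):
    """Build a small HTML footer linking a card back to its source sentence.

    `base_url` is an assumed URL pattern; adjust it if the site uses another one.
    """
    url = f"{base_url}{sentence_id}"
    return (f'<small>Sentence <a href="{escape(url)}">#{sentence_id}</a> '
            f'by {escape(author)}, Tatoeba, CC BY</small>')

# Hypothetical sentence ID and author, for illustration only.
print(attribution_html(123456, "example_user"))
```

A link per card also gives learners a direct way to comment on a sentence when they spot a mistake while studying.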

mrbeef12, March 2, 2023 at 9:14:46 PM UTC

Thanks, those are good points! I'll add attribution to the descriptions. You're right about the backlinks; whether I add them depends on how difficult they are to implement. And I'll try to do Ankiweb.