menu
Tatoeba
language
Kaydol Giriş yap
language Türkçe
menu
Tatoeba

chevron_right Kaydol

chevron_right Giriş yap

Göz At

chevron_right Rastgele cümle göster

chevron_right Dile göre ara

chevron_right Listeye göre ara

chevron_right Etikete göre ara

chevron_right Ses ara

Topluluk

chevron_right Duvar

chevron_right Tüm üyelerin listesi

chevron_right Üyelerin dilleri

chevron_right Ana diller

search
clear
swap_horiz
search

Duvar (7.280 konu)

Öneriler

Soru sormadan önce SSS'yi okuduğunuzdan emin olun.

Seviyeli tartışmalar için sağlıklı bir atmosfer yaratmayı amaçlıyoruz. Lütfen kötü davranışlara karşı kurallarımızı okuyun.

Son mesajlar subdirectory_arrow_right

araneo

18 saat önce

feedback

LeviHighway

19 saat önce

subdirectory_arrow_right

LeviHighway

2 gün önce

feedback

CK

3 gün önce

subdirectory_arrow_right

sharptoothed

9 gün önce

subdirectory_arrow_right

small_snow

10 gün önce

subdirectory_arrow_right

frpzzd

11 gün önce

subdirectory_arrow_right

LeviHighway

12 gün önce

subdirectory_arrow_right

frpzzd

12 gün önce

feedback

sharptoothed

12 gün önce

LeviHighway LeviHighway 19 saat önce, 19 saat önce düzenlendi 25 Aralık 2025 16:54:26 UTC, 25 Aralık 2025 16:55:16 UTC düzenlendi flag Report link Kalıcı bağlantı

Hi everyone,
​I want to add a new tag to Tatoeba for "吉祥話" (Auspicious words/sayings).
​Auspicious words are a unique cultural tradition in Chinese communities. These are specific phrases or idioms used on festivals and ceremonies to wish people obtain luck and fortune.
​According to the current tagging policy, tag names should be in English whenever possible.
​However, this category is specific to Chinese culture, so its sentences should be mostly Chinese. And I'm not sure if translations of these sentences should also be tagged the same.
​So I would like to ask, whether the tag name should be in English ("auspicious words" or "auspicioys sayings) or Chinese (吉祥話).
​And if we use English, what name should we choose? "Auspicious words," "Auspicious sayings," or just "Auspicious"? If we use Chinese, should we use Simplified or Traditional?

{{vm.hiddenReplies[41564] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
araneo araneo 18 saat önce 25 Aralık 2025 18:27:17 UTC flag Report link Kalıcı bağlantı

As it's so tied to Chinese culture, it makes sense to have the tag be in Chinese, especially as there's a dedicated way to refer to them in comparison to English, as when I googled it, the translations "lucky sayings," or "words of good fortune" were also offered. The request to have tags in English is so that they're not repeated across multiple languages, but it seems unlikely that this tag would end up being repeated, and having it in English may actually make it more likely to be repeated (unless other languages have similar concepts?).

Could you possibly put it in both traditional and simplified with a / between? That way if someone searches in either it will come up (unless it already works like this, I haven't tried searching in Chinese before).

CK CK 3 gün önce, 3 gün önce düzenlendi 23 Aralık 2025 03:07:59 UTC, 23 Aralık 2025 03:21:55 UTC düzenlendi flag Report link Kalıcı bağlantı

🎅 🤶 Translate "Christmas" Sentences

If you are looking for Christmas-related sentences to translate, here are some ideas.


🎄Search for sentence using the "or" operator the |

Christmas|Santa|Claus

Try adding other words like this, which may also show some non-Christmas sentences:

Christmas|Santa|Claus|Carols|Gift|Present


🎄Using the "advanced search", you can fine-tune the searches to only show you sentences not yet translated into your own native language.


🎄Here are just a few sample preset English searches you can try

🎁 Random English Sentences with Audio
https://tatoeba.org/en/sentence...rd_count_min=2
Christmas|Santa|Claus (1,000 results out of 1,152 occurrences)

🎁 Random English Sentences with audio with no translations yet
https://tatoeba.org/en/sentence...rd_count_min=2
Christmas|Santa|Claus (80 results)

🎁 Random English Sentences with no audio with no translations yet
https://tatoeba.org/en/sentence...rd_count_min=2
Christmas|Santa|Claus (275 results)


🎄You can try similar searches for other languages.


🎄We also have a "Christmas" tag:

https://tatoeba.org/en/tags/sho...s_with_tag/493

The "advanced search" also allows filtering by tag.

{{vm.hiddenReplies[41562] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
LeviHighway LeviHighway 2 gün önce 24 Aralık 2025 00:02:39 UTC flag Report link Kalıcı bağlantı

Merry Christmas, CK. The holiday special quest is sweet!

4 gün önce 22 Aralık 2025 09:46:24 UTC link Kalıcı bağlantı
warning

Bu mesajın içeriği kurallarımızla ters düşmektedir ve bu nedenle gizlenmiştir. Sadece yöneticiler ve mesajın sahibi görebilir.

7 gün önce 18 Aralık 2025 17:14:07 UTC link Kalıcı bağlantı
warning

Bu mesajın içeriği kurallarımızla ters düşmektedir ve bu nedenle gizlenmiştir. Sadece yöneticiler ve mesajın sahibi görebilir.

9 gün önce 16 Aralık 2025 17:43:21 UTC link Kalıcı bağlantı
warning

Bu mesajın içeriği kurallarımızla ters düşmektedir ve bu nedenle gizlenmiştir. Sadece yöneticiler ve mesajın sahibi görebilir.

sharptoothed sharptoothed 12 gün önce 14 Aralık 2025 06:07:02 UTC flag Report link Kalıcı bağlantı

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

{{vm.hiddenReplies[41551] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
small_snow small_snow 10 gün önce 16 Aralık 2025 11:06:53 UTC flag Report link Kalıcı bağlantı

お元気ですか?
いつもありがとうございます!

{{vm.hiddenReplies[41555] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
sharptoothed sharptoothed 9 gün önce 16 Aralık 2025 17:17:31 UTC flag Report link Kalıcı bağlantı

元気です。
どういたしまして!:-)

LeviHighway LeviHighway 12 gün önce 13 Aralık 2025 13:31:41 UTC flag Report link Kalıcı bağlantı

I'm posting here to ask if anyone know collaborative language learning platforms that I can contribute on like Tatoeba. I like it when my content can be directly useful for others.

{{vm.hiddenReplies[41546] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
frpzzd frpzzd 12 gün önce 13 Aralık 2025 18:14:31 UTC flag Report link Kalıcı bağlantı

What kind of features would you have in mind for such a platform?

I've daydreamed for a while about a collaborative and open-source framework for writing open-source language/grammar textbooks containing structured data for things like grammar topics, vocab lists, and exercises. There are several language textbooks that I really admire, but they frustrate me for 2 reasons. (1) Their data (e.g. list of grammar topics, vocabulary per chapter, etc) is not delivered in a structured format and can only be extracted by OCR/scraping (which often has errors) or manual data entry, so it's hard to use it in tandem with other resources such as Tatoeba. (2) Textbook licenses make it hard to use textbook content for legal reasons as well.

Another part of this daydream is the idea of a language textbook with non-linear progression between chapters. Each chapter or exercise would have a list of "dependencies", i.e. other chapters whose content it depends on, but a reader would be free to traverse the chapters in any order so long as they complete each chapter's dependencies before reading it. All chapters together would form a DAG (directed acyclic graph) structure, not just a tree.

There are a few permissively licensed language textbooks out there, but they are usually one-off projects by some institution or professor that don't allow ongoing contributions, and they still lock up their data in an inconsistent/un-parseable format. I also find that a lot of more recent language textbooks de-emphasize grammar and instead focus on trying to simulate immersion through activities and media, which I personally find unhelpful.

So for now this is just a bit of a pipe dream. However, if I had some help and knew that a considerable number of people would be interested in contributing to open source language web-textbooks, I might try programming a framework for such textbooks myself.

{{vm.hiddenReplies[41547] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
LeviHighway LeviHighway 12 gün önce, 12 gün önce düzenlendi 14 Aralık 2025 05:59:00 UTC, 14 Aralık 2025 05:59:17 UTC düzenlendi flag Report link Kalıcı bağlantı

About dependencies, you mean something like: before you start this lesson, you have to look through these words first?

{{vm.hiddenReplies[41550] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
frpzzd frpzzd 12 gün önce 14 Aralık 2025 07:11:57 UTC flag Report link Kalıcı bağlantı

Regarding dependencies, I was thinking that not only vocabulary but also grammar concepts could be involved in dependencies. For example, imagine the following grammar concepts in Spanish which might correspond to chapters / sections in a textbook:

1) The alphabet and pronunciation
2) Nouns and grammatical gender
3) Plural nouns and grammatical number
4) Spanish subject pronouns
5) Adjective-noun agreement
6) The verb "ser"

Of course the alphabet is needed for all of these. The topics (2), (3) and (4) could be studied more or less independently of each other. However, (5) depends on both (2) and (3), since Spanish adjectives agree based on gender and number. Further, (6) might depend on (3) and (4) since conjugating verbs in Spanish requires some understanding of grammatical number and subject pronouns. So a person could progress through these topics in, say, any of the following orders:

1 --> 2 --> 3 --> 4 --> 5 --> 6
1 --> 2 --> 3 --> 4 --> 6 --> 5
1 --> 2 --> 3 --> 5 --> 4 --> 6
1 --> 4 --> 2 --> 3 --> 5 --> 6

This kind of thing would be especially handy for someone with some limited background knowledge of the language. In this hypothetical web textbook, the user could check off the grammar topics they had already studied, and the app would indicate which chapters they could study next with their background knowledge.

{{vm.hiddenReplies[41552] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
LeviHighway LeviHighway 12 gün önce, 12 gün önce düzenlendi 14 Aralık 2025 08:16:02 UTC, 14 Aralık 2025 08:16:37 UTC düzenlendi flag Report link Kalıcı bağlantı

Do you have any existing processes for this? I mean mainly for learning content or just structure.

{{vm.hiddenReplies[41553] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
frpzzd frpzzd 11 gün önce 15 Aralık 2025 02:44:15 UTC flag Report link Kalıcı bağlantı

I don't know what you mean by that, can you be a little more specific?

Anyways, I don't meant to fill up the wall with all of my long messages. Feel free to send me a DM if you want more details.

lingomaxim lingomaxim 12 gün önce 13 Aralık 2025 18:55:33 UTC flag Report link Kalıcı bağlantı

I forgot about it until reading this but there's a site called LangCorrect, which is not quite like what you described, but still may be interesting.

If I remember correctly, it's a writing prompt type thing in a bunch of languages. You can get corrections from native/fluent speakers as well as help people learning your own native language.

alt alt 14 gün önce 12 Aralık 2025 06:56:58 UTC flag Report link Kalıcı bağlantı

I was wondering if any one had a way to search for Japanese verbs that include all possible conjugations? I'm trying to automatically pull some sentences for my Anki cards but just searching for the dictionary form of a verb doesn't work well for proper sentences. Is there maybe some MantiCore syntax I could use here? Thanks!

{{vm.hiddenReplies[41538] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
LeviHighway LeviHighway 14 gün önce 12 Aralık 2025 07:14:22 UTC flag Report link Kalıcı bağlantı

You can just strip the verb ending. 読 would match 読む, 読みます, 読んだ, 読んで, 読まない, 読める, 読もう, etc.

gillux gillux 14 gün önce 12 Aralık 2025 09:32:37 UTC flag Report link Kalıcı bağlantı

Hello,

Searching in Japanese on Tatoeba is currently limited in that it only works a character level, not at a "word" level. Put differently, every character is considered as a word of its own. We would like to improve that situation, but we don’t have the resources to do so at the moment. Note that Tatoeba is an open project and contributions are welcome.

If you are looking for a way to automate the process of retrieving example sentences containing any form of a given verb, I recommend that you download the Japanese Tatoeba corpus as a file and to process it yourself with a natural language processing toolkit. Weekly exports are available at https://tatoeba.org/downloads.

If you are looking for a way to manually search for sentences containing any form of a few verbs, let me explain some ways this can be achieved currently. I'll take the verb 振る(ふる) as example.

1. First, you need to use quotes around Japanese keywords:

振ります
→ will match sentences containing the four characters 振, り, ま and す anywhere in a sentence, in any order (but the provided order is prioritized).

"振ります"
→ will match those four characters contiguously.

2. You can use @text and @transcription to match furigana, too:

@transcription "ふります" @text "振ります"
→ will match sentences having "ふります" in furigana and "振ります" in sentence text.

3. There is a caveat: the furigana will not be specifically matched against the corresponding kanjis, for example:

@transcription "ふ" @text "振"
→ could potentially match the sentence 降りるふりをしました。 because of the presence of unrelated ふ elsewhere in the sentence.
→ however the probability of such mismatch gets very rare if you use two or more characters in @transcription

4. Regarding your question about how to match all forms of a verb. Let’s first consider verbs which reading is unambiguous, such as 思う. If you search for the root "思", you will get unwanted matches such as 不思議. The easiest way to exclude these is to provide a suffix to 思. Because 思う is a godan verb, there are only 6 possible suffixes:

思う(おもう)
→ 思わ, 思い, 思う, 思え, 思お, 思っ

We can give all the forms separated by the OR operator "|":

"思わ"|"思い"|"思う"|"思え"|"思お"|"思っ"
→ this search shows all sentences containing the verb 思う conjugated

5. If the verb is ichidan, and the root has more than one character, you can just search for the root:

"食べ"
→ this search shows all sentences containing the verb 食べる conjugated

6. If the verb is ichidan and the root has only one character, because of the caveat explained at 3, you need to provide a second character. It gets tricky because there are many possible second character:

見る(みる)
"見る"|"見ま"|"見さ"|"見ら"|"見た"|"見な"|"見ろ"|"見よ"|"見て"|"見え"
→ brings a few false positive such as この指輪ね、祖母の形見なの。

7. If the verb reading is ambiguous, you can combine example 2 and 4:

振る(ふる)
→ 降ら, 降り, 降る, 降れ, 降ろ, 降っ

So the final search is:
(@text "降ら" @transcription "ふら") | (@text "降り" @transcription "ふり") | (@text "降る" @transcription "ふる") | (@text "降れ" @transcription "ふれ") | (@text "降ろ" @transcription "ふろ") | (@text "降っ" @transcription "ふっ")

(We need to use parenthesis because the implicit AND operator has a
higher priority than OR "|")

CK CK 14 gün önce, 14 gün önce düzenlendi 12 Aralık 2025 11:22:06 UTC, 12 Aralık 2025 11:24:25 UTC düzenlendi flag Report link Kalıcı bağlantı

Something else that might save you time is to use Jim Breen's wwwjdic.

Look up a verb, click the "links" link, find the "verb conjugation" link, then copy all the forms, and find what you need to search for as gillux explains.

This should save you a lot of typing, Here's an example of one verb conjugation page.
https://www.edrdg.org/cgi-bin/w...A4%A8%A4%EB_v1

Here also has an "example search".

https://www.edrdg.org/cgi-bin/w...A4%A8%A4%EB_1_

AlanF_US AlanF_US 13 gün önce 12 Aralık 2025 13:46:18 UTC flag Report link Kalıcı bağlantı

These responses by @gillux and @CK contain useful information that should be made available on the Tatoeba wiki as well.

frpzzd frpzzd 13 gün önce 12 Aralık 2025 15:40:55 UTC flag Report link Kalıcı bağlantı

Here are a few more links that you may find useful.

The Python NLP library called spaCy has pipelines available for Japanese:

https://spacy.io/models/ja

Using this library, you can download spaCy models and use them to analyze a sentence. Part of this pipeline includes a component that attempts to split the sentence into individual words, and reduce each word to its "base form". I personally use this library to generate fill-in-the-blank vocabulary exercises for German and Russian (a la Clozemaster) by searching for Tatoeba sentences containing words whose base forms are a target word that I want to study.

You may also find this useful:

https://github.com/cl-tohoku/J-UniMorph

This dataset purports to list all (most?) inflected forms for many Japanese words. Not only that, but the data file also lists how each form is inflected.

alt alt 13 gün önce 12 Aralık 2025 16:06:38 UTC flag Report link Kalıcı bağlantı

Thanks @gillux and @CK, I had not known about the character level only search. I was initially just sniping off the last character of the verb but as you said, I'd be running into issues with words containing just the root. I think obtaining all the conjugations separately and searching for that is probably the way to go for now.

14 gün önce 12 Aralık 2025 07:35:57 UTC link Kalıcı bağlantı
warning

Bu mesajın içeriği kurallarımızla ters düşmektedir ve bu nedenle gizlenmiştir. Sadece yöneticiler ve mesajın sahibi görebilir.

14 gün önce 11 Aralık 2025 18:09:32 UTC link Kalıcı bağlantı
warning

Bu mesajın içeriği kurallarımızla ters düşmektedir ve bu nedenle gizlenmiştir. Sadece yöneticiler ve mesajın sahibi görebilir.