menu
Tatoeba
language
Tomar bibe Têkeve
language Kurdî
menu
Tatoeba

chevron_right Tomar bibe

chevron_right Têkeve

Lê bigere

chevron_right Show random sentence

chevron_right Li gorî zimên bigere

chevron_right Li gorî lîsteyê

chevron_right Lî gor etîketê

chevron_right Li dengan bigere

Civak

chevron_right Dîwar

chevron_right Lîsteya hemû endaman

chevron_right Zimanên endaman

chevron_right Zimanên dayikê

search
clear
swap_horiz
search

Dîwar (7.316 mijar)

Serbend

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Peyamên dawîn subdirectory_arrow_right

araneo

21 demjimêr berê

subdirectory_arrow_right

Alex_M

doh

subdirectory_arrow_right

Alex_M

2 roj berê

subdirectory_arrow_right

kumakyoo

3 roj berê

subdirectory_arrow_right

Tom9358

3 roj berê

subdirectory_arrow_right

Alex_M

3 roj berê

subdirectory_arrow_right

kumakyoo

3 roj berê

subdirectory_arrow_right

Alex_M

3 roj berê

feedback

gillux

4 roj berê

subdirectory_arrow_right

gillux

4 roj berê

Alex_M Alex_M 9 roj berê, edited 9 roj berê 2026 reşemiyê 9 07:01:11 UTC, edited 2026 reşemiyê 9 07:02:13 UTC flag Report link Girêdana mayînde

Question about index list.
I want to see an alphabetical list of say German words. Not sentences, but just words.
When I notice a word which I do not know and which I wish to learn, I click on the word and see a German sentence with this word.
I have a book with pairs of sentences, and this book includes two indexes for both languages with corresponding page numbers. It is very convenient for finding and learning unknown words.
Is there such functionality on the website or via API?

{{vm.hiddenReplies[41654] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
AlanF_US AlanF_US 7 roj berê 2026 reşemiyê 11 13:40:21 UTC flag Report link Girêdana mayînde

No, there is no such functionality on the website or via the API.

{{vm.hiddenReplies[41661] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Alex_M Alex_M 6 roj berê 2026 reşemiyê 12 06:43:15 UTC flag Report link Girêdana mayînde

Thank you for the information.
Meanwhile I found the alphabetical index lists online. For example, for German language it is: https://de.wiktionary.org/wiki/...l:Präfixindex/
But this index is overwhelming as it includes all the words including rare terms.
It would be interesting to have an index list of words which native speakers use in the sentences.

gillux gillux 4 roj berê 2026 reşemiyê 14 11:01:12 UTC flag Report link Girêdana mayînde

I believe your idea would be very useful, but it might be a bit out of the scope of Tatoeba. What I mean is that Tatoeba focuses mainly on building the corpora and making it available to the world, while others can build upon this resource to make something more specific, such as an alphabetical list of German words.

That being said, the search engine that powers tatoeba.org precisely has word indexes for every language. There is a simple tool to dump these indexes, so we can consider exporting them as text files. However, because the purpose of these indexes is only to allow very fast retrieval of sentences, this data might be a bit too "raw" to be directly usable by language learners. But please let me know if you or anyone else is interested by such dumps.

{{vm.hiddenReplies[41665] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Alex_M Alex_M 3 roj berê 2026 reşemiyê 14 22:18:53 UTC flag Report link Girêdana mayînde

Thank you for your input. For me personally a text file would not be useful. What I had in mind is the list of distinct words on an HTML page, where a word could be clicked to see the sentences in which it is used.

And it would be nice to have a possibility to sort it in alphabetical order and in frequency order.

Certainly, it would be a complex task. For example, there is a German word: Zugeständnis, but plural form of this word is: Zugeständnisse. I am not sure if both variants should be in such a list or only one. And what if a word exists in the database with a spelling error? That's to say there would be for sure an issue of rawness which you mentioned.

I asked about this index list only because I try to learn new words in a sentence. And it would be easier for me to spot in such a list words which I do not know yet. Especially if it is a word with high frequency.

{{vm.hiddenReplies[41668] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
kumakyoo kumakyoo 3 roj berê 2026 reşemiyê 15 09:53:01 UTC flag Report link Girêdana mayînde

Ich hab' mir kürzlich ein kleines PHP-Script geschrieben, dass aus dem Tatoeba-Datendump eine Liste der griechischen Wörter extrahiert und zu jedem Wort notiert, wie oft dieses Wort im Korpus vorkommt. (Mir ging es dabei darum, die Sätze im Korpus nach "Einfachheit" zu sortieren, mit der Idee, dass häufige Wörter einfacher sind, als seltene Wörter. Das hat ganz gut geklappt, auch wenn es nicht perfekt ist.)

Für dein Anliegen könnte man das Programm benutzen, um eine entsprechende Wortliste für den deutschen Korpus zu erstellen (evtl. beschränkt auf die 10.000 häufigsten Wörter). Die Ausgabe könnte man so gestalten, dass dabei eine HTML-Seite entsteht, die Links auf die Tatoeba-Suche enthält, sodass du mit einem Klick Sätze mit diesem Wort erhältst. Mein Programm kann dabei unterschiedliche Varianten eines Wortes (Zugeständnis/Zugeständnisse) nicht berücksichtigen, macht also zwei Einträge daraus. Aber, soweit ich weiß, kann die Tatoeba-Suche das bei einigen Sprachen und Deutsch war da dabei.

Ich denke, es ist nicht viel Arbeit für mich, das Programm entsprechend anzupassen. Ich könnte dir also anbieten, so eine HTML-Seite zu erstellen und dir per E-Mail zu schicken (schreib mir einfach eine private Nachricht mit deiner E-Mail-Adresse, dann mache ich das). Evtl. könnte ich das Programm auch so überarbeiten, dass ich es auf GitHub allen zur Verfügung stellen kann, mal sehen...

{{vm.hiddenReplies[41669] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Alex_M Alex_M 3 roj berê 2026 reşemiyê 15 12:33:50 UTC flag Report link Girêdana mayînde

Vielen Dank! Selbstverständlich können Sie mir den Link per privater Nachricht schicken. Oder, vielleicht, Sie könnten ihn hier posten, damit auch andere Teilnehmer die Liste ansehen können?

{{vm.hiddenReplies[41670] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
kumakyoo kumakyoo 3 roj berê, edited 3 roj berê 2026 reşemiyê 15 17:47:31 UTC, edited 2026 reşemiyê 15 18:44:57 UTC flag Report link Girêdana mayînde

Ich hab' das Programm mal bei GitHub hochgeladen: https://github.com/kumakyoo42/tatoeba_stuff

Das Programm selber heißt "count_words.php". Man benötigt dafür die Sätze-Datei von https://tatoeba.org/de/downloads (sentences.tar.bz2). Diese muss entpackt sein. Als Beispiel habe ich die Top 10.000 der deutschen Wörter ebenfalls hochgeladen (top10000_deu.html).

Ich hoffe, das ist in etwa das, was du gesucht hast.

{{vm.hiddenReplies[41672] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Alex_M Alex_M 2 roj berê 2026 reşemiyê 16 17:23:27 UTC flag Report link Girêdana mayînde

It is exactly what I need! Plus, there is a number near each word which shows how many times it was used in the sentences.

It works for every language (the language code to be uses is ISO 639-3, i.e. 3-letter code, not 2).

I changed the script a little for myself, - I removed converting to lowercase. It's easier for me to distinguish nouns this way.

I've already found a word which I did not know, and I could click on it in the list and see some sentences with it. Incredible programming! Thank you!

{{vm.hiddenReplies[41674] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Alex_M Alex_M doh 2026 reşemiyê 17 10:52:58 UTC flag Report link Girêdana mayînde

I created three HML pages based on the PHP script from https://github.com/kumakyoo42/tatoeba_stuff

These are frequency lists for German, English, and French languages, twenty thousand words in each:
https://labellechose.ch/frequency-lists/deu.html
https://labellechose.ch/frequency-lists/eng.html
https://labellechose.ch/frequency-lists/fra.html

Click on a word and the sentences with this word are displayed, click on the arrow and the definition from TFD dictionary is displayed.

These pages were created for personal use. They are located on the self-hosted mini-server which is not always online. The links are published for demonstration only as part of this discussion.

{{vm.hiddenReplies[41675] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
araneo araneo 21 demjimêr berê 2026 reşemiyê 17 22:01:59 UTC flag Report link Girêdana mayînde

Thank you both!

2 roj berê 2026 reşemiyê 16 11:53:23 UTC link Girêdana mayînde
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

gillux gillux 4 roj berê, edited 4 roj berê 2026 reşemiyê 14 12:06:58 UTC, edited 2026 reşemiyê 14 12:09:20 UTC flag Report link Girêdana mayînde

Tatoeba was updated today. What’s new?

- The icon of the Marathi language was updated, thanks to @sabretou and me. Sabretou added the original icon a very long time ago, but now believes it is outdated, and that there is a better, more representative option for Marathi.
More background info: https://github.com/Tatoeba/tatoeba2/issues/3238
New vs. old image: https://github.com/Tatoeba/tato...d1ca024cf6dd95

- The new API at https://api.tatoeba.org/, dubbed "v1", is now considered stable, so people are welcome to build tools upon it. (An API allows external programs or apps to directly connect to Tatoeba and browse the corpus.) This "v1" release is the culmination of months of work to provide a well-documented, modern, stable and easy-to-use API. If you are using the previous "v0" API, you are encouraged to migrate: https://en.wiki.tatoeba.org/art...i-migration-v1 The "v0" API will keep working for some time to ensure backward compatibility, but people should not build new tools based on it.

- @kumakyoo made his first contribution to Tatoeba’s code: https://github.com/Tatoeba/tato...l/3244/changes Thank you very much! If you would like to get involved with the development of Tatoeba, feel free to contact us. We need help from coders, but also for interface translation, documentation as well as UI/UX designers.

{{vm.hiddenReplies[41667] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Tom9358 Tom9358 3 roj berê 2026 reşemiyê 15 15:24:18 UTC flag Report link Girêdana mayînde

I think the API v1 being considered stable is a big step! Congrats, gefeliciteerd!! 👏

4 roj berê, edited 4 roj berê 2026 reşemiyê 14 11:33:40 UTC, edited 2026 reşemiyê 14 11:34:02 UTC link Girêdana mayînde
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

CK CK 5 roj berê 2026 reşemiyê 13 06:43:46 UTC flag Report link Girêdana mayînde

🍎 These have been updated.

https://www.manythings.org/bilingual/

Bilingual Sentence Pairs

https://www.manythings.org/anki/

Tab-delimited Bilingual Sentence Pairs

These are selected sentence pairs from the Tatoeba Project.

Thanks to all of you who make this possible.

sharptoothed sharptoothed 24 roj berê 2026 rêbendanê 25 07:09:55 UTC flag Report link Girêdana mayînde

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

{{vm.hiddenReplies[41616] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
kumakyoo kumakyoo 24 roj berê 2026 rêbendanê 25 09:17:47 UTC flag Report link Girêdana mayînde

Cool. Couldn't these be linked somewhere on the website?

{{vm.hiddenReplies[41617] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
sharptoothed sharptoothed 23 roj berê 2026 rêbendanê 26 08:46:47 UTC flag Report link Girêdana mayînde

Well, I don't mind. :-)

{{vm.hiddenReplies[41618] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
AlanF_US AlanF_US 7 roj berê 2026 reşemiyê 11 13:50:04 UTC flag Report link Girêdana mayînde

I added it to the "Projects using Tatoeba" wiki page:

https://en.wiki.tatoeba.org/art...using-tatoeba#

cafoc64474 cafoc64474 9 roj berê 2026 reşemiyê 8 21:13:36 UTC flag Report link Girêdana mayînde

Yeah I always find it difficult to find the link.

Rovo Rovo 7 roj berê 2026 reşemiyê 10 21:06:03 UTC flag Report link Girêdana mayînde

Tatoeba me dit que la phrase
"Il ne faut pas appeler richesses les choses que l'on peut perdre.",
citation de Léonard de Vinci,
n'existe pas encore, donc je cherche à l'ajouter, ce que je ne peux pas faire car dans un second temps, Tatoeba me dit que cette phrase ne peut pas être ajoutée car le robot Voltaire l'a déjà fait...

{{vm.hiddenReplies[41657] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Pfirsichbaeumchen Pfirsichbaeumchen 7 roj berê, edited 7 roj berê 2026 reşemiyê 11 02:07:52 UTC, edited 2026 reşemiyê 11 02:36:02 UTC flag Report link Girêdana mayînde

Ich habe den Satz freigeschaltet: #7709449. Er kann jetzt wieder gefunden werden.

{{vm.hiddenReplies[41658] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Rovo Rovo 7 roj berê 2026 reşemiyê 11 13:18:36 UTC flag Report link Girêdana mayînde

Merci pour la réactivité. Dankon pro la rapida respondo. Vielen Dank für die schnelle Antwort.

LeviHighway LeviHighway 14 roj berê 2026 reşemiyê 4 16:34:39 UTC flag Report link Girêdana mayînde

Tatoeba doesn't have an active live-time chatroom. Discord servers are popular and convenient, would it be nice if we create a server for Tatoeba on Discord?

{{vm.hiddenReplies[41642] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
frpzzd frpzzd 13 roj berê, edited 13 roj berê 2026 reşemiyê 4 19:20:07 UTC, edited 2026 reşemiyê 4 19:20:28 UTC flag Report link Girêdana mayînde

I'd be in favor of something like this. However, I'm not sure whether Discord is popular worldwide / among other Tatoeba users. Perhaps a group over WhatsApp, Telegram or even IRC would be more likely to draw interest. I'm curious what others here think about this topic.

gillux gillux 13 roj berê 2026 reşemiyê 5 07:41:00 UTC flag Report link Girêdana mayînde

Tatoeba has an XMPP chatroom, see https://tatoeba.org/contact
It is rather quiet, but I believe most of our workflow does not require live interaction anyway.

{{vm.hiddenReplies[41646] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
PaulP PaulP 13 roj berê 2026 reşemiyê 5 07:58:09 UTC flag Report link Girêdana mayînde

Btw, the link to the Facebook group on that page is wrong. It should be https://www.facebook.com/groups/129340017083187

LeviHighway LeviHighway 8 roj berê 2026 reşemiyê 10 13:40:46 UTC flag Report link Girêdana mayînde

I think it would be good as a language learning community, aside from work flow related issues.

hecko hecko 10 roj berê 2026 reşemiyê 8 13:17:55 UTC flag Report link Girêdana mayînde

i recall being in a discord server about tatoeba, there wasn't much activity though so i left soon after

{{vm.hiddenReplies[41652] ? 'expand_more' : 'expand_less'}} bersivan veşêre bersivan nîşan bide
Thanuir Thanuir 7 roj berê, edited 7 roj berê 2026 reşemiyê 11 08:03:38 UTC, edited 2026 reşemiyê 11 08:04:03 UTC flag Report link Girêdana mayînde

Muistan myös kuulleeni tästä, mutta en liittynyt.

Discord-palvelimen voi kuka tahansa perustaa, jos kokee sen hyödylliseksi.

Toisaalta Discord on paskaantumassa, joten kenties mieluummin käyttää jotain avoimen koodin vaihtoehtoa.

11 roj berê 2026 reşemiyê 7 16:46:40 UTC link Girêdana mayînde
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

11 roj berê 2026 reşemiyê 7 16:46:01 UTC link Girêdana mayînde
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.