Wall (7,128 threads)

Tips

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

sharptoothed, December 31, 2024 at 9:52:41 AM UTC

✹✹ Tatoeba Year 2024 Graphs ✹✹

https://tatoeba.j-langtools.com...24/graphs.html

Previous years:
https://tatoeba.j-langtools.com...23/graphs.html
https://tatoeba.j-langtools.com...22/graphs.html
https://tatoeba.j-langtools.com...21/graphs.html
https://tatoeba.j-langtools.com...20/graphs.html
https://tatoeba.j-langtools.com...19/graphs.html
https://tatoeba.j-langtools.com...18/graphs.html
https://tatoeba.j-langtools.com...17/graphs.html
https://tatoeba.j-langtools.com...16/graphs.html

felix63, December 25, 2024 at 11:29:55 AM UTC

🎁 🎉 Happy holidays to each and every one of you! 🔔 🎅

sharptoothed, December 22, 2024 at 7:18:17 AM UTC

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

Ergulis, December 21, 2024 at 9:09:52 AM UTC

I was wondering if I could change the name of a tag. I would like to modify a tag created by me, but I don't know how to do it, if it is even possible.

Yorwba, December 21, 2024 at 2:03:42 PM UTC

You can remove the old tag from a sentence and add a new tag with a different name instead. If you want to change all occurrences of a tag, it's going to be a lot of work, of course. (Unless you know how to automate it. That's how I added "quote" tags to a bunch of sentences tagged "by <Somebody>".)

And you cannot change tags that someone else added, but you can add the new tag next to the old tag.

Ergulis, December 21, 2024 at 2:52:19 PM UTC

Thanks for the reply. Personally I think that only admins can change the names of tags.

Borbie, December 12, 2024 at 1:26:47 AM UTC (edited December 12, 2024 at 1:32:21 AM UTC)

I think the Cyrillic/Latin transliterator for Uzbek is redundant, since the language officially transitioned to the Latin alphabet in 2023, and some languages that use both alphabets (for example, Serbian) don't have that feature.

The transliteration feature has been there since Uzbek was first added to Tatoeba, back in 2010.

I remember Georgian having the transliteration feature, but that was removed since it was redundant for a phonemic language.

Yorwba, December 14, 2024 at 2:53:55 PM UTC

It's not completely redundant, as there are a bunch of Uzbek sentences in the database using Cyrillic script. And the transliteration feature makes it possible to find them even when you're using Latin script to search: https://tatoeba.org/en/sentence...ry=dushmanning
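To illustrate the idea, here is a minimal sketch of script-insensitive search: both the query and the stored sentences are converted to Latin before comparison. This is not Tatoeba's actual implementation. The mapping table is deliberately partial, the function names `to_latin` and `search` are made up for this example, and the sample data is hypothetical.

```python
# Illustrative only: a partial Uzbek Cyrillic-to-Latin table,
# NOT Tatoeba's real transliterator. Context-dependent letters are omitted.
CYRILLIC_TO_LATIN = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "з": "z",
    "и": "i", "к": "k", "л": "l", "м": "m", "н": "n", "о": "o",
    "п": "p", "р": "r", "с": "s", "т": "t", "у": "u", "ф": "f",
    "х": "x", "ш": "sh", "ч": "ch", "қ": "q", "ҳ": "h",
}


def to_latin(text: str) -> str:
    """Lowercase the text and map each known Cyrillic letter to Latin.

    Characters without an entry in the table are passed through unchanged.
    """
    return "".join(CYRILLIC_TO_LATIN.get(ch, ch) for ch in text.lower())


def search(query: str, sentences: list[str]) -> list[str]:
    """Return sentences whose Latin form contains the Latin form of the query."""
    needle = to_latin(query)
    return [s for s in sentences if needle in to_latin(s)]


# Hypothetical sample data: the same word in both scripts, plus an unrelated word.
SENTENCES = ["душманнинг", "dushmanning", "kitob"]

if __name__ == "__main__":
    for hit in search("dushmanning", SENTENCES):
        print(hit)
```

With this kind of normalization, a Latin-script query matches the Cyrillic entry as well as the Latin one, which is the benefit described above.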

Borbie, December 15, 2024 at 1:14:19 AM UTC

I wasn't aware of this side of the transliteration feature until now. Thank you for letting me know about it!

coinxee, July 17, 2024 at 2:52:34 AM UTC

Is there an open-source English sentence database similar to Tatoeba?

Augustus, July 18, 2024 at 8:53:16 PM UTC

Mozilla's Common Voice is similar in collecting sentences and recordings thereof. It does not have the translation aspect of Tatoeba.

See https://commonvoice.mozilla.org/

urro, July 20, 2024 at 1:23:22 AM UTC (edited July 20, 2024 at 1:27:38 AM UTC)

If you just need English sentences, there are a few. However, I have looked myself, and found Tatoeba to be of the best quality, especially for English.

English-only:
• English Penn Treebank (University of Pennsylvania)
... is not something I know much about.
• English Web Treebank (Universal Dependencies)
... is mostly composed of biased sentence picks, but each has a grammatical breakdown. Stanford's NLP project Stanza uses it.
• Common Voice (Mozilla Foundation)
... as Augustus said!

With translation:
• OpenSubtitles2018 Corpus (OpenSubtitles)
... isn't very good for high-fidelity translation, but is rather natural, apart from its dramatizations.

Honorable mentions:
• Google Books Ngram Dataset (Google)
... only has a few languages. For example, their Japanese dataset is old and can only be accessed via purchase in yen.
• Wikipedia and Wiktionary (Wikimedia Foundation)

• Any other English (meta)corpora out there

https://www.google.com/search?q...s"%7C"dataset"

It really depends on your intentions and usage, as all corpora have their biases, unfortunately.

CK, December 7, 2024 at 10:11:37 AM UTC

🍎 Random Esperanto Sentences with Audio by PaulP

https://bit.ly/rndepoaudio

PaulP, December 8, 2024 at 6:43:31 AM UTC

Interesting link, CK. Thanks!

sharptoothed, December 8, 2024 at 6:19:54 AM UTC

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

CK, November 28, 2024 at 1:28:14 PM UTC

🍎 Top 5 Languages by number of sentences with audio

Esperanto advanced to 3rd place today.

English (849,032)
Spanish (118,277)
Esperanto (53,135)
Kabyle (53,056)
German (32,943)

Since last December, PaulP has contributed over 48,300 audio files for Esperanto sentences.

You can listen to his most recently added audio files at the top of this list.

https://tatoeba.org/en/sentence...how/171975/und

This link will also show all linked translations and will show the "add a translation" icon.

sananab, November 24, 2024 at 10:35:03 PM UTC

Hi, your website seems to be down. The front page works, but every attempt to search sends me the message "Tatoeba is currently unavailable. We are sorry for the inconvenience. You can check our blog or Twitter for more information."

AlanF_US, November 25, 2024 at 12:51:15 AM UTC

What are you searching for? When I try searching for "all" or "thing" in English, I get hits. However, when I leave the word field blank and search in English (which normally gives me every English sentence), I get a message that an internal error occurred.