Wall (7 162 threads)

rdgscratch, May 16, 2025, 22:43:20 UTC

Can you do recordings of my sentences?

Reply from PaulP, May 18, 2025, 04:07:46 UTC

You can do it yourself. Here's a short guide:
https://www.manythings.org/tatoeba/audacity.html

@CK will help you if you need assistance. But, btw, I see that you added sentences in about 60 languages. I don't suppose that you know how to pronounce them all, right?

I can do the Dutch and Esperanto sentences for you if they don't come from copyrighted sources.

sharptoothed, May 15, 2025, 12:50:31 UTC

✹✹ Stats & Graphs ✹✹

Tatoeba Top 30 Languages Graphs since Tatoeba "epoch"
https://tatoeba.j-langtools.com/epoch/

May 15, 2025, 12:45:39 UTC

This message violates our rules and is therefore hidden. Only administrators and the author of the message can see it.

frpzzd, May 12, 2025, 22:53:24 UTC (edited May 12, 2025, 23:02:57 UTC)

Just for funsies, I ran a script to list the languages that are least well represented on Tatoeba relative to their estimated speaker populations. (Specifically, I restricted the list to languages with >= 1mil speakers and sorted them by the ratio of the number of sentences on Tatoeba to the speaker population size.)

As you might expect, many of the worst-represented languages by this metric are varieties of Chinese. Aside from those, the 10 worst-represented languages are:

1. Sindhi (snd, 6 sentences vs. ~38.4mil speakers)
2. Sesotho (sot, 2 sentences vs. ~6.4mil speakers)
3. Maithili (mai, 8 sentences vs. ~19.3mil speakers)
4. Madurese (mad, 8 sentences vs. ~17.0mil speakers)
5. Libyan Arabic (ayl, 3 sentences vs. ~5.6mil speakers)
6. Western Punjabi (pnb, 72 sentences vs. ~113mil speakers)
7. Aymara (aym, 2 sentences vs. ~2.8mil speakers)
8. Pashto (pus, 47 sentences vs. ~53.0mil speakers)
9. Igbo (ibo, 35 sentences vs. ~28.0mil speakers)
10. Sundanese (sun, 40 sentences vs. ~32.0mil speakers)

If we restrict instead to languages with an estimated number of speakers >= 50mil, then here are the top 5 (excluding Chinese variants):

1. Western Punjabi (pnb, 72 sentences vs. ~113mil speakers)
2. Pashto (pus, 47 sentences vs. ~53mil speakers)
3. Punjabi (pan, 204 sentences vs. ~200mil speakers)
4. Gujarati (guj, 168 sentences vs. ~60mil speakers)
5. Telugu (tel, 271 sentences vs. ~95mil speakers)

On a more cheery note, here are the 5 *best* represented languages (that are not conlangs) with >= 1mil speakers, by the same metric:

1. Kabyle (kab, ~765k sentences vs. ~3.4mil speakers)
2. Macedonian (mkd, ~78k sentences vs. ~1.4mil speakers)
3. Lithuanian (lit, ~123k sentences vs. ~2.3mil speakers)
4. Hungarian (hun, ~420k sentences vs. ~11.8mil speakers)
5. Finnish (fin, ~151k sentences vs. ~5.2mil speakers)

And those with >= 50mil speakers:

1. Italian (ita, ~918k sentences vs. ~65mil speakers)
2. Turkish (tur, ~739k sentences vs. ~76mil speakers)
3. German (deu, ~721k sentences vs. ~92mil speakers)
4. Russian (rus, ~1.1mil sentences vs. ~170mil speakers)
5. French (fra, ~665k sentences vs. ~203mil speakers)
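
The ranking described in this post can be sketched roughly as follows. This is only an illustration, not the script that was actually run: the names `Language`, `rank_by_representation`, and `SAMPLE` are hypothetical, and the sample data is a handful of the rounded figures quoted in the lists above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Language:
    code: str        # ISO 639-3 code
    name: str
    sentences: int   # approximate sentence count on Tatoeba
    speakers: float  # estimated number of speakers


def rank_by_representation(languages, min_speakers=1_000_000):
    """Keep languages with at least `min_speakers` speakers and
    sort them by sentences per speaker, worst-represented first."""
    eligible = [lang for lang in languages if lang.speakers >= min_speakers]
    return sorted(eligible, key=lambda lang: lang.sentences / lang.speakers)


# Rounded figures copied from the lists in this post (assumed accurate).
SAMPLE = [
    Language("ita", "Italian", 918_000, 65e6),
    Language("snd", "Sindhi", 6, 38.4e6),
    Language("kab", "Kabyle", 765_000, 3.4e6),
    Language("pus", "Pashto", 47, 53e6),
    Language("sot", "Sesotho", 2, 6.4e6),
    Language("pnb", "Western Punjabi", 72, 113e6),
]

if __name__ == "__main__":
    for lang in rank_by_representation(SAMPLE):
        ratio = lang.sentences / lang.speakers
        print(f"{lang.name} ({lang.code}): {ratio:.2e} sentences per speaker")
```

Passing `min_speakers=50_000_000` gives the stricter cut used for the second and fourth lists, and reversing the result gives the best-represented languages first.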

Reply from lbdx, May 13, 2025, 16:13:08 UTC (edited May 14, 2025, 10:18:31 UTC)

Thanks Franklin. It's interesting to see how Eurocentric the Tatoeba corpus still is.

Based on the 2025 edition of Ethnologue 200, I found that some of the world's 100 most widely spoken languages are still completely unavailable on Tatoeba:

- Nigerian Pidgin [pcm] → 120.7M speakers
- Dari [prs] → 33.4M speakers
- Magahi [mag] → 21.0M speakers
- Chhattisgarhi [hne] → 16.3M speakers
- Pedi [nso] → 13.7M speakers
- Chittagonian [ctg] → 13.0M speakers
- Dyula [dyu] → 12.8M speakers


All 7 of these languages are spoken either in Africa or South Asia.

May 12, 2025, 15:38:40 UTC

This message violates our rules and is therefore hidden. Only administrators and the author of the message can see it.

sharptoothed, May 11, 2025, 07:12:12 UTC

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

May 8, 2025, 05:22:50 UTC

This message violates our rules and is therefore hidden. Only administrators and the author of the message can see it.

May 6, 2025, 11:54:49 UTC

This message violates our rules and is therefore hidden. Only administrators and the author of the message can see it.

atitarev, May 6, 2025, 05:16:06 UTC (edited May 6, 2025, 07:33:28 UTC)

Hi,

Please help me unlink the Korean sentence "이 문장을 설명해 주십시오." (i munjang-eul seolmyeonghae jusipsio.)
https://tatoeba.org/en/sentences/show/13203796
from https://tatoeba.org/en/sentences/show/2213956 ("Please translate this.")

It should only link to
https://tatoeba.org/en/sentences/show/60278 ("Please explain this sentence to me.")

I don't have the privilege to link/unlink sentences

Reply from araneo, May 6, 2025, 06:57:17 UTC

I have unlinked it :]

Reply from atitarev, May 6, 2025, 07:34:07 UTC

Thank you, @araneo!

May 4, 2025, 09:45:06 UTC

This message violates our rules and is therefore hidden. Only administrators and the author of the message can see it.