menu
Tatoeba
language
Inscriber te Aperir session
language Interlingua
menu
Tatoeba

chevron_right Inscriber te

chevron_right Aperir session

Percurrer

chevron_right Monstrar phrase aleatori

chevron_right Percurrer per lingua

chevron_right Percurrer per lista

chevron_right Percurrer per etiquetta

chevron_right Percurrer audio

Communitate

chevron_right Muro

chevron_right Lista de tote le membros

chevron_right Linguas del membros

chevron_right Parlantes native

search
clear
swap_horiz
search

Wall (1 discussion)

Consilios

Ante de poner un question, assecura te de haber legite le FAQ.

Nostre intention es mantener un atmosphere salubre pro discussiones civilisate. Per favor lege nostre regulas contra mal conducta.

Ultime messages feedback

sharptoothed

un hora retro

subdirectory_arrow_right

Seael

heri

subdirectory_arrow_right

Shishir

heri

subdirectory_arrow_right

Vortarulo

heri

subdirectory_arrow_right

gillux

heri

subdirectory_arrow_right

gillux

heri

subdirectory_arrow_right

brauchinet

heri

feedback

gillux

heri

subdirectory_arrow_right

TATAR1

heri

feedback

Tartar

heri

rdgscratch rdgscratch 16 de maio 2025 16 de maio 2025 a 22:43:20 UTC flag Report link Permaligamine

Can you do recordings of my sentences?

{{vm.hiddenReplies[41067] ? 'expand_more' : 'expand_less'}} celar responsas monstrar responsas
PaulP PaulP 18 de maio 2025 18 de maio 2025 a 04:07:46 UTC flag Report link Permaligamine

You can do it yourself. Here's a short guide:
https://www.manythings.org/tatoeba/audacity.html

@CK will help you if you need assistance. But, btw, I see that you added sentences in about 60 languages. I don't suppose that you know how to pronounce them all, right?

I can do the Dutch and Esperanto sentences for you if they don't come from copyrighted sources.

sharptoothed sharptoothed 15 de maio 2025 15 de maio 2025 a 12:50:31 UTC flag Report link Permaligamine

✹✹ Stats & Graphs ✹✹

Tatoeba Top 30 Languages Graphs since Tatoeba "epoch"
https://tatoeba.j-langtools.com/epoch/

15 de maio 2025 15 de maio 2025 a 12:45:39 UTC link Permaligamine
warning

Le contento de iste message infringe nostre regulas e ha dunque essite celate. Illo es monstrate solmente al administratores e al autor del message.

frpzzd frpzzd 12 de maio 2025, modificate le 12 de maio 2025 12 de maio 2025 a 22:53:24 UTC, modificate le 12 de maio 2025 a 23:02:57 UTC flag Report link Permaligamine

Just for funsies, I ran a script to list the languages that are least well represented on Tatoeba, compared to the estimated speaker population sizes of those languages. (Specifically, the languages were restricted to those with >= 1mil speakers sorted by the quotient of the number of sentences on Tatoeba to the speaker population size.)

As you might expect, many of the worst-represented languages by this metric are various different variants of Chinese. Aside from those, the top 10 worst-represented languages are:

1. Sindhi (snd, 6 sentences vs. ~38.4mil speakers)
2. Sesotho (sot, 2 sentences vs. ~6.4mil speakers)
3. Maithili (mai, 8 sentences vs. ~19.3mil speakers)
4. Madurese (mad, 8 sentences vs. ~17.0mil speakers)
5. Libyan Arabic (ayl, 3 sentences vs. ~5.6mil speakers)
6. Western Punjabi (pnb, 72 sentences vs. ~113mil speakers)
7. Aymara (aym, 2 sentences vs. ~2.8mil speakers)
8. Pashto (pus, 47 sentences vs. ~53.0mil speakers)
9. Igbo (ibo, 35 sentences vs. ~28.0mil speakers)
10. Sundanese (sun, 40 sentences vs. ~32.0mil speakers)

If we restrict instead to languages with an estimated number of speakers >= 50mil, then here are the top 5 (excluding Chinese variants):

1. Western Punjabi (pnb, 72 sentences vs. ~113mil speakers)
2. Pashto (pus, 47 sentences vs. ~53mil speakers)
3. Punjabi (pan, 204 sentences vs. ~200mil speakers)
4. Gujarati (guj, 168 sentences vs. ~60mil speakers)
5. Telugu (tel, 271 sentences vs. ~95mil speakers)

On a more cheery note, here are the 5 *best* represented languages (that are not conlangs) with >= 1mil speakers, by the same metric:

1. Kabyle (kab, ~765k sentences vs. ~3.4mil speakers)
2. Macedonian (mkd, ~78k sentences vs. ~1.4mil speakers)
3. Lithuanian (lit, ~123k sentences vs. ~2.3mil speakers)
4. Hungarian (hun, ~420k sentences vs. ~11.8mil speakers)
5. Finnish (fin, ~151k sentences vs. ~5.2mil speakers)

And those with >= 50mil speakers:

1. Italian (ita, ~918k sentences vs. ~65mil speakers)
2. Turkish (tur, ~739k sentences vs. ~76mil speakers)
3. German (deu, ~721k sentences vs. ~92mil speakers)
4. Russian (rus, ~1.1mil sentences vs. ~170mil speakers)
5. French (fra, ~665k sentences vs. ~203mil speakers)

{{vm.hiddenReplies[41063] ? 'expand_more' : 'expand_less'}} celar responsas monstrar responsas
lbdx lbdx 13 de maio 2025, modificate le 14 de maio 2025 13 de maio 2025 a 16:13:08 UTC, modificate le 14 de maio 2025 a 10:18:31 UTC flag Report link Permaligamine

Thanks Franklin. It's interesting to see how Eurocentric the Tatoeba corpus still is.

Based on the 2025 edition of Ethnologue 200, I found that some of the world's 100 most widely spoken languages are still completely unavailable on Tatoeba:

- Nigerian Pidgin [pcm] → 120.7M speakkers
- Dari [prs] → 33.4M speakkers
- Magahi [mag] → 21.0M speakkers
- Chhattisgarhi [hne] → 16.3M speakkers
- Pedi [nso] → 13.7M speakkers
- Chittagonian [ctg] → 13.0M speakkers
- Dyula [dyu] → 12.8M speakkers


All 7 of these languages are spoken either in Africa or South Asia.

12 de maio 2025 12 de maio 2025 a 15:38:40 UTC link Permaligamine
warning

Le contento de iste message infringe nostre regulas e ha dunque essite celate. Illo es monstrate solmente al administratores e al autor del message.

sharptoothed sharptoothed 11 de maio 2025 11 de maio 2025 a 07:12:12 UTC flag Report link Permaligamine

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

8 de maio 2025 8 de maio 2025 a 05:22:50 UTC link Permaligamine
warning

Le contento de iste message infringe nostre regulas e ha dunque essite celate. Illo es monstrate solmente al administratores e al autor del message.

6 de maio 2025 6 de maio 2025 a 11:54:49 UTC link Permaligamine
warning

Le contento de iste message infringe nostre regulas e ha dunque essite celate. Illo es monstrate solmente al administratores e al autor del message.

atitarev atitarev 6 de maio 2025, modificate le 6 de maio 2025 6 de maio 2025 a 05:16:06 UTC, modificate le 6 de maio 2025 a 07:33:28 UTC flag Report link Permaligamine

Hi,

Pls help me unlink https://tatoeba.org/en/sentences/show/13203796
This Korean sentence "이 문장을 설명해 주십시오." (i munjang-eul seolmyeonghae jusipsio.)
from https://tatoeba.org/en/sentences/show/2213956 ("Please translate this.")

It should only link to
https://tatoeba.org/en/sentences/show/60278 ("Please explain this sentence to me.")

I don't have the privilege to link/unlink sentences

{{vm.hiddenReplies[41055] ? 'expand_more' : 'expand_less'}} celar responsas monstrar responsas
araneo araneo 6 de maio 2025 6 de maio 2025 a 06:57:17 UTC flag Report link Permaligamine

I have unlinked it :]

{{vm.hiddenReplies[41056] ? 'expand_more' : 'expand_less'}} celar responsas monstrar responsas
atitarev atitarev 6 de maio 2025 6 de maio 2025 a 07:34:07 UTC flag Report link Permaligamine

Thank you, @araneo!

4 de maio 2025 4 de maio 2025 a 09:45:06 UTC link Permaligamine
warning

Le contento de iste message infringe nostre regulas e ha dunque essite celate. Illo es monstrate solmente al administratores e al autor del message.