menu
तातोएबा
language
पंजीकरण लॉग इन
language हिन्दी
menu
तातोएबा

chevron_right पंजीकरण

chevron_right लॉग इन

ब्राउज़

chevron_right यादृच्छिक वाक्य दिखाएँ

chevron_right भाषा के अनुसार ब्राउज़ करें

chevron_right सूची के अनुसार ब्राउज़ करें

chevron_right टैग के अनुसार ब्राउज़ करें

chevron_right ऑडियो ब्राउज़ करें

समुदाय

chevron_right वाल

chevron_right सभी सदस्यों की सूची

chevron_right सदस्यों की भाषाएँ

chevron_right देशी वक्ता

search
clear
swap_horiz
search

वाल (7,134 धागे)

नुस्ख़े

सवाल पूछने से पहले, फ़ॅक जरूर पढ़ें।

हमारा उद्देश्य सभ्य चर्चाओं के लिए एक स्वस्थ वातावरण बनाए रखना है। कृपया हमारे बुरे व्यवहार के विरुद्ध नियम पढ़ें।

नवीनतम संदेश feedback

mraz

3 दिनों पहले

subdirectory_arrow_right

mraz

3 दिनों पहले

feedback

Dovud123

3 दिनों पहले

feedback

sharptoothed

3 दिनों पहले

subdirectory_arrow_right

frpzzd

11 दिनों पहले

subdirectory_arrow_right

hecko

11 दिनों पहले

subdirectory_arrow_right

frpzzd

12 दिनों पहले

subdirectory_arrow_right

araneo

12 दिनों पहले

subdirectory_arrow_right

gillux

12 दिनों पहले

subdirectory_arrow_right

araneo

12 दिनों पहले

tsunhua tsunhua 30 अप्रैल 2025 30 अप्रैल 2025 को 9:49:57 am UTC flag Report link पर्मालिंक

I'm thrilled to have discovered such an amazing website. I would like to suggest adding the Teochew dialect (a branch of Southern Min/Hokkien language) to the language options list.

{{vm.hiddenReplies[41047] ? 'expand_more' : 'expand_less'}} जवाब छिपाएँ जवाब दिखाएँ
lbdx lbdx 30 अप्रैल 2025 30 अप्रैल 2025 को 11:58:52 am UTC flag Report link पर्मालिंक

Sorry, the Teochew dialect cannot be added to Tatoeba as it does not have an ISO 639-3 language identifier. But feel free to add your sentences to our Southern Min/Min Nan Chinese [nan] corpus.

https://en.wikipedia.org/wiki/Teochew_Min

gillux gillux 18 मई 2025 18 मई 2025 को 7:20:21 am UTC flag Report link पर्मालिंक

As Ibdx said, unfortunately we cannot add it as we have the strict rule of following the ISO 639-3 standard.

Note that this standard is evolving slowly as people are requesting the addition of new languages, so it could be that Teochew is added at some point, but that would be years in the future, if it ever happen.

In the past there have been several requests made to split Min Nan Chinese into different languages, mostly rejected: https://iso639-3.sil.org/code_c...t_cd_value=nan

The latest request tried to include Teochew ("Tio-Sua"), but it was rejected: https://iso639-3.sil.org/request/2021-045

rdgscratch rdgscratch 16 मई 2025 16 मई 2025 को 10:43:20 pm UTC flag Report link पर्मालिंक

Can you do recordings of my sentences?

{{vm.hiddenReplies[41067] ? 'expand_more' : 'expand_less'}} जवाब छिपाएँ जवाब दिखाएँ
PaulP PaulP 18 मई 2025 18 मई 2025 को 4:07:46 am UTC flag Report link पर्मालिंक

You can do it yourself. Here's a short guide:
https://www.manythings.org/tatoeba/audacity.html

@CK will help you if you need assistance. But, btw, I see that you added sentences in about 60 languages. I don't suppose that you know how to pronounce them all, right?

I can do the Dutch and Esperanto sentences for you if they don't come from copyrighted sources.

sharptoothed sharptoothed 15 मई 2025 15 मई 2025 को 12:50:31 pm UTC flag Report link पर्मालिंक

✹✹ Stats & Graphs ✹✹

Tatoeba Top 30 Languages Graphs since Tatoeba "epoch"
https://tatoeba.j-langtools.com/epoch/

15 मई 2025 15 मई 2025 को 12:45:39 pm UTC link पर्मालिंक
warning

इस संदेश की सामग्री हमारे नियमों के खिलाफ जाती है और इसलिए इसे छिपाया गया था। यह केवल व्यवस्थापक और संदेश के लेखक को प्रदर्शित किया जाता है।

frpzzd frpzzd 12 मई 2025, संपादित 12 मई 2025 को 10:53:24 pm UTC, संपादित 12 मई 2025 को 11:02:57 pm UTC flag Report link पर्मालिंक

Just for funsies, I ran a script to list the languages that are least well represented on Tatoeba, compared to the estimated speaker population sizes of those languages. (Specifically, the languages were restricted to those with >= 1mil speakers sorted by the quotient of the number of sentences on Tatoeba to the speaker population size.)

As you might expect, many of the worst-represented languages by this metric are various different variants of Chinese. Aside from those, the top 10 worst-represented languages are:

1. Sindhi (snd, 6 sentences vs. ~38.4mil speakers)
2. Sesotho (sot, 2 sentences vs. ~6.4mil speakers)
3. Maithili (mai, 8 sentences vs. ~19.3mil speakers)
4. Madurese (mad, 8 sentences vs. ~17.0mil speakers)
5. Libyan Arabic (ayl, 3 sentences vs. ~5.6mil speakers)
6. Western Punjabi (pnb, 72 sentences vs. ~113mil speakers)
7. Aymara (aym, 2 sentences vs. ~2.8mil speakers)
8. Pashto (pus, 47 sentences vs. ~53.0mil speakers)
9. Igbo (ibo, 35 sentences vs. ~28.0mil speakers)
10. Sundanese (sun, 40 sentences vs. ~32.0mil speakers)

If we restrict instead to languages with an estimated number of speakers >= 50mil, then here are the top 5 (excluding Chinese variants):

1. Western Punjabi (pnb, 72 sentences vs. ~113mil speakers)
2. Pashto (pus, 47 sentences vs. ~53mil speakers)
3. Punjabi (pan, 204 sentences vs. ~200mil speakers)
4. Gujarati (guj, 168 sentences vs. ~60mil speakers)
5. Telugu (tel, 271 sentences vs. ~95mil speakers)

On a more cheery note, here are the 5 *best* represented languages (that are not conlangs) with >= 1mil speakers, by the same metric:

1. Kabyle (kab, ~765k sentences vs. ~3.4mil speakers)
2. Macedonian (mkd, ~78k sentences vs. ~1.4mil speakers)
3. Lithuanian (lit, ~123k sentences vs. ~2.3mil speakers)
4. Hungarian (hun, ~420k sentences vs. ~11.8mil speakers)
5. Finnish (fin, ~151k sentences vs. ~5.2mil speakers)

And those with >= 50mil speakers:

1. Italian (ita, ~918k sentences vs. ~65mil speakers)
2. Turkish (tur, ~739k sentences vs. ~76mil speakers)
3. German (deu, ~721k sentences vs. ~92mil speakers)
4. Russian (rus, ~1.1mil sentences vs. ~170mil speakers)
5. French (fra, ~665k sentences vs. ~203mil speakers)

{{vm.hiddenReplies[41063] ? 'expand_more' : 'expand_less'}} जवाब छिपाएँ जवाब दिखाएँ
lbdx lbdx 13 मई 2025, संपादित 13 मई 2025 को 4:13:08 pm UTC, संपादित 14 मई 2025 को 10:18:31 am UTC flag Report link पर्मालिंक

Thanks Franklin. It's interesting to see how Eurocentric the Tatoeba corpus still is.

Based on the 2025 edition of Ethnologue 200, I found that some of the world's 100 most widely spoken languages are still completely unavailable on Tatoeba:

- Nigerian Pidgin [pcm] → 120.7M speakkers
- Dari [prs] → 33.4M speakkers
- Magahi [mag] → 21.0M speakkers
- Chhattisgarhi [hne] → 16.3M speakkers
- Pedi [nso] → 13.7M speakkers
- Chittagonian [ctg] → 13.0M speakkers
- Dyula [dyu] → 12.8M speakkers


All 7 of these languages are spoken either in Africa or South Asia.

12 मई 2025 12 मई 2025 को 3:38:40 pm UTC link पर्मालिंक
warning

इस संदेश की सामग्री हमारे नियमों के खिलाफ जाती है और इसलिए इसे छिपाया गया था। यह केवल व्यवस्थापक और संदेश के लेखक को प्रदर्शित किया जाता है।

sharptoothed sharptoothed 11 मई 2025 11 मई 2025 को 7:12:12 am UTC flag Report link पर्मालिंक

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

8 मई 2025 8 मई 2025 को 5:22:50 am UTC link पर्मालिंक
warning

इस संदेश की सामग्री हमारे नियमों के खिलाफ जाती है और इसलिए इसे छिपाया गया था। यह केवल व्यवस्थापक और संदेश के लेखक को प्रदर्शित किया जाता है।

6 मई 2025 6 मई 2025 को 11:54:49 am UTC link पर्मालिंक
warning

इस संदेश की सामग्री हमारे नियमों के खिलाफ जाती है और इसलिए इसे छिपाया गया था। यह केवल व्यवस्थापक और संदेश के लेखक को प्रदर्शित किया जाता है।

atitarev atitarev 6 मई 2025, संपादित 6 मई 2025 को 5:16:06 am UTC, संपादित 6 मई 2025 को 7:33:28 am UTC flag Report link पर्मालिंक

Hi,

Pls help me unlink https://tatoeba.org/en/sentences/show/13203796
This Korean sentence "이 문장을 설명해 주십시오." (i munjang-eul seolmyeonghae jusipsio.)
from https://tatoeba.org/en/sentences/show/2213956 ("Please translate this.")

It should only link to
https://tatoeba.org/en/sentences/show/60278 ("Please explain this sentence to me.")

I don't have the privilege to link/unlink sentences

{{vm.hiddenReplies[41055] ? 'expand_more' : 'expand_less'}} जवाब छिपाएँ जवाब दिखाएँ
araneo araneo 6 मई 2025 6 मई 2025 को 6:57:17 am UTC flag Report link पर्मालिंक

I have unlinked it :]

{{vm.hiddenReplies[41056] ? 'expand_more' : 'expand_less'}} जवाब छिपाएँ जवाब दिखाएँ
atitarev atitarev 6 मई 2025 6 मई 2025 को 7:34:07 am UTC flag Report link पर्मालिंक

Thank you, @araneo!