menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Wall (6,084 threads)

Tips

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Latest messages subdirectory_arrow_right

Pfirsichbaeumchen

10 hours ago

subdirectory_arrow_right

DJ_Saidez

11 hours ago

subdirectory_arrow_right

Pfirsichbaeumchen

12 hours ago

subdirectory_arrow_right

Sobsz

15 hours ago

subdirectory_arrow_right

AlanF_US

15 hours ago

feedback

Sobsz

21 hours ago

subdirectory_arrow_right

deniko

21 hours ago

feedback

CK

2 days ago

subdirectory_arrow_right

Guybrush88

2 days ago

subdirectory_arrow_right

Igider

2 days ago

CK CK November 30, 2020 November 30, 2020 at 4:34:23 AM UTC link Permalink

** Milestone Passed **

The Tatoeba Project now has over 9 million sentences.

{{vm.hiddenReplies[36237] ? 'expand_more' : 'expand_less'}} hide replies show replies
Igider Igider November 30, 2020 November 30, 2020 at 12:13:04 PM UTC link Permalink

Congrats!
Keep going@

Ergulis Ergulis November 30, 2020 November 30, 2020 at 4:05:01 PM UTC link Permalink

That's great.

imalaqvayli imalaqvayli November 30, 2020 November 30, 2020 at 5:21:11 PM UTC link Permalink

Nice !

DJ_Saidez DJ_Saidez November 30, 2020 November 30, 2020 at 5:33:56 PM UTC link Permalink

すごい!
Buen trabajo a todos!

Pfirsichbaeumchen Pfirsichbaeumchen November 22, 2020 November 22, 2020 at 8:40:01 AM UTC link Permalink

Ergulis (Josef): https://tatoeba.org/deu/user/profile/Ergulis

[DEU] Korpuspflegerkandidat für Tschechisch

Josef kandidiert als Korpuspfleger für Tschechisch, um bei der Korrektur fehlerhafter Sätze nicht mehr aktiver Mitglieder zu helfen. Wie immer ist jeder eingeladen, sich hierzu in einer Privatnachricht zu äußern (einfach auf die Verknüpfung am Ende dieser Nachricht klicken).

[ENG] Corpus Maintainer Candidate for Czech

Josef has offered himself as a corpus maintainer for Czech to help apply necessary corrections to sentences owned by inactive members. As usual, you are all invited to give us your feedback in a private message (click on the link at the end of this message).

[EPO] Kandidato por iĝi bontenanto de la ĉeĥa frazaro

Josef kandidatas por iĝi bontenanto de la ĉeĥa frazaro kaj do helpi la korektadon de eraraj frazoj de anoj ne plu aktivaj. Kiel ĉiam ni invitas ĉiun komenti pri tio en privata mesaĝo (simple alklaku la ligilon je la fino de ĉi tiu mesaĝo).

[JPN] チェコ語コーパス整備員候補者
Josefさんは、非アクティブなメンバーが所持する例文を必要に応じて改善できるチェコ語コーパス整備員になりたいと志願しています。従来通り、プライベートメッセージでフィードバックをお寄せください。(このメッセージの文末にあるリンクをクリックしてください。)

https://tatoeba.org/private_mes...rsichbaeumchen

{{vm.hiddenReplies[36188] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen November 29, 2020 November 29, 2020 at 1:11:54 AM UTC link Permalink

Josef ...

... ist jetzt Korpuspfleger für das Tschechische.
... is corpus maintainer for Czech now.
... nun estas frazara bontenanto de la ĉeĥa frazaro.

{{vm.hiddenReplies[36224] ? 'expand_more' : 'expand_less'}} hide replies show replies
DJ_Saidez DJ_Saidez November 29, 2020 November 29, 2020 at 6:48:01 AM UTC link Permalink

Congrats 🥳

Ergulis Ergulis November 29, 2020 November 29, 2020 at 8:39:29 AM UTC link Permalink

Hello friends,

Thanks you very much for granting me the corpus maintainer post. I really appreciate that.

{{vm.hiddenReplies[36227] ? 'expand_more' : 'expand_less'}} hide replies show replies
ZegPhig ZegPhig November 29, 2020 November 29, 2020 at 8:56:13 AM UTC link Permalink

Congrats!🥳
Поздравляю!
Gratulon!

Ricardo14 Ricardo14 November 29, 2020 November 29, 2020 at 11:14:33 PM UTC link Permalink

Congratulations!

AlanF_US AlanF_US November 30, 2020 November 30, 2020 at 12:22:24 AM UTC link Permalink

Congratulations, Josef!

Tepan Tepan November 29, 2020 November 29, 2020 at 9:42:47 AM UTC link Permalink

Congrats! :)

Cabo Cabo November 29, 2020 November 29, 2020 at 10:11:07 AM UTC link Permalink

Gratulace!

small_snow small_snow November 29, 2020 November 29, 2020 at 11:10:06 AM UTC link Permalink

おめでとうございます!ご健勝とご飛躍を祈念申し上げます:)

bandeirante bandeirante November 29, 2020 November 29, 2020 at 12:15:52 PM UTC link Permalink

Kedves magyar nyelvű felhasználók!

A mai nappal lemondtam korpuszfenntartói tisztségemről, mert nem kívánok asszisztálni a magyar nyelvű korpusz minőségi mélyrepüléséhez. A jövőben semmilyen formában nem kívánok részt venni a magyar nyelvű korpusz fenntartásában. Mindenkinek köszönöm az együttműködést!

{{vm.hiddenReplies[36232] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cabo Cabo November 29, 2020 November 29, 2020 at 12:30:10 PM UTC link Permalink

Hány mondatot javítottál ki, amin change cimke volt?

Igider Igider November 28, 2020, edited November 28, 2020 November 28, 2020 at 4:56:59 PM UTC, edited November 28, 2020 at 5:12:07 PM UTC link Permalink

Iwiziwen/ Tiwiziwin,

Lemdet qbel "ilugan n tira" d wamek ara teddum ɣer wannar-a, mulac d aserwet kan. Ma yella ad nesseɣtay kan, d aḍeggeṛ n wakud akked tezmert n wallen d weɛrur ara tent-yaɣen... Mulac ad ɣ-tḥettmem ad nettlusu nnwaḍeṛ-nni n tqeṛɛet n lgazuz (am win akken-nni meskin!)! Cuḥḥet-aɣ ttxil-wet/kent!

Tanemmirt.

Luce Luce November 28, 2020, edited November 30, 2020 November 28, 2020 at 11:57:52 AM UTC, edited November 30, 2020 at 8:20:17 AM UTC link Permalink

« Vava atan dagi » blayak gan Salahnamar sometir kan Kotava.

Pfirsichbaeumchen Pfirsichbaeumchen November 28, 2020, edited November 28, 2020 November 28, 2020 at 2:11:15 AM UTC, edited November 28, 2020 at 2:12:24 AM UTC link Permalink

@PaulP

Alles Gute zum Geburtstag, lieber Paul! Many happy returns of the day! Feliĉan naskiĝotagon! 🎉

{{vm.hiddenReplies[36220] ? 'expand_more' : 'expand_less'}} hide replies show replies
PaulP PaulP November 28, 2020 November 28, 2020 at 6:55:22 AM UTC link Permalink

Danke schön, liebe Lisa! Koran dankon, thanks a lot!

fjay69 fjay69 November 15, 2020 November 15, 2020 at 8:17:15 AM UTC link Permalink

We have over 200 thousands Japanese sentences now!
現在、20万以上の日本語の文があります!

{{vm.hiddenReplies[36141] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen November 15, 2020, edited November 15, 2020 November 15, 2020 at 8:24:11 AM UTC, edited November 15, 2020 at 8:26:34 AM UTC link Permalink

A reason for great joy. Hooray! おめでとうございます。大喜びですね😊

Igider Igider November 15, 2020, edited November 15, 2020 November 15, 2020 at 9:04:00 AM UTC, edited November 15, 2020 at 9:04:25 AM UTC link Permalink

Congrats!! よくやった!Ayyuz!

Hybrid Hybrid November 15, 2020 November 15, 2020 at 12:53:46 PM UTC link Permalink

Congratulations to everyone!

small_snow small_snow November 15, 2020 November 15, 2020 at 10:16:26 PM UTC link Permalink

I see; https://tatoeba.org/jpn/sentenc...&sort_reverse=
✨🎉

CK CK November 16, 2020, edited November 16, 2020 November 16, 2020 at 12:40:43 AM UTC, edited November 16, 2020 at 12:41:09 AM UTC link Permalink

Of these Japanese sentences, ...

67,731 are owned by the 51 native Japanese speakers.

117,460 are "orphans" (sentences with no owners)

The rest are owned by non-native Japanese speakers.

For those who are seriously studying Japanese, I would recommend that you only trust the ones owned by native Japanese speakers.

{{vm.hiddenReplies[36147] ? 'expand_more' : 'expand_less'}} hide replies show replies
DJ_Saidez DJ_Saidez November 16, 2020, edited November 16, 2020 November 16, 2020 at 1:20:23 AM UTC, edited November 16, 2020 at 3:08:54 AM UTC link Permalink

(redacted)

It'd be great if we could get more regular Japanese speakers on board with this project

I appreciate all the work that bunbuku and small_snow have done in maintaining the Japanese corpus, and feel like they deserve more help

Edit: forgot to mention tommy_san and I’m probably forgetting others

{{vm.hiddenReplies[36148] ? 'expand_more' : 'expand_less'}} hide replies show replies
bill bill November 16, 2020 November 16, 2020 at 1:27:59 AM UTC link Permalink

No one in their right mind would do that. You would have to have a lot of free time to go over 117,460 sentences.

{{vm.hiddenReplies[36149] ? 'expand_more' : 'expand_less'}} hide replies show replies
DJ_Saidez DJ_Saidez November 16, 2020 November 16, 2020 at 1:32:59 AM UTC link Permalink

I worded it wrong, sorry

{{vm.hiddenReplies[36150] ? 'expand_more' : 'expand_less'}} hide replies show replies
bill bill November 16, 2020 November 16, 2020 at 1:36:04 AM UTC link Permalink

It happens. 👍

{{vm.hiddenReplies[36151] ? 'expand_more' : 'expand_less'}} hide replies show replies
CK CK November 16, 2020, edited November 16, 2020 November 16, 2020 at 1:53:52 AM UTC, edited November 16, 2020 at 2:18:41 AM UTC link Permalink

Until we run out of native-owned Japanese sentences for you to translate, I don't think you need to pressure anyone into proofreading all these.

I know how boring and tedious this can be. I proofread all the English sentences in the Tanaka Corpus up to a given length and adopted the ones that I thought sounded good and natural, editing the ones that could easily be edited, since they weren't already linked to various languages, which complicates editing. I ignored the long, multi-sentence examples, thinking I would eventually proofread those later.

Here are some sentences you could translate owned by 3 of our Japanese contributors with lots of sentences.

https://tatoeba.org/eng/sentenc...n&user=bunbuku
bunbuku's Japanese sentences with no translations.

https://tatoeba.org/eng/sentenc...ser=small_snow
small_snow's Japanese sentences with no translations

https://tatoeba.org/eng/sentenc...user=tommy_san
tommy_san's Japanese sentences with no translations

{{vm.hiddenReplies[36152] ? 'expand_more' : 'expand_less'}} hide replies show replies
DJ_Saidez DJ_Saidez November 16, 2020 November 16, 2020 at 5:16:22 AM UTC link Permalink

I mentioned it because I originally came here from Clozemaster, and the orphan sentences turned up more often than not in the Japanese section, and some of them made no sense to me
I’m not worried about running out of sentences to translate

{{vm.hiddenReplies[36153] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cabo Cabo November 16, 2020, edited November 16, 2020 November 16, 2020 at 11:02:16 AM UTC, edited November 16, 2020 at 11:05:06 AM UTC link Permalink

Clozemaster is not enough good for learning a language.
iKnow has better sentence examples for words in order: https://iknow.jp/content/japanese

I use clozemaster. I have an extension to mark words.
The 'English from Magyar' is not only contains translations of Hungarian, but translations of translations, and later if the sentence is changed in Tatoeba, in clozemaster it remains unchanged.

{{vm.hiddenReplies[36155] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US November 16, 2020 November 16, 2020 at 6:04:11 PM UTC link Permalink

I think the quality of Clozemaster varies from language pair to language pair. The English->Russian pair is good, and has helped me quite a bit, but there are other pairs that have the problems you describe.

Thanuir Thanuir November 16, 2020 November 16, 2020 at 7:27:30 PM UTC link Permalink

Norjan oppiminen on samanlaista hupia; tietokantaa ei ole päivitetty Tatoebasta pitkään aikaan. Toisaalta minulla on tarpeeksi esitietoja, että huomaan aika monet kirjoitusvirheet eikä se ole ainoa menetelmä, jota käytän.

DJ_Saidez DJ_Saidez November 17, 2020, edited November 17, 2020 November 17, 2020 at 4:13:10 AM UTC, edited November 17, 2020 at 4:13:35 AM UTC link Permalink

I've stopped using Clozemaster months ago and switched to Anki today using confirmed sentences from here
I also have a wide array of resources for Japanese

I just mentioned them because I found out about this through there.

maaster maaster November 22, 2020, edited November 22, 2020 November 22, 2020 at 7:09:03 AM UTC, edited November 22, 2020 at 9:49:10 AM UTC link Permalink

Congratulations!

(In the meantime, the Kabyle competitors also passed a mile stone as a rocket – 300,000 sentences!)

{{vm.hiddenReplies[36187] ? 'expand_more' : 'expand_less'}} hide replies show replies
K_hina K_hina November 22, 2020 November 22, 2020 at 4:10:53 PM UTC link Permalink

Thank you @maaster.

Selyan Selyan November 23, 2020 November 23, 2020 at 11:18:36 AM UTC link Permalink

@maaster
Thank you

Igider Igider November 23, 2020 November 23, 2020 at 11:32:16 AM UTC link Permalink

Thank you @Maaster.
Regards.

bill bill November 22, 2020, edited November 22, 2020 November 22, 2020 at 2:56:11 PM UTC, edited November 22, 2020 at 2:59:44 PM UTC link Permalink

Is this a bot account? https://tatoeba.org/eng/user/profile/Bilgi

I don't know about you guys, but I think new users shouldn't be allowed to add so many sentences at once.

{{vm.hiddenReplies[36189] ? 'expand_more' : 'expand_less'}} hide replies show replies
soliloquist soliloquist November 22, 2020, edited November 22, 2020 November 22, 2020 at 3:16:24 PM UTC, edited November 22, 2020 at 3:50:17 PM UTC link Permalink

>I think new users shouldn't be allowed to add so many sentences at once.

It belongs to me. I could have added those sentences with my main account, but I preffered to create a separate one that is dedicated to that content. I have some other "special purpose" accounts, too. Since I use the same IP address, anyone that has access to IP logs can see that it's me. Maybe I should slow down the speed a bit. Sorry for the inconvenience.

{{vm.hiddenReplies[36190] ? 'expand_more' : 'expand_less'}} hide replies show replies
bill bill November 22, 2020 November 22, 2020 at 3:38:02 PM UTC link Permalink

Thanks for replying, soliloquist. I thought it was a troll spamming the Turkish Corpus.

{{vm.hiddenReplies[36191] ? 'expand_more' : 'expand_less'}} hide replies show replies
soliloquist soliloquist November 22, 2020 November 22, 2020 at 3:39:03 PM UTC link Permalink

Thanks for your concern.

CK CK November 22, 2020 November 22, 2020 at 3:56:35 AM UTC link Permalink

** Updated Projects **

http://www.manythings.org/anki/
Tab-delimited Bilingual Sentence Pairs

http://www.manythings.org/bilingual/
Bilingual Sentence Pairs - Selected Sentences from the Tatoeba Corpus

** Related Stats **

http://tatoeba.ueuo.com/stats-2...907-links.html
Native-speaker-owned sentences in each language that are linked to English sentences on List 907

http://tatoeba.ueuo.com/stats-2...ed-to-907.html
Counts by Username of Native Speaker-Owned Sentences Linked to Sentences on List 907


Thanks to all of you who help make these projects possible.

fjay69 fjay69 November 18, 2020, edited November 18, 2020 November 18, 2020 at 2:33:17 PM UTC, edited November 18, 2020 at 11:32:55 PM UTC link Permalink

Found a search engine bug.
The result of searching "sweep" does not include sentences with "swept".
Can anyone make a proper issue on GitHub?

{{vm.hiddenReplies[36167] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cabo Cabo November 18, 2020 November 18, 2020 at 5:02:41 PM UTC link Permalink

Is it a bug? I don't think so. Just one of the irregular verbs.
Do you see 'saw' or 'seen' if you search for word 'see'?

{{vm.hiddenReplies[36168] ? 'expand_more' : 'expand_less'}} hide replies show replies
fjay69 fjay69 November 18, 2020 November 18, 2020 at 5:10:46 PM UTC link Permalink

I had no idea that it doesn't work with irregular verbs. Forgive me please.

{{vm.hiddenReplies[36169] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cabo Cabo November 18, 2020 November 18, 2020 at 6:02:33 PM UTC link Permalink

Forgive for what?
You had an assumption that this is a bug. I would say, it is not a bug. I'm not a programmer or something, I just say what I've seen.

{{vm.hiddenReplies[36170] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US November 19, 2020, edited November 19, 2020 November 19, 2020 at 2:22:25 PM UTC, edited November 19, 2020 at 2:24:17 PM UTC link Permalink

To elaborate: We're talking about inexact ("fuzzy") matches that involve stemming, a feature supported for about two dozen languages, which are listed on this page:

https://en.wiki.tatoeba.org/art...w/text-search#

The stemmer works by removing common suffixes from both the search words and the words contained in sentences to see whether the remaining parts of the words match. For instance, if you remove "-ing" from "walking" and "-ed" from "walked", the results match (and also match the bare form "walk").

There are two things to know about the stemmer:

(1) It doesn't belong to us, so the best we can do is pass along reports, and I don't think we even do that for individual words.
(2) It is not supposed to be 100% accurate. We expect to see both false positives (for example, "sing" matches "singe") and false negatives ("sweep" doesn't match "swept", as you saw).

You can explicitly combine word forms with a vertical bar in order to compensate for false negatives, and you can use the equals sign to specify an exact match in order to compensate for false positives. Note that forms recognized by the stemmer do not necessarily have to be verbs. For instance, "beauty" will match "beautiful".

CK CK November 18, 2020 November 18, 2020 at 11:33:16 PM UTC link Permalink

For irregular verbs, you need to search like this.

sleep|slept
swim|swam|swum

Here is a page set up with all forms of the English verbs included in the search links.

http://tatoeba.ueuo.com/eng-verbs.html

{{vm.hiddenReplies[36172] ? 'expand_more' : 'expand_less'}} hide replies show replies
belkacem77 belkacem77 November 21, 2020, edited November 21, 2020 November 21, 2020 at 1:28:55 PM UTC, edited November 21, 2020 at 5:31:24 PM UTC link Permalink

Stemming in Kabyle is a complex process because of its morphology.

nouns and adjectives in kabyle have a state : free state and annexed state. So, there is a change at the beginning of the word following the state.

Verbs also are very inflectional, so it's hard to find the stem from a flexion.

We are working on a stemmer and a lemmatizer for Kabyle language. We will share the work when done

{{vm.hiddenReplies[36179] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US November 21, 2020 November 21, 2020 at 3:33:38 PM UTC link Permalink

Interesting. Have you been in touch with the people at Manticore Search, who produce our search engine?

{{vm.hiddenReplies[36180] ? 'expand_more' : 'expand_less'}} hide replies show replies
belkacem77 belkacem77 November 21, 2020 November 21, 2020 at 5:29:11 PM UTC link Permalink

I've just heard about it. Is there any link, any contact information?

{{vm.hiddenReplies[36181] ? 'expand_more' : 'expand_less'}} hide replies show replies
Yorwba Yorwba November 21, 2020 November 21, 2020 at 7:12:34 PM UTC link Permalink

Manticore Search's stemming system is actually based on Snowball https://snowballstem.org/ , which is a specialized programming language for writing stemmers as well as a collection of stemmers for various languages.

AFAIK, Kabyle is an Afroasiatic language distantly related to Arabic, so you might want to have a look at Snowball's Arabic stemmer: https://github.com/snowballstem...hms/arabic.sbl

As for sharing your work, see their CONTRIBUTING.rst file https://github.com/snowballstem...NTRIBUTING.rst and maybe also have a look at what it took to get some other languages added (the most recent addition being Yiddish: https://github.com/snowballstem/snowball/pull/137 )

{{vm.hiddenReplies[36182] ? 'expand_more' : 'expand_less'}} hide replies show replies
belkacem77 belkacem77 November 21, 2020, edited November 21, 2020 November 21, 2020 at 7:23:36 PM UTC, edited November 21, 2020 at 7:24:14 PM UTC link Permalink

Thank you @Yorwba for sharing. I'll see this.
Indeed, Kabyle belongs to the Berber family, the same branch as Arabic, Hebrew, Amharic, . the derivation and inflectional systems are almost similar. But the script is different. Kabyle is written with Latin script.
I will see the project.