menu
Tatoeba
language
注册 登录
language 吳語
menu
Tatoeba

chevron_right 注册

chevron_right 登录

浏览

chevron_right 随机句子

chevron_right 选择闲话

chevron_right 选择列表

chevron_right 选择标签

chevron_right 选择音频

社群

chevron_right 留言墙

chevron_right 全部用户列表

chevron_right 用户额闲话

chevron_right 母语者

search
clear
swap_horiz
search

留言墙(7148则话题)

提醒

提问前头确定已经读了常见问题解答

阿拉额目标是保持文明讨论额健康氛围。 请读阿拉对于伐良行为额规定

最新留言 subdirectory_arrow_right

sharptoothed

1日前头

subdirectory_arrow_right

aymen

2日前头

subdirectory_arrow_right

Igider

3日前头

feedback

gillux

5日前头

feedback

sharptoothed

5日前头

subdirectory_arrow_right

frpzzd

15日前头

subdirectory_arrow_right

EugeneGS

15日前头

subdirectory_arrow_right

frpzzd

15日前头

subdirectory_arrow_right

EugeneGS

16日前头

subdirectory_arrow_right

frpzzd

16日前头

sharptoothed sharptoothed 5日前头 August 31, 2025 at 2:55:48 PM UTC flag Report link 永久链接

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

{{vm.hiddenReplies[41231] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
aymen aymen 2日前头 September 3, 2025 at 9:37:33 PM UTC flag Report link 永久链接

Thank you! ⵜⴰⵏⵎⵎⵉⵔⵜ!

{{vm.hiddenReplies[41235] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
sharptoothed sharptoothed 1日前头 September 4, 2025 at 4:10:06 PM UTC flag Report link 永久链接

You're welcome! ⴰⵏⵙⵓⴼ ⵢⴻⵙⵙⵯⴽ! :-)

felix63 felix63 22日前头 August 15, 2025 at 8:46:45 AM UTC flag Report link 永久链接

Heureux anniversaire, Lisa ! 🎂

{{vm.hiddenReplies[41215] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
marafon marafon 22日前头 August 15, 2025 at 12:50:59 PM UTC flag Report link 永久链接

Besser spät als nie. Herzlichen Glückwunsch zum Geburtstag, Lisa!

Pfirsichbaeumchen Pfirsichbaeumchen 20日前头 August 16, 2025 at 5:21:05 PM UTC flag Report link 永久链接

Vielen Dank! 😊

Auch Dir alles Gute zum Geburtstag, Christian! 🎉🎊🎈🎂

{{vm.hiddenReplies[41217] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
marafon marafon 20日前头 August 16, 2025 at 5:39:04 PM UTC flag Report link 永久链接

Je me joins aux félicitations !

3日前头,编辑3日前头 September 2, 2025 at 8:15:20 PM UTC,编辑September 2, 2025 at 8:17:33 PM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

{{vm.hiddenReplies[41233] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
Igider Igider 3日前头 September 2, 2025 at 8:33:45 PM UTC flag Report link 永久链接

🤣

gillux gillux 5日前头 September 1, 2025 at 11:40:52 AM UTC flag Report link 永久链接

Hello, Tatoeba was updated today. What’s new?

- Some listings have been limited to display the last 1000 sentences: the list of sentences having audio, the list of sentences in user-created lists, and the list of sentences having a specific tag. Background on this new limitation: https://github.com/Tatoeba/tatoeba2/pull/3199

- It is now possible to use @ immediately followed by a username to notify a member in Wall posts. It was already possible to do so in sentence comments, but not in Wall posts. Now you can do it in both.

- The API served at https://api.tatoeba.org was improved. While it is still a work in progress, it is now close to become stable. Technical details: https://github.com/Tatoeba/tatoeba2/pull/3201

Interface improvements:

- Error pages (such as 404 error) can now be localized, thanks to the contribution of SrulyP: https://github.com/Tatoeba/tatoeba2/pull/3198 The error pages can be localized just like any other part of the website, for how to contribute, please look at https://en.wiki.tatoeba.org/art...ce-translation

- For sentences in Chinese, the converted simplified/traditional script is now displayed correctly. Details: https://github.com/Tatoeba/tatoeba2/issues/3188

- 52 language icons should look nicer. They have been vectorized, which means they won’t look pixelated no matter how much you zoom in. A total of 153 out of 429 language icons (35%) are now vectorized. Today, it is my honor to publish the vectorized brazilian/portuguese flag of our deeply regretted fellow member Ricardo.

13日前头,编辑13日前头 August 23, 2025 at 5:56:16 PM UTC,编辑August 23, 2025 at 5:56:42 PM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

15日前头 August 22, 2025 at 6:25:19 AM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

frpzzd frpzzd 18日前头,编辑18日前头 August 18, 2025 at 7:22:04 PM UTC,编辑August 18, 2025 at 11:05:19 PM UTC flag Report link 永久链接

Hey all! Wanted to share that recently I wrote some scripts that help find sentences to translate on based on lists of vocab. Taking as input the Tatoeba corpus and a tiered list of vocab (e.g. lists of words ranked by the chapters in which they appear in a textbook) as well as some limited grammar concepts (cases, parts of speech, verb forms, etc), it generates tiered lists of sentences such that the sentences at level N only use the vocab/grammar at level N or lower.

I've been working through a Russian textbook and using it to find sentences here that appropriate for attempting to translate into English/Spanish based on the chapter I'm on. My scripts use the spaCy Python library to rank sentences by chapter, and this library is sometimes flawed in terms of lemmatization and grammar annotation, but so far it has been a huge help for me. (Of course, my translation attempts can also be very flawed, as the Russian natives here know... thank you marafon, Ooneykcall and EugeneGS for your patience! :-) )

As an example, here are some sentences (as of the current data dumps) at chapter 10 level based on my book:

1006495 Да, у меня есть дочь.
1006500 Вы изучаете английский.
10074251 У нас всё свежее.
10101554 Вода слишком холодная.
10110989 Кофе слишком горячий.

And here are some at chapter 20 level:

10031177 Интересно, пойдёт ли снег.
10031179 Интересно, будет ли снег.
10038848 Ничего странного я в этом не вижу.
10043008 Я ушёл домой пешком.
10055872 Он бежал в школу.

Just wanted to share this info here in case any other learners would find this helpful for their own studies. Whether you intend to translate sentences found this way or not, it can be helpful for finding lots of example sentences at your level when working through a textbook. If this sounds useful to you, feel free to reply / send me a message and I'll be happy to try generating ranked translated/untranslated sentence lists for your choice of language too.

{{vm.hiddenReplies[41222] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
gillux gillux 17日前头 August 20, 2025 at 11:03:21 AM UTC flag Report link 永久链接

That’s amazing, Franklin, thank you for telling us. As I understand, you are making a curated list of Tatoeba sentences organized by textbook level. Do you plan to make your list available on Tatoeba or elsewhere?

{{vm.hiddenReplies[41223] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd 16日前头 August 20, 2025 at 1:53:16 PM UTC flag Report link 永久链接

That is correct!

The sentences are selected programmatically, and I could make them available as lists on Tatoeba (as I have a list of sentence IDs comprising the lists), but I'm not aware of a feature on the site allowing users to form lists other than by manually clicking to add each sentence. Right now, the Russian sentences ranked by my textbook number over 100,000 so of course I cannot add them manually this way. As far as I understand, the Tatoeba API is also read-only for the moment. Is there a way to do form lists by uploading (long) lists of sentence IDs that I'm not aware of?

In any case, I may post some of the lists on Github corresponding to my particular textbook. But I wanted to mention it here in case anyone is similarly working through another textbook for any other language and could also make use of ranked sentences. I am currently only using this for Russian but it would be easy to do for many other languages, provided a vocab list.

{{vm.hiddenReplies[41224] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
EugeneGS EugeneGS 16日前头,编辑16日前头 August 20, 2025 at 5:58:56 PM UTC,编辑August 20, 2025 at 9:03:31 PM UTC flag Report link 永久链接

Hello, Franklin! I'm not sure if there's currently a way to upload lists directly on Tatoeba, but I once made a small program for myself that can automatically add sentences to a list by their IDs. If you'd like, you could share your sentence ID lists somewhere (for example on GitHub), and I could help import them into Tatoeba lists. (By the way, the program is rather slow — in my testing it processes about a thousand sentences per hour, so a large list would take some time.)

{{vm.hiddenReplies[41225] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd 15日前头 August 21, 2025 at 3:17:14 PM UTC flag Report link 永久链接

Hi Eugene, that would be fantastic! Here's a link to a TSV file in which the first column contains IDs of Tatoeba sentences in Russian, and the second column contains the "rankings", which are numbers 3-29 corresponding to chapters in the book. Ideally each chapter would get its own list.

https://gist.githubusercontent...._sentences.tsv

I haven't even added all of the book's vocab into my input data yet, since that is a slow-going process. For that reason, from chapter 24 onwards, ranking is based purely on grammar (as estimated by spaCy) and not by vocab for now.

{{vm.hiddenReplies[41226] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
EugeneGS EugeneGS 15日前头 August 21, 2025 at 6:31:10 PM UTC flag Report link 永久链接

Okay! I've started it. It will take a few days (around 4–6, maybe a bit more). Here is the link to all the lists:

https://gist.github.com/kilsens...4a27b9658e5e55

I'll update it with new lists as they appear. If you need, I can also rename the lists.

{{vm.hiddenReplies[41227] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd 15日前头 August 21, 2025 at 6:42:18 PM UTC flag Report link 永久链接

Thank you very much, this is fantastic! This will be a huge help. Previously I was copying over the IDs into the URL, which was a very clunky workflow.

19日前头 August 18, 2025 at 5:51:10 AM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

20日前头 August 17, 2025 at 1:18:20 PM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

sharptoothed sharptoothed 20日前头 August 17, 2025 at 10:03:27 AM UTC flag Report link 永久链接

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

26日前头 August 10, 2025 at 8:59:26 PM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

{{vm.hiddenReplies[41209] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
AmarMecheri AmarMecheri 22日前头 August 14, 2025 at 4:18:17 PM UTC flag Report link 永久链接

I don't understand why my message violates the Tatoeba.org charter. Please explain.
Thanks anyway for agreeing to distribute it to the interested party and the administrators.

Je ne comprends pas pourquoi mon message enfreindrait la charte de Tatoeba.org. Il serait bon de nous expliquer.
Merci quand même d'avoir accepté de le distribuer à l'intéressë et aux administrateurs.

{{vm.hiddenReplies[41212] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
Shishir Shishir 22日前头 August 14, 2025 at 7:02:59 PM UTC flag Report link 永久链接

Hi Amar, I think you made a mistake, your message was not hidden, it's right under this one. The hidden message you replied to was actually a Vietnamese advertisement.

{{vm.hiddenReplies[41213] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
AmarMecheri AmarMecheri 22日前头 August 14, 2025 at 7:59:27 PM UTC flag Report link 永久链接

@Shishir
I thank you so much for your useful explanatiion.