menu
Tatoeba
language
注册 登录
language 吳語
menu
Tatoeba

chevron_right 注册

chevron_right 登录

浏览

chevron_right 随机句子

chevron_right 选择闲话

chevron_right 选择列表

chevron_right 选择标签

chevron_right 选择音频

社群

chevron_right 留言墙

chevron_right 全部用户列表

chevron_right 用户额闲话

chevron_right 母语者

search
clear
swap_horiz
search

留言墙(7161则话题)

提醒

提问前头确定已经读了常见问题解答

阿拉额目标是保持文明讨论额健康氛围。 请读阿拉对于伐良行为额规定

最新留言 subdirectory_arrow_right

brauchinet

1日前头

feedback

gillux

2日前头

subdirectory_arrow_right

TATAR1

4日前头

feedback

Tartar

4日前头

subdirectory_arrow_right

TATAR1

4日前头

subdirectory_arrow_right

Rok

4日前头

subdirectory_arrow_right

TATAR1

5日前头

subdirectory_arrow_right

TATAR1

5日前头

subdirectory_arrow_right

Tatar

5日前头

subdirectory_arrow_right

Feniks

5日前头

sharptoothed sharptoothed July 27, 2025 July 27, 2025 at 6:20:17 AM UTC flag Report link 永久链接

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

July 25, 2025 July 25, 2025 at 6:05:04 AM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

July 23, 2025 July 23, 2025 at 7:51:28 AM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

gillux gillux July 17, 2025 July 17, 2025 at 10:35:51 AM UTC flag Report link 永久链接

Hello, Tatoeba was updated today. What’s new?

- Content report: there is a new "flag" button on the top-right of Wall posts and sentence comments to ease the report of inappropriate content to admins. This is a counter-measure to the increased spam Tatoeba is seeing recently, and also a feature some people have been asking for in the past.

- Two new languages have been added: Svan and Ao Naga. Cheers to abiniz and monsen_sanang_ai, respectively, for requesting them! Tatoeba now supports 429 languages.

Learn more about Svan: https://en.wikipedia.org/wiki/Svan_language
Learn more about Ao Naga: https://en.wikipedia.org/wiki/Ao_language

- A total of 50 language icons (flags) should now look nicer. The icons have been updated to a vector-based image instead of a raster image, which means they won’t look pixelated or blurry but always sharp, even when zooming in.

{{vm.hiddenReplies[41177] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
Waldelfe Waldelfe July 18, 2025,编辑July 18, 2025 July 18, 2025 at 4:12:09 AM UTC,编辑July 18, 2025 at 4:12:40 AM UTC flag Report link 永久链接

Thanks for the update, Gillux.

If I might use this opportunity to make a suggestion, I think it would be good to implement something that will make it more difficult for spam accounts to register themselves in the first place (captcha, email links etc.?) and delete tens of thousands of existing ones (there are quite possibly over a hundred thousand by now). Nine out of ten newly registered accounts are spammers. Of course, most of them remain silent and passive with ads and links in their profiles. This cannot be not a problem.

{{vm.hiddenReplies[41178] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd July 18, 2025,编辑July 18, 2025 July 18, 2025 at 4:24:07 AM UTC,编辑July 18, 2025 at 4:47:59 AM UTC flag Report link 永久链接

I agree. I have been watching the spam accounts come in literally by the minute.

I’ve been reporting spam comments as I see them, but it’s troubling that they can still have a bunch of links in their profile without making any posts. I’d like to help out with that if there’s any way I can. Probably I am too new of a user to have admin privileges, but if it would be helpful, I’m happy to throw together a script that will make a list of likely spam accounts based on common indicators (long link lists in the bio, certain keywords, lack of languages or sentences, etc). Then some admin can run through the list and delete the ones they see fit.

EDIT: just realized that user bios aren’t included in the downloadable data dumps, so maybe that won’t work.

{{vm.hiddenReplies[41179] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
gillux gillux July 18, 2025 July 18, 2025 at 7:09:27 AM UTC flag Report link 永久链接

frpzzd, thank you for your help. I have tried to figure out a common feature of spam accounts, but there is nothing really standing out, so I also think we’d need some human verification to confirm account purge.

However, I think it is too early to start deleting spam accounts. It would be better to find a way to stop the influx first, and only then start to cleanup.

As you realized, we do not export user bios. If you are willing to help, I can provide you with a database export of user bios and other information.

{{vm.hiddenReplies[41182] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd July 18, 2025,编辑July 18, 2025 July 18, 2025 at 5:04:45 PM UTC,编辑July 18, 2025 at 5:05:33 PM UTC flag Report link 永久链接

Certainly, carrying out any kind of account purge without double-checking each account would be a bad idea. Maybe the job can be streamlined with some analysis of the spam accounts though.

If you think it is too early to begin removing the spam accounts, I guess there's no rush. But in any case, I'd definitely love to take a look at a data dump of user bios if you're willing to share one. Maybe there are some useful features that can be extracted.

gillux gillux July 18, 2025 July 18, 2025 at 7:02:13 AM UTC flag Report link 永久链接

Waldelfe, I understand your concern. I am aware of the constant influx of spammy accounts, and be sure that I am willing to stop it. But this is not an easy task.

About adding a confirmation email step, I think it’s is unlikely to help. According to my own research, these accounts are created by humans. They are already going through as many open-registration websites as possible, to create user pages containing a lot of links (see, for example, user xx88art56). Tatoeba is likely the only website in 2025 not requiring email confirmation. If they can do it on other websites which require email confirmation, they will keep doing it on Tatoeba regardless of email confirmation.

About adding a captcha, I don’t think the benefits of the captcha (less spam accounts that are barely noticeable) outweigh the drawbacks (less legitimate users are able to register). Besides, captchas are unlikely to stop the spammers because they are likely real humans.

You are welcome to join the ongoing discussion about prevention of spam accounts on GitHub: https://github.com/Tatoeba/tatoeba2/issues/1613

{{vm.hiddenReplies[41181] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
araneo araneo July 18, 2025 July 18, 2025 at 8:47:35 AM UTC flag Report link 永久链接

Why would less legitimate users be able to register if a captcha was added? And thank you for the update!

{{vm.hiddenReplies[41183] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
gillux gillux July 18, 2025 July 18, 2025 at 9:03:39 AM UTC flag Report link 永久链接

You are welcome, araneo.

Captcha notoriously lower accessibility: https://en.wikipedia.org/wiki/C...#Accessibility

Captcha can be challenging to solve, even when not visually impaired. If you have poor mental of physical health, simply browsing a website may require a lot of energy; having to solve captchas on the top of that is not helping.

In general, I don’t think anybody is happy to be prompted to solve a captcha.

For these reasons, I’d rather not add more captchas on the web unless there is a really good reason to do so.

{{vm.hiddenReplies[41184] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
araneo araneo July 18, 2025 July 18, 2025 at 3:06:28 PM UTC flag Report link 永久链接

Thank you for the explanation and for taking that into consideration!

hecko hecko July 19, 2025 July 19, 2025 at 1:12:58 PM UTC flag Report link 永久链接

in my experience captchas are easily solved by modern language models, perhaps even more so than by humans https://www.youtube.com/watch?v=satnl1KTEXM

and even before that there are services employing humans from third-world countries to solve captchas for pennies

one approach i personally like is to hide submissions from new users until at least one sentence/post/whatever from them is approved by a moderator, but that might take effort to implement here since i don't think there's even a way to hide sentences yet

{{vm.hiddenReplies[41190] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd July 19, 2025 July 19, 2025 at 10:11:51 PM UTC flag Report link 永久链接

Interesting idea. Alternatively, for an approach that avoids making more work for the mods, we could consider hiding profiles (or just hiding profile bios) of users who have not written any sentences yet by default.

Many spammers seem to not write any sentences at all, and just fill up their bios with links. The spam accounts that write sentences are at least a little more likely to get noticed, flagged and deleted.

PaulP PaulP July 18, 2025 July 18, 2025 at 5:49:07 AM UTC flag Report link 永久链接

> - Content report: there is a new "flag" button on the top-right of Wall posts and sentence comments to ease the report of inappropriate content to admins.

Thanks! That's very useful!

July 19, 2025 July 19, 2025 at 10:29:17 AM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

aleteacher2 aleteacher2 July 15, 2025 July 15, 2025 at 4:39:25 PM UTC flag Report link 永久链接

Hi Guys! This is Alexander from Brazil! I am very happy to contribute to the development of this site. And I am very happy that the participants here can contribute to the project.

{{vm.hiddenReplies[41170] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
PaulP PaulP July 15, 2025 July 15, 2025 at 8:19:57 PM UTC flag Report link 永久链接

Welcome, Alexander!

{{vm.hiddenReplies[41171] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
gillux gillux July 17, 2025 July 17, 2025 at 9:41:45 AM UTC flag Report link 永久链接

Hello, thank you for all your contributions to this project!

ecorralest101 ecorralest101 July 16, 2025 July 16, 2025 at 8:59:49 PM UTC flag Report link 永久链接

Hi, just a quick comment. I wish speakers of minority languages would contribute more on these. Many of these languages have less than 10 phrases. Let's try to add more digital presence on these linguistic groups.

{{vm.hiddenReplies[41172] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
frpzzd frpzzd July 17, 2025 July 17, 2025 at 12:06:55 AM UTC flag Report link 永久链接

Which languages are you talking about, if you have specific ones in mind?

Anyways, I agree. For a few weeks, I have also been mulling over how to attract more speakers of Panjabi, Gujarati, Telugu and other major languages of India to this site. They are some of the world languages that are the most underrepresented on Tatoeba (comparing the number of speakers in the world with the number of sentences in the corpus). From what I understand, multilingualism is also extremely common in India so these contributors would probably be very beneficial for the corpus.

For the moment, I don't have any good ideas for accomplishing this. Maybe someone else will chime in with thoughts.

{{vm.hiddenReplies[41173] ? 'expand_more' : 'expand_less'}} 隐藏回复 显示回复
ecorralest101 ecorralest101 July 17, 2025 July 17, 2025 at 2:36:01 AM UTC flag Report link 永久链接

Hello, thanks for your reply. Mostly I mean African and Indigenous languages. If you go to the all languages section, you will find many that have less than 10 phrases like Urhobo, Aymara, Haida, Cuyonon to mention some.

By the way, I also have Indian languages. In the past I studied some Kannada, which is a Dravidian language, it has an amazing alphabet, quite artistic.

Now going back to the main topic, I think we might capture some people via social media. It's the idea I have now. On Facebook you may find lots of groups that promote polyglottery and defend minority languages.

sharptoothed sharptoothed July 13, 2025 July 13, 2025 at 3:40:44 PM UTC flag Report link 永久链接

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

July 9, 2025 July 9, 2025 at 12:49:20 PM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。

July 7, 2025 July 7, 2025 at 11:37:27 AM UTC link 永久链接
warning

搿消息额内容违反了阿拉额规定,所以伊被隐藏,伊只好拨管理员帮发布搿消息额宁看到。