menu
Tatoeba
language
Zarejestruj się Zaloguj się
language Polski
menu
Tatoeba

chevron_right Zarejestruj się

chevron_right Zaloguj się

Przeglądaj

chevron_right Wyświetl losowe zdanie

chevron_right Przeglądaj po języku

chevron_right Przeglądaj według listy

chevron_right Przeglądaj po tagu

chevron_right Przeszukuj audio

Społeczność

chevron_right Tablica ogłoszeń

chevron_right Spis członków

chevron_right Członkowie wg języka

chevron_right Rodzimi użytkownicy języka

search
clear
swap_horiz
search
soliloquist {{ icon }} keyboard_arrow_right

Profil

keyboard_arrow_right

Zdania

keyboard_arrow_right

Słownictwo

keyboard_arrow_right

Oceny

keyboard_arrow_right

Listy

keyboard_arrow_right

Ulubione

keyboard_arrow_right

Komentarze

keyboard_arrow_right

Komentarze do zdań użytkownika soliloquist

keyboard_arrow_right

Wiadomości na tablicy ogłoszeń

keyboard_arrow_right

Logi

keyboard_arrow_right

Nagranie

keyboard_arrow_right

Transkrypcje

translate

Tłumacz zdania członka soliloquist

Wypowiedzi na tablicy ogłoszeń użytkownika soliloquist (łącznie 151)

soliloquist soliloquist 7 lipca 2021 7 lipca 2021 22:12:52 UTC link Bezpośredni link

> And for RTL, maybe if we can somehow mark it as RTL would be great.

The writing direction of the language needs to be changed from LTR to auto. I had made a similar request for Ottoman Turkish. Sentences in both scripts display correctly now. #9674971

soliloquist soliloquist 1 czerwca 2021 1 czerwca 2021 19:33:36 UTC link Bezpośredni link

I guess many users would either prefer to receive them all or none, but if you want to get email notifications only of a certain type, you can link your Tatoeba account to a new email address, and create some rules/filters on that email account to forward notifications to your main email address only of that type.

The subject lines of emails from Tatoeba are as below:

Tatoeba PM

Tatoeba - Comment on sentence

Tatoeba - X has replied to you on the Wall

You can create rules with those patterns to forward or filter the notifications you want.

soliloquist soliloquist 28 marca 2021 28 marca 2021 12:38:42 UTC link Bezpośredni link

https://github.com/Tatoeba/tatoeba2/issues/183

soliloquist soliloquist 21 marca 2021 21 marca 2021 13:52:41 UTC link Bezpośredni link

https://en.wiki.tatoeba.org/art...nguage-request

soliloquist soliloquist 5 marca 2021, edytowane 5 marca 2021 5 marca 2021 13:37:44 UTC, edytowane 5 marca 2021 13:37:59 UTC link Bezpośredni link

Thanks for working on it! It seems you found a better method to redirect wiki pages, other than using Transifex. (https://github.com/Tatoeba/tatoeba2/issues/2626 )

soliloquist soliloquist 12 lutego 2021 12 lutego 2021 13:33:23 UTC link Bezpośredni link

> but many of them disappear in the following three days.

I agree. I usually try not to write many pedantic comments under new members' sentences to avoid intimidating them.

Another underlying reason of this impermanence, I believe, is the lack of a reward mechanism and gamification which may cause monotony. But of course, it can be argued that they have drawbacks, too. See the discussion on GitHub.

https://github.com/Tatoeba/tatoeba2/issues/2481

soliloquist soliloquist 5 lutego 2021 5 lutego 2021 11:37:50 UTC link Bezpośredni link

Much better than the old design, thank you.

soliloquist soliloquist 13 grudnia 2020 13 grudnia 2020 22:02:49 UTC link Bezpośredni link

> Now "Tatoeba Sentences & Translations Stats" and "Tatoeba User Activity Chart"
> show "Original sentences" numbers and "Original / total" ratio.

Thank you very much for that.

soliloquist soliloquist 25 listopada 2020 25 listopada 2020 18:35:23 UTC link Bezpośredni link

The suffixes now seem to be handled well. #8904790, #9094761 and #8871066 are all listed. Thank you very much.

soliloquist soliloquist 25 listopada 2020, edytowane 25 listopada 2020 25 listopada 2020 18:31:45 UTC, edytowane 25 listopada 2020 18:36:12 UTC link Bezpośredni link

You're right. We should remove ... from it. That would give a lot of false positives.

It should only include exact matches.

onla -> onunla

(without ...)

soliloquist soliloquist 25 listopada 2020, edytowane 25 listopada 2020 25 listopada 2020 17:46:36 UTC, edytowane 25 listopada 2020 18:24:59 UTC link Bezpośredni link

That's it! Here's the new list. I hope it works this time.

...çca -> ...çça
...çce -> ...ççe
...çci -> ...ççi
...çcı -> ...ççı
...çcu -> ...ççu
...çcü -> ...ççü
...çda -> ...çta
...çdan -> ...çtan
...çde -> ...çte
...çden -> ...çten
...fca -> ...fça
...fce -> ...fçe
...fci -> ...fçi
...fcı -> ...fçı
...fcu -> ...fçu
...fcü -> ...fçü
...fda -> ...fta
...fdan -> ...ftan
...fde -> ...fte
...fden -> ...ften
...hca -> ...hça
...hce -> ...hçe
...hci -> ...hçi
...hcı -> ...hçı
...hcu -> ...hçu
...hcü -> ...hçü
...hda -> ...hta
...hdan -> ...htan
...hde -> ...hte
...hden -> ...hten
...kca -> ...kça
...kce -> ...kçe
...kci -> ...kçi
...kcı -> ...kçı
...kcu -> ...kçu
...kcü -> ...kçü
...kda -> ...kta
...kdan -> ...ktan
...kde -> ...kte
...kden -> ...kten
...pca -> ...pça
...pce -> ...pçe
...pci -> ...pçi
...pcı -> ...pçı
...pcu -> ...pçu
...pcü -> ...pçü
...pda -> ...pta
...pdan -> ...ptan
...pde -> ...pte
...pden -> ...pten
...sca -> ...sça
...sce -> ...sçe
...sci -> ...sçi
...scı -> ...sçı
...scu -> ...sçu
...scü -> ...sçü
...sda -> ...sta
...sdan -> ...stan
...sde -> ...ste
...sden -> ...sten
...şca -> ...şça
...şce -> ...şçe
...şci -> ...şçi
...şcı -> ...şçı
...şcu -> ...şçu
...şcü -> ...şçü
...şda -> ...şta
...şdan -> ...ştan
...şde -> ...şte
...şden -> ...şten
acenta... -> acente...
aç gözlü... -> açgözlü...
açıkca... -> açıkça...
adele... -> adale...
afedersin... -> affedersin...
aksesuvar... -> aksesuar...
aktrist... -> aktris...
akıl almaz... -> akılalmaz...
Alaman... -> Alman...
allerji... -> alerji...
alt üst... -> altüst...
alış veriş... -> alışveriş...
aliminyum... -> alüminyum...
ambülans... -> ambulans
ampül... -> ampul...
ana okulu... -> anaokulu...
antreman... -> antrenman...
apandist... -> apandisit...
aperatif... -> aperitif...
Arabça... -> Arapça...
arasıra... -> ara sıra...
ardarda... -> art arda...
Arjentin... -> Arjantin...
atmış... -> altmış... (çoğu false positive)
avusturalya... -> avustralya...
ayırım... -> ayrım...
Azarbaycan... -> Azerbaycan
Azarbeycan... -> Azerbaycan
Azerbeycan... -> Azerbaycan
banço... -> banjo...
başbaşa... -> baş başa...
başı boş... -> başıboş...
belkide... -> belki de...
benle... -> benimle...
beysbol... -> beyzbol...
bir kaç... -> birkaç...
bir çok... -> birçok...
birarada... -> bir arada...
birden bire... -> birdenbire...
Biritan... -> Britan...
birsürü... -> bir sürü...
Brazilya... -> Brezilya...
bu gün... -> bugün...
bugünki... -> bugünkü...
bulüz... -> bluz...
burda... -> burada...
buyrun... -> buyurun...
büfte... -> bifte...
büyük anne... -> büyükanne...
büyük baba... -> büyükbaba...
can kurtaran... -> cankurtaran...
cimnastik... -> jimnastik...
çarşanba... -> çarşamba...
çeki düzen... -> çekidüzen...
çokaz... -> çok az...
çoşku... -> coşku...
dahada... -> daha da...
deniz aşırı... -> denizaşırı...
değilmi... -> değil mi...
deyer... -> değer...
deyme... -> değme...
doğumgünü... -> doğum günü...
döküman... -> doküman...
döğdü... -> dövdü...
döğüş... -> dövüş...
dünki... -> dünkü...
düz taban... -> düztaban...
eczahane... -> eczane...
eksoz... -> egzoz...
elele... -> el ele...
entellektüel... -> entelektüel...
eposta... -> e-posta...
eylence... -> eğlence...
eylenmek... -> eğlenmek...
fantazi... -> fantezi...
farked... -> fark ed...
farket... -> fark et...
farzed... -> farz ed...
farzet... -> farz et...
Fırans... -> Frans...
filim... -> film...
fonksyon... -> fonksiyon...
fotograf... -> fotoğraf...
gardrop... -> gardırop...
gurup... -> grup...
gök kuşağı... -> gökkuşağı...
gök yüzü... -> gökyüzü...
göz yaşı... -> gözyaşı...
gözardı... -> göz ardı...
gözkulak... -> göz kulak...
gözüpek... -> gözü pek...
haftasonu... -> hafta sonu...
haked... -> hak ed...
haket... -> hak et...
hakket... -> hak et...
halbu ki... > halbuki...
hastahane... -> hastane...
hava alanı... -> havaalanı...
hava limanı... -> havalimanı...
hem fikir... -> hemfikir...
hemde... -> hem de...
her hangi... -> herhangi...
hergün... -> her gün...
herkez... -> herkes...
herne... -> her ne...
heryer... -> her yer...
herzaman... -> her zaman...
herşey... -> her şey...
hiç bir... -> hiçbir...
hiçkimse... -> hiç kimse...
hiçte... -> hiç de...
humani... -> hümani...
ısraf... -> israf...
iki yüzlü... -> ikiyüzlü...
ilk okul... -> ilkokul...
insan oğlu... -> insanoğlu...
insiyatif... -> inisiyatif...
israr... -> ısrar...
istakoz... -> ıstakoz...
istambul... -> istanbul...
itibariyle... -> itibarıyla...
iyiki... -> iyi ki...
kamu oyu... -> kamuoyu...
kapşon... -> kapüşon...
kareografi... -> koreografi...
kaysı... -> kayısı...
klavuz... -> kılavuz...
klüp... -> kulüp...
kolleksiyon... -> koleksiyon...
kominist... -> komünist...
kompartman... -> kompartıman...
koperatif... -> kooperatif...
kılınç... -> kılıç...
kıral... -> kral...
kıraliyet... -> kraliyet...
kıraliçe... -> kraliçe...
kırallık... -> krallık...
kızarkadaş... -> kız arkadaş...
kızkardeş... -> kız kardeş...
Kürdçe... -> Kürtçe...
labaratuar... -> laboratuvar...
labaratuvar... -> laboratuvar...
madem ki... -> mademki...
mahçup... -> mahcup...
makina... -> makine...
malesef... -> maalesef...
malolmak... -> mal olmak...
Marry... -> Mary...
Mary'e... -> Mary'ye...
Mary'i... -> Mary'yi...
Mary'le... -> Mary'yle...
matamatik... -> matematik...
menejer... -> menajer...
metod... -> metot...
meyva... -> meyve...
meşkul... -> meşgul...
motorsiklet... -> motosiklet...
müdahele... -> müdahale...
müsade... -> müsaade...
müstehak... -> müstahak...
mütevazi... -> mütevazı...
nerde... -> nerede...
neyseki... -> neyse ki...
nufus... -> nüfus...
okur yazar... -> okuryazar...
onaltı... -> on altı...
onbeş... -> on beş...
onbir... -> on bir...
ondokuz... -> on dokuz...
ondört... -> on dört...
oniki... -> on iki...
onla... -> onunla...
onsekiz... -> on sekiz...
onyedi... -> on yedi...
onüç... -> on üç...
orda... -> orada...
orjinal... -> orijinal...
orta okul... -> ortaokul...
oysa ki... -> oysaki...
pantalon... -> pantolon...
parlemento... -> parlamento...
pastahane... -> pastane...
pekaz... -> pek az...
pekçok... -> pek çok...
penbe... -> pembe...
perşenbe... -> perşembe...
peşpeşe... -> peş peşe...
pilaj... -> plaj...
postahane... -> postane...
proğram... -> program...
rasgele... -> rastgele...
rasgelm... -> rast gelm...
raslantı... -> rastlantı...
sarfed... -> sarf ed...
sarfet... -> sarf et...
sarmısak... -> sarımsak...
senle... -> seninle...
sivri sinek... -> sivrisinek...
sohpet... -> sohbet...
sueter... -> süveter...
süpriz... -> sürpriz...
şöför... -> şoför...
tabi ki... -> tabii ki...
taktir... -> takdir...
terked... -> terk ed...
terket... -> terk et...
tesbih... -> tespih...
tesbit... -> tespit...
traş... -> tıraş...
Türküye... -> Türkiye...
umru... -> umuru...
ünüversite... -> üniversite...
ünvan... -> unvan...
vaz geç... -> vazgeç...
yada... -> ya da...
yanlız... -> yalnız...
yanyana... -> yan yana...
yanısıra... -> yanı sıra...
yinede... -> yine de...
yüksek okul... -> yüksekokul...
yüz ölçüm... -> yüzölçüm...
zıttı... -> zıddı...

soliloquist soliloquist 25 listopada 2020 25 listopada 2020 17:45:48 UTC link Bezpośredni link

> In that case, the rule should be written as
> herşey... -> her şey...

Yes. Thanks for clarifying.

> But is there a difference between rules ending in ... and other rules, then?

No, not really, as long as ... is placed correctly.

soliloquist soliloquist 25 listopada 2020, edytowane 25 listopada 2020 25 listopada 2020 15:38:29 UTC, edytowane 25 listopada 2020 15:40:17 UTC link Bezpośredni link

Thanks. The script now detects the suffix errors (in which the error is in the suffix itself) nicely, but misspelled words with suffixes (in which the error is in the root, not in the suffix) are still ignored.

For example,

herşey -> her şey

It only detects #8904790, but it should also detect #9094761 and #8871066. They have the same error, but I guess it is set to search terms as 'whole words' so the suffixed forms are ignored. Many of the terms are in a similar situation. Only the ones without suffixes are found.

soliloquist soliloquist 25 listopada 2020 25 listopada 2020 15:38:19 UTC link Bezpośredni link

> - is that a common Turkish habit? :P )

No, it's just a trivial and stylistic error. :-) But it's better to standardize them to avoid having duplicates. Thanks for picking them up.

soliloquist soliloquist 25 listopada 2020 25 listopada 2020 13:13:33 UTC link Bezpośredni link

> What's the difference between
> "Haked... -> Hak ed..."

The latter is the correct form. '...' means that the word certainly has a suffix so you can think of it as 'haked*'.

> and
> "* Öğe -> Öge"

Sorry for the confusion. The asterisk at the beginnig of that example has a different meaning. There are several other examples like that. You can ignore them. We have two Turkish language associations and they have differences of opinion about spelling of some words and that word is one of them. I put an asterisk at the beginning of them to indicate that. I now replaced them with ※ to avoid confusion.

Examples like,

...pde -> ...pte

...sden -> ...sten

show suffix errors. Such words end with those letters without having other suffixes. They can be searched on Tatoeba by putting an asterisk before them. (*pde, *sden etc.). If it's difficult to add them to the script, you can ignore them. Just having suffix support would be sufficient.

soliloquist soliloquist 25 listopada 2020 25 listopada 2020 10:41:04 UTC link Bezpośredni link

Thanks a million for adapting the script for Turkish. The reason I put asterisks on some of the vocabulary items was to include their suffixed forms. Otherwise the search finds only exact matches, without suffixed ones. For example,

herşey -> her şey

Without the asterisk, the search finds only 'herşey' but not the suffixed forms like 'herşeyi', 'herşeye', 'herşeyde' etc. So it's kind of a stemming support. Your script does a great job as is, but if you could find a way to include other forms, it would be much better.

Thank you very much again.

soliloquist soliloquist 24 listopada 2020 24 listopada 2020 19:14:33 UTC link Bezpośredni link

Thank you. It's really a good idea.

> Would there be any use for something like this for other languages?

I had created some accounts to monitor spelling errors in the Turkish corpus using vocabulary items.

https://tatoeba.org/eng/sentences/show/8243466

It would be nice to have a similar script that collects the vocabulary items of those accounts (they can be added to the script manually), searches them in the Turkish corpus and reports the results.

soliloquist soliloquist 22 listopada 2020, edytowane 26 maja 2021 22 listopada 2020 15:16:24 UTC, edytowane 26 maja 2021 20:17:34 UTC link Bezpośredni link

https://tatoeba.org/en/sentence...omment-1268881

soliloquist soliloquist 18 listopada 2020 18 listopada 2020 13:46:42 UTC link Bezpośredni link

Just take your time, thank you.

soliloquist soliloquist 18 listopada 2020 18 listopada 2020 11:59:16 UTC link Bezpośredni link

Thanks. Would you consider adding the original sentence ratio (ocnt/scnt) into the chart like you do with tcnt/scnt? The "Base of Sentences" part on the Downloads page probably has the necessary data.