menu
Tatoeba
language
Registriĝi Ensaluti
language Esperanto
menu
Tatoeba

chevron_right Registriĝi

chevron_right Ensaluti

Foliumi

chevron_right Montri hazardan frazon

chevron_right Foliumi laŭ lingvo

chevron_right Foliumi laŭ listo

chevron_right Foliumi laŭ etikedo

chevron_right Foliumi sonregistraĵojn

Komunumo

chevron_right Muro

chevron_right Listo de ĉiuj membroj

chevron_right Lingvoj de la membroj

chevron_right Denaskaj parolantoj

search
clear
swap_horiz
search
soliloquist {{ icon }} keyboard_arrow_right

Profilo

keyboard_arrow_right

Frazoj

keyboard_arrow_right

Vortaro

keyboard_arrow_right

Revizioj

keyboard_arrow_right

Listoj

keyboard_arrow_right

Preferaĵoj

keyboard_arrow_right

Komentoj

keyboard_arrow_right

Komentoj pri frazoj de soliloquist

keyboard_arrow_right

Muraj mesaĝoj

keyboard_arrow_right

Registroj

keyboard_arrow_right

Sono

keyboard_arrow_right

Transskriboj

translate

Traduki la frazojn de soliloquist

Surmuraj mesaĝoj de s% (entute soliloquist)

soliloquist soliloquist 2021-julio-07 2021-julio-07 22:12:52 UTC link Konstanta ligilo

> And for RTL, maybe if we can somehow mark it as RTL would be great.

The writing direction of the language needs to be changed from LTR to auto. I had made a similar request for Ottoman Turkish. Sentences in both scripts display correctly now. #9674971

soliloquist soliloquist 2021-junio-01 2021-junio-01 19:33:36 UTC link Konstanta ligilo

I guess many users would either prefer to receive them all or none, but if you want to get email notifications only of a certain type, you can link your Tatoeba account to a new email address, and create some rules/filters on that email account to forward notifications to your main email address only of that type.

The subject lines of emails from Tatoeba are as below:

Tatoeba PM

Tatoeba - Comment on sentence

Tatoeba - X has replied to you on the Wall

You can create rules with those patterns to forward or filter the notifications you want.

soliloquist soliloquist 2021-marto-28 2021-marto-28 12:38:42 UTC link Konstanta ligilo

https://github.com/Tatoeba/tatoeba2/issues/183

soliloquist soliloquist 2021-marto-21 2021-marto-21 13:52:41 UTC link Konstanta ligilo

https://en.wiki.tatoeba.org/art...nguage-request

soliloquist soliloquist 2021-marto-05, modifita 2021-marto-05 2021-marto-05 13:37:44 UTC, modifita 2021-marto-05 13:37:59 UTC link Konstanta ligilo

Thanks for working on it! It seems you found a better method to redirect wiki pages, other than using Transifex. (https://github.com/Tatoeba/tatoeba2/issues/2626 )

soliloquist soliloquist 2021-februaro-12 2021-februaro-12 13:33:23 UTC link Konstanta ligilo

> but many of them disappear in the following three days.

I agree. I usually try not to write many pedantic comments under new members' sentences to avoid intimidating them.

Another underlying reason of this impermanence, I believe, is the lack of a reward mechanism and gamification which may cause monotony. But of course, it can be argued that they have drawbacks, too. See the discussion on GitHub.

https://github.com/Tatoeba/tatoeba2/issues/2481

soliloquist soliloquist 2021-februaro-05 2021-februaro-05 11:37:50 UTC link Konstanta ligilo

Much better than the old design, thank you.

soliloquist soliloquist 2020-decembro-13 2020-decembro-13 22:02:49 UTC link Konstanta ligilo

> Now "Tatoeba Sentences & Translations Stats" and "Tatoeba User Activity Chart"
> show "Original sentences" numbers and "Original / total" ratio.

Thank you very much for that.

soliloquist soliloquist 2020-novembro-25 2020-novembro-25 18:35:23 UTC link Konstanta ligilo

The suffixes now seem to be handled well. #8904790, #9094761 and #8871066 are all listed. Thank you very much.

soliloquist soliloquist 2020-novembro-25, modifita 2020-novembro-25 2020-novembro-25 18:31:45 UTC, modifita 2020-novembro-25 18:36:12 UTC link Konstanta ligilo

You're right. We should remove ... from it. That would give a lot of false positives.

It should only include exact matches.

onla -> onunla

(without ...)

soliloquist soliloquist 2020-novembro-25, modifita 2020-novembro-25 2020-novembro-25 17:46:36 UTC, modifita 2020-novembro-25 18:24:59 UTC link Konstanta ligilo

That's it! Here's the new list. I hope it works this time.

...çca -> ...çça
...çce -> ...ççe
...çci -> ...ççi
...çcı -> ...ççı
...çcu -> ...ççu
...çcü -> ...ççü
...çda -> ...çta
...çdan -> ...çtan
...çde -> ...çte
...çden -> ...çten
...fca -> ...fça
...fce -> ...fçe
...fci -> ...fçi
...fcı -> ...fçı
...fcu -> ...fçu
...fcü -> ...fçü
...fda -> ...fta
...fdan -> ...ftan
...fde -> ...fte
...fden -> ...ften
...hca -> ...hça
...hce -> ...hçe
...hci -> ...hçi
...hcı -> ...hçı
...hcu -> ...hçu
...hcü -> ...hçü
...hda -> ...hta
...hdan -> ...htan
...hde -> ...hte
...hden -> ...hten
...kca -> ...kça
...kce -> ...kçe
...kci -> ...kçi
...kcı -> ...kçı
...kcu -> ...kçu
...kcü -> ...kçü
...kda -> ...kta
...kdan -> ...ktan
...kde -> ...kte
...kden -> ...kten
...pca -> ...pça
...pce -> ...pçe
...pci -> ...pçi
...pcı -> ...pçı
...pcu -> ...pçu
...pcü -> ...pçü
...pda -> ...pta
...pdan -> ...ptan
...pde -> ...pte
...pden -> ...pten
...sca -> ...sça
...sce -> ...sçe
...sci -> ...sçi
...scı -> ...sçı
...scu -> ...sçu
...scü -> ...sçü
...sda -> ...sta
...sdan -> ...stan
...sde -> ...ste
...sden -> ...sten
...şca -> ...şça
...şce -> ...şçe
...şci -> ...şçi
...şcı -> ...şçı
...şcu -> ...şçu
...şcü -> ...şçü
...şda -> ...şta
...şdan -> ...ştan
...şde -> ...şte
...şden -> ...şten
acenta... -> acente...
aç gözlü... -> açgözlü...
açıkca... -> açıkça...
adele... -> adale...
afedersin... -> affedersin...
aksesuvar... -> aksesuar...
aktrist... -> aktris...
akıl almaz... -> akılalmaz...
Alaman... -> Alman...
allerji... -> alerji...
alt üst... -> altüst...
alış veriş... -> alışveriş...
aliminyum... -> alüminyum...
ambülans... -> ambulans
ampül... -> ampul...
ana okulu... -> anaokulu...
antreman... -> antrenman...
apandist... -> apandisit...
aperatif... -> aperitif...
Arabça... -> Arapça...
arasıra... -> ara sıra...
ardarda... -> art arda...
Arjentin... -> Arjantin...
atmış... -> altmış... (çoğu false positive)
avusturalya... -> avustralya...
ayırım... -> ayrım...
Azarbaycan... -> Azerbaycan
Azarbeycan... -> Azerbaycan
Azerbeycan... -> Azerbaycan
banço... -> banjo...
başbaşa... -> baş başa...
başı boş... -> başıboş...
belkide... -> belki de...
benle... -> benimle...
beysbol... -> beyzbol...
bir kaç... -> birkaç...
bir çok... -> birçok...
birarada... -> bir arada...
birden bire... -> birdenbire...
Biritan... -> Britan...
birsürü... -> bir sürü...
Brazilya... -> Brezilya...
bu gün... -> bugün...
bugünki... -> bugünkü...
bulüz... -> bluz...
burda... -> burada...
buyrun... -> buyurun...
büfte... -> bifte...
büyük anne... -> büyükanne...
büyük baba... -> büyükbaba...
can kurtaran... -> cankurtaran...
cimnastik... -> jimnastik...
çarşanba... -> çarşamba...
çeki düzen... -> çekidüzen...
çokaz... -> çok az...
çoşku... -> coşku...
dahada... -> daha da...
deniz aşırı... -> denizaşırı...
değilmi... -> değil mi...
deyer... -> değer...
deyme... -> değme...
doğumgünü... -> doğum günü...
döküman... -> doküman...
döğdü... -> dövdü...
döğüş... -> dövüş...
dünki... -> dünkü...
düz taban... -> düztaban...
eczahane... -> eczane...
eksoz... -> egzoz...
elele... -> el ele...
entellektüel... -> entelektüel...
eposta... -> e-posta...
eylence... -> eğlence...
eylenmek... -> eğlenmek...
fantazi... -> fantezi...
farked... -> fark ed...
farket... -> fark et...
farzed... -> farz ed...
farzet... -> farz et...
Fırans... -> Frans...
filim... -> film...
fonksyon... -> fonksiyon...
fotograf... -> fotoğraf...
gardrop... -> gardırop...
gurup... -> grup...
gök kuşağı... -> gökkuşağı...
gök yüzü... -> gökyüzü...
göz yaşı... -> gözyaşı...
gözardı... -> göz ardı...
gözkulak... -> göz kulak...
gözüpek... -> gözü pek...
haftasonu... -> hafta sonu...
haked... -> hak ed...
haket... -> hak et...
hakket... -> hak et...
halbu ki... > halbuki...
hastahane... -> hastane...
hava alanı... -> havaalanı...
hava limanı... -> havalimanı...
hem fikir... -> hemfikir...
hemde... -> hem de...
her hangi... -> herhangi...
hergün... -> her gün...
herkez... -> herkes...
herne... -> her ne...
heryer... -> her yer...
herzaman... -> her zaman...
herşey... -> her şey...
hiç bir... -> hiçbir...
hiçkimse... -> hiç kimse...
hiçte... -> hiç de...
humani... -> hümani...
ısraf... -> israf...
iki yüzlü... -> ikiyüzlü...
ilk okul... -> ilkokul...
insan oğlu... -> insanoğlu...
insiyatif... -> inisiyatif...
israr... -> ısrar...
istakoz... -> ıstakoz...
istambul... -> istanbul...
itibariyle... -> itibarıyla...
iyiki... -> iyi ki...
kamu oyu... -> kamuoyu...
kapşon... -> kapüşon...
kareografi... -> koreografi...
kaysı... -> kayısı...
klavuz... -> kılavuz...
klüp... -> kulüp...
kolleksiyon... -> koleksiyon...
kominist... -> komünist...
kompartman... -> kompartıman...
koperatif... -> kooperatif...
kılınç... -> kılıç...
kıral... -> kral...
kıraliyet... -> kraliyet...
kıraliçe... -> kraliçe...
kırallık... -> krallık...
kızarkadaş... -> kız arkadaş...
kızkardeş... -> kız kardeş...
Kürdçe... -> Kürtçe...
labaratuar... -> laboratuvar...
labaratuvar... -> laboratuvar...
madem ki... -> mademki...
mahçup... -> mahcup...
makina... -> makine...
malesef... -> maalesef...
malolmak... -> mal olmak...
Marry... -> Mary...
Mary'e... -> Mary'ye...
Mary'i... -> Mary'yi...
Mary'le... -> Mary'yle...
matamatik... -> matematik...
menejer... -> menajer...
metod... -> metot...
meyva... -> meyve...
meşkul... -> meşgul...
motorsiklet... -> motosiklet...
müdahele... -> müdahale...
müsade... -> müsaade...
müstehak... -> müstahak...
mütevazi... -> mütevazı...
nerde... -> nerede...
neyseki... -> neyse ki...
nufus... -> nüfus...
okur yazar... -> okuryazar...
onaltı... -> on altı...
onbeş... -> on beş...
onbir... -> on bir...
ondokuz... -> on dokuz...
ondört... -> on dört...
oniki... -> on iki...
onla... -> onunla...
onsekiz... -> on sekiz...
onyedi... -> on yedi...
onüç... -> on üç...
orda... -> orada...
orjinal... -> orijinal...
orta okul... -> ortaokul...
oysa ki... -> oysaki...
pantalon... -> pantolon...
parlemento... -> parlamento...
pastahane... -> pastane...
pekaz... -> pek az...
pekçok... -> pek çok...
penbe... -> pembe...
perşenbe... -> perşembe...
peşpeşe... -> peş peşe...
pilaj... -> plaj...
postahane... -> postane...
proğram... -> program...
rasgele... -> rastgele...
rasgelm... -> rast gelm...
raslantı... -> rastlantı...
sarfed... -> sarf ed...
sarfet... -> sarf et...
sarmısak... -> sarımsak...
senle... -> seninle...
sivri sinek... -> sivrisinek...
sohpet... -> sohbet...
sueter... -> süveter...
süpriz... -> sürpriz...
şöför... -> şoför...
tabi ki... -> tabii ki...
taktir... -> takdir...
terked... -> terk ed...
terket... -> terk et...
tesbih... -> tespih...
tesbit... -> tespit...
traş... -> tıraş...
Türküye... -> Türkiye...
umru... -> umuru...
ünüversite... -> üniversite...
ünvan... -> unvan...
vaz geç... -> vazgeç...
yada... -> ya da...
yanlız... -> yalnız...
yanyana... -> yan yana...
yanısıra... -> yanı sıra...
yinede... -> yine de...
yüksek okul... -> yüksekokul...
yüz ölçüm... -> yüzölçüm...
zıttı... -> zıddı...

soliloquist soliloquist 2020-novembro-25 2020-novembro-25 17:45:48 UTC link Konstanta ligilo

> In that case, the rule should be written as
> herşey... -> her şey...

Yes. Thanks for clarifying.

> But is there a difference between rules ending in ... and other rules, then?

No, not really, as long as ... is placed correctly.

soliloquist soliloquist 2020-novembro-25, modifita 2020-novembro-25 2020-novembro-25 15:38:29 UTC, modifita 2020-novembro-25 15:40:17 UTC link Konstanta ligilo

Thanks. The script now detects the suffix errors (in which the error is in the suffix itself) nicely, but misspelled words with suffixes (in which the error is in the root, not in the suffix) are still ignored.

For example,

herşey -> her şey

It only detects #8904790, but it should also detect #9094761 and #8871066. They have the same error, but I guess it is set to search terms as 'whole words' so the suffixed forms are ignored. Many of the terms are in a similar situation. Only the ones without suffixes are found.

soliloquist soliloquist 2020-novembro-25 2020-novembro-25 15:38:19 UTC link Konstanta ligilo

> - is that a common Turkish habit? :P )

No, it's just a trivial and stylistic error. :-) But it's better to standardize them to avoid having duplicates. Thanks for picking them up.

soliloquist soliloquist 2020-novembro-25 2020-novembro-25 13:13:33 UTC link Konstanta ligilo

> What's the difference between
> "Haked... -> Hak ed..."

The latter is the correct form. '...' means that the word certainly has a suffix so you can think of it as 'haked*'.

> and
> "* Öğe -> Öge"

Sorry for the confusion. The asterisk at the beginnig of that example has a different meaning. There are several other examples like that. You can ignore them. We have two Turkish language associations and they have differences of opinion about spelling of some words and that word is one of them. I put an asterisk at the beginning of them to indicate that. I now replaced them with ※ to avoid confusion.

Examples like,

...pde -> ...pte

...sden -> ...sten

show suffix errors. Such words end with those letters without having other suffixes. They can be searched on Tatoeba by putting an asterisk before them. (*pde, *sden etc.). If it's difficult to add them to the script, you can ignore them. Just having suffix support would be sufficient.

soliloquist soliloquist 2020-novembro-25 2020-novembro-25 10:41:04 UTC link Konstanta ligilo

Thanks a million for adapting the script for Turkish. The reason I put asterisks on some of the vocabulary items was to include their suffixed forms. Otherwise the search finds only exact matches, without suffixed ones. For example,

herşey -> her şey

Without the asterisk, the search finds only 'herşey' but not the suffixed forms like 'herşeyi', 'herşeye', 'herşeyde' etc. So it's kind of a stemming support. Your script does a great job as is, but if you could find a way to include other forms, it would be much better.

Thank you very much again.

soliloquist soliloquist 2020-novembro-24 2020-novembro-24 19:14:33 UTC link Konstanta ligilo

Thank you. It's really a good idea.

> Would there be any use for something like this for other languages?

I had created some accounts to monitor spelling errors in the Turkish corpus using vocabulary items.

https://tatoeba.org/eng/sentences/show/8243466

It would be nice to have a similar script that collects the vocabulary items of those accounts (they can be added to the script manually), searches them in the Turkish corpus and reports the results.

soliloquist soliloquist 2020-novembro-22, modifita 2021-majo-26 2020-novembro-22 15:16:24 UTC, modifita 2021-majo-26 20:17:34 UTC link Konstanta ligilo

https://tatoeba.org/en/sentence...omment-1268881

soliloquist soliloquist 2020-novembro-18 2020-novembro-18 13:46:42 UTC link Konstanta ligilo

Just take your time, thank you.

soliloquist soliloquist 2020-novembro-18 2020-novembro-18 11:59:16 UTC link Konstanta ligilo

Thanks. Would you consider adding the original sentence ratio (ocnt/scnt) into the chart like you do with tcnt/scnt? The "Base of Sentences" part on the Downloads page probably has the necessary data.