menu
Tatoeba
language
Vytvoriť účet Prihlásiť sa
language Slovenčina
menu
Tatoeba

chevron_right Vytvoriť účet

chevron_right Prihlásiť sa

Hľadať

chevron_right Zobraziť náhodnú vetu

chevron_right Hľadať podľa jazyka

chevron_right Hľadať podľa zoznamu

chevron_right Hľadať podľa štítka

chevron_right Hľadať medzi zvukovými nahrávkami

Komunita

chevron_right Nástenka

chevron_right Zoznam všetkých členov

chevron_right Jazyky členov

chevron_right Rodení hovoriaci

search
clear
swap_horiz
search
soliloquist {{ icon }} keyboard_arrow_right

Profil

keyboard_arrow_right

Vety

keyboard_arrow_right

Slovná zásoba

keyboard_arrow_right

Preverenia

keyboard_arrow_right

Zoznamy

keyboard_arrow_right

Obľúbené

keyboard_arrow_right

Komentáre

keyboard_arrow_right

Komentáre na vetách užívateľa soliloquist

keyboard_arrow_right

Správy na Nástenke

keyboard_arrow_right

Záznamy

keyboard_arrow_right

Zvukové nahrávky

keyboard_arrow_right

Prepisy

translate

Prekladať vety užívateľa soliloquist

Správy užívateľa soliloquist na Nástenke (spolu 151)

soliloquist soliloquist 7. júla 2021 7. júla 2021, 22:12:52 UTC link Trvalý odkaz

> And for RTL, maybe if we can somehow mark it as RTL would be great.

The writing direction of the language needs to be changed from LTR to auto. I had made a similar request for Ottoman Turkish. Sentences in both scripts display correctly now. #9674971

soliloquist soliloquist 1. júna 2021 1. júna 2021, 19:33:36 UTC link Trvalý odkaz

I guess many users would either prefer to receive them all or none, but if you want to get email notifications only of a certain type, you can link your Tatoeba account to a new email address, and create some rules/filters on that email account to forward notifications to your main email address only of that type.

The subject lines of emails from Tatoeba are as below:

Tatoeba PM

Tatoeba - Comment on sentence

Tatoeba - X has replied to you on the Wall

You can create rules with those patterns to forward or filter the notifications you want.

soliloquist soliloquist 28. marca 2021 28. marca 2021, 12:38:42 UTC link Trvalý odkaz

https://github.com/Tatoeba/tatoeba2/issues/183

soliloquist soliloquist 21. marca 2021 21. marca 2021, 13:52:41 UTC link Trvalý odkaz

https://en.wiki.tatoeba.org/art...nguage-request

soliloquist soliloquist 5. marca 2021, upravené dňa 5. marca 2021 5. marca 2021, 13:37:44 UTC, upravené 5. marca 2021, 13:37:59 UTC link Trvalý odkaz

Thanks for working on it! It seems you found a better method to redirect wiki pages, other than using Transifex. (https://github.com/Tatoeba/tatoeba2/issues/2626 )

soliloquist soliloquist 12. februára 2021 12. februára 2021, 13:33:23 UTC link Trvalý odkaz

> but many of them disappear in the following three days.

I agree. I usually try not to write many pedantic comments under new members' sentences to avoid intimidating them.

Another underlying reason of this impermanence, I believe, is the lack of a reward mechanism and gamification which may cause monotony. But of course, it can be argued that they have drawbacks, too. See the discussion on GitHub.

https://github.com/Tatoeba/tatoeba2/issues/2481

soliloquist soliloquist 5. februára 2021 5. februára 2021, 11:37:50 UTC link Trvalý odkaz

Much better than the old design, thank you.

soliloquist soliloquist 13. decembra 2020 13. decembra 2020, 22:02:49 UTC link Trvalý odkaz

> Now "Tatoeba Sentences & Translations Stats" and "Tatoeba User Activity Chart"
> show "Original sentences" numbers and "Original / total" ratio.

Thank you very much for that.

soliloquist soliloquist 25. novembra 2020 25. novembra 2020, 18:35:23 UTC link Trvalý odkaz

The suffixes now seem to be handled well. #8904790, #9094761 and #8871066 are all listed. Thank you very much.

soliloquist soliloquist 25. novembra 2020, upravené dňa 25. novembra 2020 25. novembra 2020, 18:31:45 UTC, upravené 25. novembra 2020, 18:36:12 UTC link Trvalý odkaz

You're right. We should remove ... from it. That would give a lot of false positives.

It should only include exact matches.

onla -> onunla

(without ...)

soliloquist soliloquist 25. novembra 2020, upravené dňa 25. novembra 2020 25. novembra 2020, 17:46:36 UTC, upravené 25. novembra 2020, 18:24:59 UTC link Trvalý odkaz

That's it! Here's the new list. I hope it works this time.

...çca -> ...çça
...çce -> ...ççe
...çci -> ...ççi
...çcı -> ...ççı
...çcu -> ...ççu
...çcü -> ...ççü
...çda -> ...çta
...çdan -> ...çtan
...çde -> ...çte
...çden -> ...çten
...fca -> ...fça
...fce -> ...fçe
...fci -> ...fçi
...fcı -> ...fçı
...fcu -> ...fçu
...fcü -> ...fçü
...fda -> ...fta
...fdan -> ...ftan
...fde -> ...fte
...fden -> ...ften
...hca -> ...hça
...hce -> ...hçe
...hci -> ...hçi
...hcı -> ...hçı
...hcu -> ...hçu
...hcü -> ...hçü
...hda -> ...hta
...hdan -> ...htan
...hde -> ...hte
...hden -> ...hten
...kca -> ...kça
...kce -> ...kçe
...kci -> ...kçi
...kcı -> ...kçı
...kcu -> ...kçu
...kcü -> ...kçü
...kda -> ...kta
...kdan -> ...ktan
...kde -> ...kte
...kden -> ...kten
...pca -> ...pça
...pce -> ...pçe
...pci -> ...pçi
...pcı -> ...pçı
...pcu -> ...pçu
...pcü -> ...pçü
...pda -> ...pta
...pdan -> ...ptan
...pde -> ...pte
...pden -> ...pten
...sca -> ...sça
...sce -> ...sçe
...sci -> ...sçi
...scı -> ...sçı
...scu -> ...sçu
...scü -> ...sçü
...sda -> ...sta
...sdan -> ...stan
...sde -> ...ste
...sden -> ...sten
...şca -> ...şça
...şce -> ...şçe
...şci -> ...şçi
...şcı -> ...şçı
...şcu -> ...şçu
...şcü -> ...şçü
...şda -> ...şta
...şdan -> ...ştan
...şde -> ...şte
...şden -> ...şten
acenta... -> acente...
aç gözlü... -> açgözlü...
açıkca... -> açıkça...
adele... -> adale...
afedersin... -> affedersin...
aksesuvar... -> aksesuar...
aktrist... -> aktris...
akıl almaz... -> akılalmaz...
Alaman... -> Alman...
allerji... -> alerji...
alt üst... -> altüst...
alış veriş... -> alışveriş...
aliminyum... -> alüminyum...
ambülans... -> ambulans
ampül... -> ampul...
ana okulu... -> anaokulu...
antreman... -> antrenman...
apandist... -> apandisit...
aperatif... -> aperitif...
Arabça... -> Arapça...
arasıra... -> ara sıra...
ardarda... -> art arda...
Arjentin... -> Arjantin...
atmış... -> altmış... (çoğu false positive)
avusturalya... -> avustralya...
ayırım... -> ayrım...
Azarbaycan... -> Azerbaycan
Azarbeycan... -> Azerbaycan
Azerbeycan... -> Azerbaycan
banço... -> banjo...
başbaşa... -> baş başa...
başı boş... -> başıboş...
belkide... -> belki de...
benle... -> benimle...
beysbol... -> beyzbol...
bir kaç... -> birkaç...
bir çok... -> birçok...
birarada... -> bir arada...
birden bire... -> birdenbire...
Biritan... -> Britan...
birsürü... -> bir sürü...
Brazilya... -> Brezilya...
bu gün... -> bugün...
bugünki... -> bugünkü...
bulüz... -> bluz...
burda... -> burada...
buyrun... -> buyurun...
büfte... -> bifte...
büyük anne... -> büyükanne...
büyük baba... -> büyükbaba...
can kurtaran... -> cankurtaran...
cimnastik... -> jimnastik...
çarşanba... -> çarşamba...
çeki düzen... -> çekidüzen...
çokaz... -> çok az...
çoşku... -> coşku...
dahada... -> daha da...
deniz aşırı... -> denizaşırı...
değilmi... -> değil mi...
deyer... -> değer...
deyme... -> değme...
doğumgünü... -> doğum günü...
döküman... -> doküman...
döğdü... -> dövdü...
döğüş... -> dövüş...
dünki... -> dünkü...
düz taban... -> düztaban...
eczahane... -> eczane...
eksoz... -> egzoz...
elele... -> el ele...
entellektüel... -> entelektüel...
eposta... -> e-posta...
eylence... -> eğlence...
eylenmek... -> eğlenmek...
fantazi... -> fantezi...
farked... -> fark ed...
farket... -> fark et...
farzed... -> farz ed...
farzet... -> farz et...
Fırans... -> Frans...
filim... -> film...
fonksyon... -> fonksiyon...
fotograf... -> fotoğraf...
gardrop... -> gardırop...
gurup... -> grup...
gök kuşağı... -> gökkuşağı...
gök yüzü... -> gökyüzü...
göz yaşı... -> gözyaşı...
gözardı... -> göz ardı...
gözkulak... -> göz kulak...
gözüpek... -> gözü pek...
haftasonu... -> hafta sonu...
haked... -> hak ed...
haket... -> hak et...
hakket... -> hak et...
halbu ki... > halbuki...
hastahane... -> hastane...
hava alanı... -> havaalanı...
hava limanı... -> havalimanı...
hem fikir... -> hemfikir...
hemde... -> hem de...
her hangi... -> herhangi...
hergün... -> her gün...
herkez... -> herkes...
herne... -> her ne...
heryer... -> her yer...
herzaman... -> her zaman...
herşey... -> her şey...
hiç bir... -> hiçbir...
hiçkimse... -> hiç kimse...
hiçte... -> hiç de...
humani... -> hümani...
ısraf... -> israf...
iki yüzlü... -> ikiyüzlü...
ilk okul... -> ilkokul...
insan oğlu... -> insanoğlu...
insiyatif... -> inisiyatif...
israr... -> ısrar...
istakoz... -> ıstakoz...
istambul... -> istanbul...
itibariyle... -> itibarıyla...
iyiki... -> iyi ki...
kamu oyu... -> kamuoyu...
kapşon... -> kapüşon...
kareografi... -> koreografi...
kaysı... -> kayısı...
klavuz... -> kılavuz...
klüp... -> kulüp...
kolleksiyon... -> koleksiyon...
kominist... -> komünist...
kompartman... -> kompartıman...
koperatif... -> kooperatif...
kılınç... -> kılıç...
kıral... -> kral...
kıraliyet... -> kraliyet...
kıraliçe... -> kraliçe...
kırallık... -> krallık...
kızarkadaş... -> kız arkadaş...
kızkardeş... -> kız kardeş...
Kürdçe... -> Kürtçe...
labaratuar... -> laboratuvar...
labaratuvar... -> laboratuvar...
madem ki... -> mademki...
mahçup... -> mahcup...
makina... -> makine...
malesef... -> maalesef...
malolmak... -> mal olmak...
Marry... -> Mary...
Mary'e... -> Mary'ye...
Mary'i... -> Mary'yi...
Mary'le... -> Mary'yle...
matamatik... -> matematik...
menejer... -> menajer...
metod... -> metot...
meyva... -> meyve...
meşkul... -> meşgul...
motorsiklet... -> motosiklet...
müdahele... -> müdahale...
müsade... -> müsaade...
müstehak... -> müstahak...
mütevazi... -> mütevazı...
nerde... -> nerede...
neyseki... -> neyse ki...
nufus... -> nüfus...
okur yazar... -> okuryazar...
onaltı... -> on altı...
onbeş... -> on beş...
onbir... -> on bir...
ondokuz... -> on dokuz...
ondört... -> on dört...
oniki... -> on iki...
onla... -> onunla...
onsekiz... -> on sekiz...
onyedi... -> on yedi...
onüç... -> on üç...
orda... -> orada...
orjinal... -> orijinal...
orta okul... -> ortaokul...
oysa ki... -> oysaki...
pantalon... -> pantolon...
parlemento... -> parlamento...
pastahane... -> pastane...
pekaz... -> pek az...
pekçok... -> pek çok...
penbe... -> pembe...
perşenbe... -> perşembe...
peşpeşe... -> peş peşe...
pilaj... -> plaj...
postahane... -> postane...
proğram... -> program...
rasgele... -> rastgele...
rasgelm... -> rast gelm...
raslantı... -> rastlantı...
sarfed... -> sarf ed...
sarfet... -> sarf et...
sarmısak... -> sarımsak...
senle... -> seninle...
sivri sinek... -> sivrisinek...
sohpet... -> sohbet...
sueter... -> süveter...
süpriz... -> sürpriz...
şöför... -> şoför...
tabi ki... -> tabii ki...
taktir... -> takdir...
terked... -> terk ed...
terket... -> terk et...
tesbih... -> tespih...
tesbit... -> tespit...
traş... -> tıraş...
Türküye... -> Türkiye...
umru... -> umuru...
ünüversite... -> üniversite...
ünvan... -> unvan...
vaz geç... -> vazgeç...
yada... -> ya da...
yanlız... -> yalnız...
yanyana... -> yan yana...
yanısıra... -> yanı sıra...
yinede... -> yine de...
yüksek okul... -> yüksekokul...
yüz ölçüm... -> yüzölçüm...
zıttı... -> zıddı...

soliloquist soliloquist 25. novembra 2020 25. novembra 2020, 17:45:48 UTC link Trvalý odkaz

> In that case, the rule should be written as
> herşey... -> her şey...

Yes. Thanks for clarifying.

> But is there a difference between rules ending in ... and other rules, then?

No, not really, as long as ... is placed correctly.

soliloquist soliloquist 25. novembra 2020, upravené dňa 25. novembra 2020 25. novembra 2020, 15:38:29 UTC, upravené 25. novembra 2020, 15:40:17 UTC link Trvalý odkaz

Thanks. The script now detects the suffix errors (in which the error is in the suffix itself) nicely, but misspelled words with suffixes (in which the error is in the root, not in the suffix) are still ignored.

For example,

herşey -> her şey

It only detects #8904790, but it should also detect #9094761 and #8871066. They have the same error, but I guess it is set to search terms as 'whole words' so the suffixed forms are ignored. Many of the terms are in a similar situation. Only the ones without suffixes are found.

soliloquist soliloquist 25. novembra 2020 25. novembra 2020, 15:38:19 UTC link Trvalý odkaz

> - is that a common Turkish habit? :P )

No, it's just a trivial and stylistic error. :-) But it's better to standardize them to avoid having duplicates. Thanks for picking them up.

soliloquist soliloquist 25. novembra 2020 25. novembra 2020, 13:13:33 UTC link Trvalý odkaz

> What's the difference between
> "Haked... -> Hak ed..."

The latter is the correct form. '...' means that the word certainly has a suffix so you can think of it as 'haked*'.

> and
> "* Öğe -> Öge"

Sorry for the confusion. The asterisk at the beginnig of that example has a different meaning. There are several other examples like that. You can ignore them. We have two Turkish language associations and they have differences of opinion about spelling of some words and that word is one of them. I put an asterisk at the beginning of them to indicate that. I now replaced them with ※ to avoid confusion.

Examples like,

...pde -> ...pte

...sden -> ...sten

show suffix errors. Such words end with those letters without having other suffixes. They can be searched on Tatoeba by putting an asterisk before them. (*pde, *sden etc.). If it's difficult to add them to the script, you can ignore them. Just having suffix support would be sufficient.

soliloquist soliloquist 25. novembra 2020 25. novembra 2020, 10:41:04 UTC link Trvalý odkaz

Thanks a million for adapting the script for Turkish. The reason I put asterisks on some of the vocabulary items was to include their suffixed forms. Otherwise the search finds only exact matches, without suffixed ones. For example,

herşey -> her şey

Without the asterisk, the search finds only 'herşey' but not the suffixed forms like 'herşeyi', 'herşeye', 'herşeyde' etc. So it's kind of a stemming support. Your script does a great job as is, but if you could find a way to include other forms, it would be much better.

Thank you very much again.

soliloquist soliloquist 24. novembra 2020 24. novembra 2020, 19:14:33 UTC link Trvalý odkaz

Thank you. It's really a good idea.

> Would there be any use for something like this for other languages?

I had created some accounts to monitor spelling errors in the Turkish corpus using vocabulary items.

https://tatoeba.org/eng/sentences/show/8243466

It would be nice to have a similar script that collects the vocabulary items of those accounts (they can be added to the script manually), searches them in the Turkish corpus and reports the results.

soliloquist soliloquist 22. novembra 2020, upravené dňa 26. mája 2021 22. novembra 2020, 15:16:24 UTC, upravené 26. mája 2021, 20:17:34 UTC link Trvalý odkaz

https://tatoeba.org/en/sentence...omment-1268881

soliloquist soliloquist 18. novembra 2020 18. novembra 2020, 13:46:42 UTC link Trvalý odkaz

Just take your time, thank you.

soliloquist soliloquist 18. novembra 2020 18. novembra 2020, 11:59:16 UTC link Trvalý odkaz

Thanks. Would you consider adding the original sentence ratio (ocnt/scnt) into the chart like you do with tcnt/scnt? The "Base of Sentences" part on the Downloads page probably has the necessary data.