soliloquist's messages on the Wall (total 151)

soliloquist, July 7, 2021 at 10:12:52 PM UTC

> And for RTL, maybe if we can somehow mark it as RTL would be great.

The writing direction of the language needs to be changed from LTR to auto. I had made a similar request for Ottoman Turkish. Sentences in both scripts display correctly now. #9674971
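As an illustration of what an "auto" direction usually means, here is a minimal Python sketch of the first-strong-character heuristic that HTML's `dir="auto"` is based on. How Tatoeba actually implements its "auto" setting is not stated here, so the function name and the default value are assumptions.

```python
import unicodedata


def first_strong_direction(text: str, default: str = "ltr") -> str:
    """Guess the direction from the first strongly directional character."""
    for ch in text:
        bidi = unicodedata.bidirectional(ch)
        if bidi in ("R", "AL"):  # Hebrew-like and Arabic-like letters
            return "rtl"
        if bidi == "L":  # Latin, Cyrillic, and other left-to-right letters
            return "ltr"
    # Digits, spaces, and punctuation are not strong, so fall back.
    return default
```

With this heuristic, a sentence in Arabic script resolves to `rtl` and the same language written in Latin script resolves to `ltr`, which is why both scripts can display correctly under one language setting.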

soliloquist, June 1, 2021 at 7:33:36 PM UTC

I guess most users would prefer to receive either all notifications or none. If you want email notifications of only a certain type, you can link your Tatoeba account to a new email address and create rules/filters on that email account that forward only notifications of that type to your main address.

The subject lines of emails from Tatoeba are as below:

Tatoeba PM

Tatoeba - Comment on sentence

Tatoeba - X has replied to you on the Wall

You can create rules with those patterns to forward or filter the notifications you want.
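Most mail providers let you define such rules in their settings. As a rough sketch of the same logic in Python, the snippet below classifies a subject line using the three patterns listed above. The category names and the `should_forward` helper are hypothetical, and the sketch assumes each subject starts with the listed text.

```python
import re

# Subject patterns taken from the list above; category names are made up.
SUBJECT_RULES = [
    ("private_message", re.compile(r"^Tatoeba PM")),
    ("sentence_comment", re.compile(r"^Tatoeba - Comment on sentence")),
    ("wall_reply", re.compile(r"^Tatoeba - .+ has replied to you on the Wall")),
]


def classify_subject(subject):
    """Return the notification category for a subject line, or None."""
    for kind, pattern in SUBJECT_RULES:
        if pattern.search(subject.strip()):
            return kind
    return None


def should_forward(subject, wanted):
    """True if the subject belongs to one of the wanted categories."""
    return classify_subject(subject) in wanted
```

For example, `should_forward(subject, {"wall_reply"})` would forward only Wall replies and drop everything else.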

soliloquist, March 28, 2021 at 12:38:42 PM UTC

https://github.com/Tatoeba/tatoeba2/issues/183

soliloquist, March 21, 2021 at 1:52:41 PM UTC

https://en.wiki.tatoeba.org/art...nguage-request

soliloquist, March 5, 2021 at 1:37:44 PM UTC (edited March 5, 2021 at 1:37:59 PM UTC)

Thanks for working on it! It seems you found a better method to redirect wiki pages, other than using Transifex. (https://github.com/Tatoeba/tatoeba2/issues/2626)

soliloquist, February 12, 2021 at 1:33:23 PM UTC

> but many of them disappear in the following three days.

I agree. I usually try not to write many pedantic comments under new members' sentences to avoid intimidating them.

Another underlying reason for this impermanence, I believe, is the lack of a reward mechanism and gamification, which may cause monotony. Of course, it can be argued that these have drawbacks, too. See the discussion on GitHub.

https://github.com/Tatoeba/tatoeba2/issues/2481

soliloquist, February 5, 2021 at 11:37:50 AM UTC

Much better than the old design, thank you.

soliloquist, December 13, 2020 at 10:02:49 PM UTC

> Now "Tatoeba Sentences & Translations Stats" and "Tatoeba User Activity Chart"
> show "Original sentences" numbers and "Original / total" ratio.

Thank you very much for that.

soliloquist, November 25, 2020 at 6:35:23 PM UTC

The suffixes now seem to be handled well. #8904790, #9094761 and #8871066 are all listed. Thank you very much.

soliloquist, November 25, 2020 at 6:31:45 PM UTC (edited November 25, 2020 at 6:36:12 PM UTC)

You're right. We should remove ... from it. That would give a lot of false positives.

It should only include exact matches.

onla -> onunla

(without ...)
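To show why the suffix wildcard is risky for this term, here is a small Python sketch comparing an exact whole-word match with a match that allows any suffix. The `find_words` helper and the sample sentence are made up for illustration, and the matching is case-sensitive.

```python
import re


def find_words(term, text, allow_suffix):
    """Find whole words equal to `term`, or starting with it if allow_suffix."""
    tail = r"\w*" if allow_suffix else ""
    # (?<!\w) and (?!\w) act as Unicode-aware word boundaries.
    regex = re.compile(r"(?<!\w)" + re.escape(term) + tail + r"(?!\w)")
    return regex.findall(text)


sample = "Ben onla konuştum ama onlar gelmedi."
with_suffix = find_words("onla", sample, allow_suffix=True)   # also catches "onlar"
exact_only = find_words("onla", sample, allow_suffix=False)   # only "onla"
```

Here "onlar" is a correct word that merely starts with "onla", so the wildcard version reports it as an error while the exact version does not.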

soliloquist, November 25, 2020 at 5:46:36 PM UTC (edited November 25, 2020 at 6:24:59 PM UTC)

That's it! Here's the new list. I hope it works this time.

...çca -> ...çça
...çce -> ...ççe
...çci -> ...ççi
...çcı -> ...ççı
...çcu -> ...ççu
...çcü -> ...ççü
...çda -> ...çta
...çdan -> ...çtan
...çde -> ...çte
...çden -> ...çten
...fca -> ...fça
...fce -> ...fçe
...fci -> ...fçi
...fcı -> ...fçı
...fcu -> ...fçu
...fcü -> ...fçü
...fda -> ...fta
...fdan -> ...ftan
...fde -> ...fte
...fden -> ...ften
...hca -> ...hça
...hce -> ...hçe
...hci -> ...hçi
...hcı -> ...hçı
...hcu -> ...hçu
...hcü -> ...hçü
...hda -> ...hta
...hdan -> ...htan
...hde -> ...hte
...hden -> ...hten
...kca -> ...kça
...kce -> ...kçe
...kci -> ...kçi
...kcı -> ...kçı
...kcu -> ...kçu
...kcü -> ...kçü
...kda -> ...kta
...kdan -> ...ktan
...kde -> ...kte
...kden -> ...kten
...pca -> ...pça
...pce -> ...pçe
...pci -> ...pçi
...pcı -> ...pçı
...pcu -> ...pçu
...pcü -> ...pçü
...pda -> ...pta
...pdan -> ...ptan
...pde -> ...pte
...pden -> ...pten
...sca -> ...sça
...sce -> ...sçe
...sci -> ...sçi
...scı -> ...sçı
...scu -> ...sçu
...scü -> ...sçü
...sda -> ...sta
...sdan -> ...stan
...sde -> ...ste
...sden -> ...sten
...şca -> ...şça
...şce -> ...şçe
...şci -> ...şçi
...şcı -> ...şçı
...şcu -> ...şçu
...şcü -> ...şçü
...şda -> ...şta
...şdan -> ...ştan
...şde -> ...şte
...şden -> ...şten
acenta... -> acente...
aç gözlü... -> açgözlü...
açıkca... -> açıkça...
adele... -> adale...
afedersin... -> affedersin...
aksesuvar... -> aksesuar...
aktrist... -> aktris...
akıl almaz... -> akılalmaz...
Alaman... -> Alman...
allerji... -> alerji...
alt üst... -> altüst...
alış veriş... -> alışveriş...
aliminyum... -> alüminyum...
ambülans... -> ambulans...
ampül... -> ampul...
ana okulu... -> anaokulu...
antreman... -> antrenman...
apandist... -> apandisit...
aperatif... -> aperitif...
Arabça... -> Arapça...
arasıra... -> ara sıra...
ardarda... -> art arda...
Arjentin... -> Arjantin...
atmış... -> altmış... (mostly false positives)
avusturalya... -> avustralya...
ayırım... -> ayrım...
Azarbaycan... -> Azerbaycan...
Azarbeycan... -> Azerbaycan...
Azerbeycan... -> Azerbaycan...
banço... -> banjo...
başbaşa... -> baş başa...
başı boş... -> başıboş...
belkide... -> belki de...
benle... -> benimle...
beysbol... -> beyzbol...
bir kaç... -> birkaç...
bir çok... -> birçok...
birarada... -> bir arada...
birden bire... -> birdenbire...
Biritan... -> Britan...
birsürü... -> bir sürü...
Brazilya... -> Brezilya...
bu gün... -> bugün...
bugünki... -> bugünkü...
bulüz... -> bluz...
burda... -> burada...
buyrun... -> buyurun...
büfte... -> bifte...
büyük anne... -> büyükanne...
büyük baba... -> büyükbaba...
can kurtaran... -> cankurtaran...
cimnastik... -> jimnastik...
çarşanba... -> çarşamba...
çeki düzen... -> çekidüzen...
çokaz... -> çok az...
çoşku... -> coşku...
dahada... -> daha da...
deniz aşırı... -> denizaşırı...
değilmi... -> değil mi...
deyer... -> değer...
deyme... -> değme...
doğumgünü... -> doğum günü...
döküman... -> doküman...
döğdü... -> dövdü...
döğüş... -> dövüş...
dünki... -> dünkü...
düz taban... -> düztaban...
eczahane... -> eczane...
eksoz... -> egzoz...
elele... -> el ele...
entellektüel... -> entelektüel...
eposta... -> e-posta...
eylence... -> eğlence...
eylenmek... -> eğlenmek...
fantazi... -> fantezi...
farked... -> fark ed...
farket... -> fark et...
farzed... -> farz ed...
farzet... -> farz et...
Fırans... -> Frans...
filim... -> film...
fonksyon... -> fonksiyon...
fotograf... -> fotoğraf...
gardrop... -> gardırop...
gurup... -> grup...
gök kuşağı... -> gökkuşağı...
gök yüzü... -> gökyüzü...
göz yaşı... -> gözyaşı...
gözardı... -> göz ardı...
gözkulak... -> göz kulak...
gözüpek... -> gözü pek...
haftasonu... -> hafta sonu...
haked... -> hak ed...
haket... -> hak et...
hakket... -> hak et...
halbu ki... -> halbuki...
hastahane... -> hastane...
hava alanı... -> havaalanı...
hava limanı... -> havalimanı...
hem fikir... -> hemfikir...
hemde... -> hem de...
her hangi... -> herhangi...
hergün... -> her gün...
herkez... -> herkes...
herne... -> her ne...
heryer... -> her yer...
herzaman... -> her zaman...
herşey... -> her şey...
hiç bir... -> hiçbir...
hiçkimse... -> hiç kimse...
hiçte... -> hiç de...
humani... -> hümani...
ısraf... -> israf...
iki yüzlü... -> ikiyüzlü...
ilk okul... -> ilkokul...
insan oğlu... -> insanoğlu...
insiyatif... -> inisiyatif...
israr... -> ısrar...
istakoz... -> ıstakoz...
istambul... -> istanbul...
itibariyle... -> itibarıyla...
iyiki... -> iyi ki...
kamu oyu... -> kamuoyu...
kapşon... -> kapüşon...
kareografi... -> koreografi...
kaysı... -> kayısı...
klavuz... -> kılavuz...
klüp... -> kulüp...
kolleksiyon... -> koleksiyon...
kominist... -> komünist...
kompartman... -> kompartıman...
koperatif... -> kooperatif...
kılınç... -> kılıç...
kıral... -> kral...
kıraliyet... -> kraliyet...
kıraliçe... -> kraliçe...
kırallık... -> krallık...
kızarkadaş... -> kız arkadaş...
kızkardeş... -> kız kardeş...
Kürdçe... -> Kürtçe...
labaratuar... -> laboratuvar...
labaratuvar... -> laboratuvar...
madem ki... -> mademki...
mahçup... -> mahcup...
makina... -> makine...
malesef... -> maalesef...
malolmak... -> mal olmak...
Marry... -> Mary...
Mary'e... -> Mary'ye...
Mary'i... -> Mary'yi...
Mary'le... -> Mary'yle...
matamatik... -> matematik...
menejer... -> menajer...
metod... -> metot...
meyva... -> meyve...
meşkul... -> meşgul...
motorsiklet... -> motosiklet...
müdahele... -> müdahale...
müsade... -> müsaade...
müstehak... -> müstahak...
mütevazi... -> mütevazı...
nerde... -> nerede...
neyseki... -> neyse ki...
nufus... -> nüfus...
okur yazar... -> okuryazar...
onaltı... -> on altı...
onbeş... -> on beş...
onbir... -> on bir...
ondokuz... -> on dokuz...
ondört... -> on dört...
oniki... -> on iki...
onla... -> onunla...
onsekiz... -> on sekiz...
onyedi... -> on yedi...
onüç... -> on üç...
orda... -> orada...
orjinal... -> orijinal...
orta okul... -> ortaokul...
oysa ki... -> oysaki...
pantalon... -> pantolon...
parlemento... -> parlamento...
pastahane... -> pastane...
pekaz... -> pek az...
pekçok... -> pek çok...
penbe... -> pembe...
perşenbe... -> perşembe...
peşpeşe... -> peş peşe...
pilaj... -> plaj...
postahane... -> postane...
proğram... -> program...
rasgele... -> rastgele...
rasgelm... -> rast gelm...
raslantı... -> rastlantı...
sarfed... -> sarf ed...
sarfet... -> sarf et...
sarmısak... -> sarımsak...
senle... -> seninle...
sivri sinek... -> sivrisinek...
sohpet... -> sohbet...
sueter... -> süveter...
süpriz... -> sürpriz...
şöför... -> şoför...
tabi ki... -> tabii ki...
taktir... -> takdir...
terked... -> terk ed...
terket... -> terk et...
tesbih... -> tespih...
tesbit... -> tespit...
traş... -> tıraş...
Türküye... -> Türkiye...
umru... -> umuru...
ünüversite... -> üniversite...
ünvan... -> unvan...
vaz geç... -> vazgeç...
yada... -> ya da...
yanlız... -> yalnız...
yanyana... -> yan yana...
yanısıra... -> yanı sıra...
yinede... -> yine de...
yüksek okul... -> yüksekokul...
yüz ölçüm... -> yüzölçüm...
zıttı... -> zıddı...
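For anyone who wants to process this list programmatically, here is a minimal Python sketch that turns one rule line into a regular expression. It is not the script discussed in this thread; the `Rule` type and `parse_rule` function are hypothetical, and the sketch assumes a leading `...` means "word ends with", a trailing `...` means "word starts with", and no `...` means an exact whole-word match.

```python
import re
from typing import NamedTuple

ELLIPSIS = "..."


class Rule(NamedTuple):
    wrong: str           # misspelled fragment without the "..."
    right: str           # corrected fragment without the "..."
    kind: str            # "prefix", "suffix", or "exact"
    pattern: re.Pattern  # compiled, case-sensitive search pattern


def parse_rule(line: str) -> Rule:
    """Parse a line such as 'herşey... -> her şey...' into a Rule."""
    left, arrow, right = line.partition("->")
    if not arrow:
        raise ValueError(f"no '->' in rule: {line!r}")
    wrong = left.strip()
    # Drop a trailing parenthesized note, then the "..." marker.
    right = re.sub(r"\s*\(.*\)\s*$", "", right.strip())
    right = right.replace(ELLIPSIS, "").strip()
    if wrong.startswith(ELLIPSIS):
        kind, core = "suffix", wrong[len(ELLIPSIS):]
        regex = r"\w+" + re.escape(core) + r"(?!\w)"
    elif wrong.endswith(ELLIPSIS):
        kind, core = "prefix", wrong[:-len(ELLIPSIS)]
        regex = r"(?<!\w)" + re.escape(core) + r"\w*"
    else:
        kind, core = "exact", wrong
        regex = r"(?<!\w)" + re.escape(core) + r"(?!\w)"
    return Rule(core, right, kind, re.compile(regex))
```

A caller could run `parse_rule(line).pattern.search(sentence)` over each sentence and list the hits for manual review, since some rules are known to produce false positives.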

soliloquist, November 25, 2020 at 5:45:48 PM UTC

> In that case, the rule should be written as
> herşey... -> her şey...

Yes. Thanks for clarifying.

> But is there a difference between rules ending in ... and other rules, then?

No, not really, as long as ... is placed correctly.

soliloquist, November 25, 2020 at 3:38:29 PM UTC (edited November 25, 2020 at 3:40:17 PM UTC)

Thanks. The script now detects the suffix errors (in which the error is in the suffix itself) nicely, but misspelled words with suffixes (in which the error is in the root, not in the suffix) are still ignored.

For example,

herşey -> her şey

It only detects #8904790, but it should also detect #9094761 and #8871066. They have the same error, but I guess the script searches for the terms as whole words, so the suffixed forms are ignored. Many of the terms are in a similar situation: only the forms without suffixes are found.

soliloquist, November 25, 2020 at 3:38:19 PM UTC

> - is that a common Turkish habit? :P )

No, it's just a trivial and stylistic error. :-) But it's better to standardize them to avoid having duplicates. Thanks for picking them up.

soliloquist, November 25, 2020 at 1:13:33 PM UTC

> What's the difference between
> "Haked... -> Hak ed..."

The latter is the correct form. '...' means that the word certainly has a suffix so you can think of it as 'haked*'.

> and
> "* Öğe -> Öge"

Sorry for the confusion. The asterisk at the beginning of that example has a different meaning. There are several other examples like that; you can ignore them. We have two Turkish language associations that disagree about the spelling of some words, and that word is one of them. I put an asterisk at the beginning of those words to indicate that. I have now replaced the asterisks with ※ to avoid confusion.

Examples like,

...pde -> ...pte

...sden -> ...sten

show suffix errors. Such words end with those letters without having other suffixes. They can be searched on Tatoeba by putting an asterisk before them. (*pde, *sden etc.). If it's difficult to add them to the script, you can ignore them. Just having suffix support would be sufficient.

soliloquist, November 25, 2020 at 10:41:04 AM UTC

Thanks a million for adapting the script for Turkish. The reason I put asterisks on some of the vocabulary items was to include their suffixed forms. Otherwise the search finds only exact matches, without suffixed ones. For example,

herşey -> her şey

Without the asterisk, the search finds only 'herşey' but not the suffixed forms like 'herşeyi', 'herşeye', 'herşeyde' etc. So it's kind of a stemming support. Your script does a great job as is, but if you could find a way to include other forms, it would be much better.
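To make the idea concrete, here is a short Python sketch that rewrites a misspelled root while keeping whatever suffix follows it. The `correct_prefix` function is hypothetical and case-sensitive, and its output is meant as a suggestion for a human reviewer rather than an automatic fix.

```python
import re


def correct_prefix(text, wrong, right):
    """Replace words starting with `wrong`, keeping their suffixes intact."""
    pattern = re.compile(r"(?<!\w)" + re.escape(wrong) + r"(\w*)")
    # group(1) is the suffix, e.g. "i" in "herşeyi".
    return pattern.sub(lambda m: right + m.group(1), text)
```

Calling `correct_prefix("Ben herşeyi gördüm.", "herşey", "her şey")` turns "herşeyi" into "her şeyi", and the same call would handle "herşeye" or "herşeyde".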

Thank you very much again.

soliloquist, November 24, 2020 at 7:14:33 PM UTC

Thank you. It's really a good idea.

> Would there be any use for something like this for other languages?

I had created some accounts to monitor spelling errors in the Turkish corpus using vocabulary items.

https://tatoeba.org/eng/sentences/show/8243466

It would be nice to have a similar script that collects the vocabulary items of those accounts (they can be added to the script manually), searches them in the Turkish corpus and reports the results.
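A rough Python sketch of such a script is below. It assumes the corpus comes as tab-separated lines of sentence id, language code, and text (the layout of Tatoeba's downloadable sentence export, as far as I know) and that the vocabulary items are passed in as a plain list. The function name and the sample ids in any test data are made up.

```python
import csv
import io
import re


def scan_corpus(lines, terms, lang="tur"):
    """Map each term to the ids of sentences in `lang` containing it.

    A term matches any word that starts with it, so suffixed forms count.
    """
    patterns = {
        term: re.compile(r"(?<!\w)" + re.escape(term) + r"\w*")
        for term in terms
    }
    report = {term: [] for term in terms}
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 3:
            continue  # skip malformed lines
        sentence_id, code, text = row
        if code != lang:
            continue
        for term, pattern in patterns.items():
            if pattern.search(text):
                report[term].append(int(sentence_id))
    return report
```

The `lines` argument can be an open file or, for a quick check, an `io.StringIO` holding a few sample rows.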

soliloquist, November 22, 2020 at 3:16:24 PM UTC (edited May 26, 2021 at 8:17:34 PM UTC)

https://tatoeba.org/en/sentence...omment-1268881

soliloquist, November 18, 2020 at 1:46:42 PM UTC

Just take your time, thank you.

soliloquist, November 18, 2020 at 11:59:16 AM UTC

Thanks. Would you consider adding the original sentence ratio (ocnt/scnt) into the chart like you do with tcnt/scnt? The "Base of Sentences" part on the Downloads page probably has the necessary data.
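The calculation itself is simple. In this Python sketch I am assuming `scnt` is the total number of sentences, `tcnt` the number of translations, and `ocnt` the number of original sentences; the counts below are invented for illustration.

```python
def ratio(part: int, total: int) -> float:
    """Return part / total, or 0.0 when there are no sentences at all."""
    return part / total if total else 0.0


# Hypothetical counts for one language, not real Tatoeba data.
scnt, tcnt, ocnt = 1000, 750, 250
translated_ratio = ratio(tcnt, scnt)  # existing tcnt/scnt column
original_ratio = ratio(ocnt, scnt)    # requested ocnt/scnt column
```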