Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on belkacem77's sentences
Wall messages
Logs
Audio
Transcriptions
Translate belkacem77's sentences
Thanks Yorwba;
It works
Tech question for technicians:
I'm trying to bzip (unzip) sentences files I downloaded from Tatoeba in Linux Ubuntu using the command (tar -xf kab_sentences.tsv.bz2 -- just a sample) but something seems wrong with the files.
Someone can help?
Thanks in advance.
@lbdx
No, I just used notepad, but I developped some checking algorithms to check the corpus for errors. (syntax and tags)
@lbdx
For NER, Deep Learning is the most used nowadays. RNNs are the most suitable. I'm using Pytorch but you can use also TensorFlow. Algorithms are same for latin-based script languages like Kabyle, French, Spanish, English... when training.
When tagging (make predictions), the only thing that can change is the morphosyntactic segmentation algorithm because of syntax (languages have different syntax)
NER processing is a supervised learning, so you need a learning corpus (almost same as morphosyntactic corpus).
But, there are other technics for NER.
When, I finish, I'll share my work on Github.
@DJ_Saidez @Objectivesea
Thanks for you both
Hi everyone,
I want tp add a special tag to Kabyle sentences. I wish tag some sentences with the label "Named Entity".
Is there anyway to do this?
Thanks for help
Noel imgerrez.
Yes, that's what I'm looking for. Thanks a lot.
Help, help!!
How can I select English sentences from the user @CK that are not translated to Kabyle and without searching again for the same user's English sentences?
I want to keep this filter (user, english sentence) on the same view on the same tab of my browser and browing sentences to translate them.
Me too
For wrong translations (mainly syntax and grammar), two solutions are possible: 1- If the translator makes lot of errors, we can change the owner and give them to a native speaker. 2- Add a fucntionality to block users translating to other languages other than his native's one.
We suggested some toponyms and proper nouns (and not all toponyms or all proper ouns) for some reasons, among them : NLP, quizs and learning material for kab kids.... We are not suggesting to change Boston, Tom, October, Mary... like some laguages did. We are keeping translating mainly from English, but we inserted some sentences for some other purpose.
@Cabo You seem very angry. Please let this space quiet. I leave the discussion.
We corrected some errors on transifex (Website UI localization) since days and I wonder if synchronization is made at regular times.
@Cabo this profile is not a bot
https://tatoeba.org/eng/user/pr...Yazid_Bouhamam
The contributors on the kab corpora are identified. We are a team working together on diffrent types of kab sentences. We are creating and sharing content before putting them on Tatoeba. Along with Tatoeba, we are also working on diffrent open projects.
@passerby
Thanks for help.
I'm wodering you just created your account today, with no contribution yet and you are able to analyze stats. It's amazing!!!
The team is going to translate all of the sentences and you are welcome to help.
Again, for the proper names we are using (Toponymy, names of towns, cities, villages, rivers in Kabylia), an ongoing project about GPS in Kabyle is launched.
As you know, in Algeria, for political reasons, Kabyle is not taugh at school, instead, they are working on a novelangue named Tamazight that none speak just to make popular languages disapear and among them Kabyle language. Tatoeba is a quiet place for us to develop our language (Kabyle) and we want keep it quiet, so please, if you can help to translate and add better sentences, you are welcome to take part.
By the way, we have a page on FB and Youtube, you can identify yoursefl and take part on our activities about kabyle linguistics corpora.
Transparency is our motto.
Muḥend Belqasem
Computer Engineer
Lang Kab Activist
Thanks, I'll do something on our community group on FB.
Thank you
Hi everyone,
Is there any way to report wrong vocabulary?
I noticed some wrong kabyle vocabulary and I don't know how to report it or contact the author.
https://tatoeba.org/eng/vocabul..._sentences/kab
I wrote a script with Python to detect some syntactic errors (affixes, bad letters, punctuation..). It's a basic script designed only for Kabyle I shared on github
Thank you @Yorwba for sharing. I'll see this.
Indeed, Kabyle belongs to the Berber family, the same branch as Arabic, Hebrew, Amharic, . the derivation and inflectional systems are almost similar. But the script is different. Kabyle is written with Latin script.
I will see the project.
I've just heard about it. Is there any link, any contact information?