Stats
- Comments posted: 3,172
- Sentences owned: 7,867
- Audio recordings: 0
- Sentences favorited: 1
- Contributions: 27,829

Cangarejo
My translations of sentences under CC0 are also under CC0, even though Tatoeba doesn’t currently allow me to switch their license.
I wrote some scripts that try to generate lists of random sentences that are more random than the ones Tatoeba generates.
The lists are plain text files.
https://github.com/CangarejoAsu...main/sentences (January 14, 2023)
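The scripts themselves are in the repository linked above and are not reproduced here. As a rough illustration only, one common way to draw a uniformly random list from a large sentence file is reservoir sampling. The function name `reservoir_sample` and the sample data are hypothetical, and this is not necessarily the method the scripts use.

```python
import random
from typing import Iterable, List, Optional


def reservoir_sample(lines: Iterable[str], k: int,
                     seed: Optional[int] = None) -> List[str]:
    """Pick up to k items uniformly at random from an iterable of unknown length.

    Hypothetical helper for illustration; not taken from the linked scripts.
    """
    rng = random.Random(seed)
    sample: List[str] = []
    for i, line in enumerate(lines):
        if i < k:
            # Fill the reservoir with the first k items.
            sample.append(line)
        else:
            # Replace a reservoir slot with probability k / (i + 1).
            j = rng.randint(0, i)  # randint is inclusive on both ends
            if j < k:
                sample[j] = line
    return sample


if __name__ == "__main__":
    # Made-up sentences standing in for a real sentence file.
    sentences = [f"Sentence number {n}." for n in range(1000)]
    for sentence in reservoir_sample(sentences, 5, seed=42):
        print(sentence)
```

Because the input is read once and only `k` items are kept in memory, the same function could be fed an open file handle instead of a list.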
I also have word frequency lists.
https://github.com/CangarejoAsu...ree/main/words (January 14, 2023)
And character frequency lists.
https://github.com/CangarejoAsu...ain/characters (January 14, 2023)
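For readers who want to build similar lists themselves, word and character frequencies can be counted with `collections.Counter`. This is a minimal sketch under stated assumptions: the tokenizing regular expression, the lowercasing of words, and the function name `count_frequencies` are illustrative choices, not the rules used for the lists linked above.

```python
import re
import unicodedata
from collections import Counter
from typing import Iterable, Tuple

# Assumption: a "word" is a run of letters, optionally joined by
# apostrophes or hyphens (e.g. "don't", "guarda-chuva").
WORD_RE = re.compile(r"[^\W\d_]+(?:['’-][^\W\d_]+)*")


def count_frequencies(sentences: Iterable[str]) -> Tuple[Counter, Counter]:
    """Return (word counts, character counts) for the given sentences."""
    words: Counter = Counter()
    chars: Counter = Counter()
    for sentence in sentences:
        # Normalize so accented letters are single code points.
        text = unicodedata.normalize("NFC", sentence)
        words.update(w.lower() for w in WORD_RE.findall(text))
        # Characters are counted as written, ignoring whitespace.
        chars.update(c for c in text if not c.isspace())
    return words, chars


if __name__ == "__main__":
    # Made-up example sentences.
    words, chars = count_frequencies(["O gato dorme.", "O cão não dorme."])
    for word, count in words.most_common(3):
        print(word, count)
    for char, count in chars.most_common(3):
        print(char, count)
```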
I often use Tatominer to find out which words have few translations into Portuguese.
Don’t forget to change the page to your language pair.
https://tatominer.netlify.app/eng-por
I use the Google query below to find public domain sentences from VOA articles containing given words.
Articles and image captions by AFP, AP, Reuters, RFA, and RFE are not in the public domain.
https://www.google.com/search?q...%22+-%22RFE%22
I use the following Google query to find public domain sentences from Gutenberg books containing given words.
Most of the books there, but not all, are in the public domain.
https://www.google.com/search?q...+-filetype:txt
I have a GitHub repository with scripts for processing Tatoeba dump files.
https://github.com/CangarejoAsul/tatoeba-tools
The dump files can be downloaded here.
https://tatoeba.org/downloads
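As a starting point for working with a dump, the sketch below filters sentences by language. It assumes the sentences file is tab-separated with three columns (sentence ID, language code, text); check the downloads page for the current layout. The function name `iter_sentences` is hypothetical and is not taken from the tatoeba-tools repository.

```python
import csv
import io
from typing import Iterator, TextIO, Tuple


def iter_sentences(handle: TextIO, lang: str) -> Iterator[Tuple[int, str]]:
    """Yield (sentence_id, text) for rows whose language code equals lang.

    Assumes tab-separated rows of the form: id, language code, text.
    """
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 3:
            continue  # skip rows that do not match the assumed layout
        sentence_id, row_lang, text = row
        if row_lang == lang:
            yield int(sentence_id), text


if __name__ == "__main__":
    # Made-up rows standing in for a real dump file.
    sample = io.StringIO("1\tpor\tOlá.\n2\teng\tHello.\n3\tpor\tBom dia.\n")
    for sentence_id, text in iter_sentences(sample, "por"):
        print(sentence_id, text)
```

With a real file, the handle would come from something like `open(path, encoding="utf-8", newline="")`, where `path` points at the downloaded and decompressed dump.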
Dictionaries:
https://www.infopedia.pt/
https://dicionario.priberam.org/
https://michaelis.uol.com.br/
https://www.dicio.com.br/
https://dictionary.cambridge.org/
https://www.dictionary.com/
Encyclopedias:
https://pt.wikipedia.org/
https://en.wikipedia.org/
https://www.britannica.com/
Thesauruses:
https://www.sinonimos.com.br/
https://www.wordhippo.com/
Etymology dictionaries:
https://www.etymonline.com/
https://www.arcanum.com/hu/onli...-szotar-F14D3/
Translators:
https://www.deepl.com/translator
https://translate.google.com/
Corpora:
https://www.corpusdoportugues.org/
https://www.english-corpora.org/
https://www.corpusdelespanol.org/
Translation corpora:
https://www.linguee.pt/ingles-portugues/
https://context.reverso.net/tra...les-portugues/
Frequency comparison:
https://books.google.com/ngrams/
Random name generator:
https://www.behindthename.com/random/
Symbols: — ₂
VOA terms of use:
https://www.voanews.com/p/5338.html
Gutenberg terms of use:
https://www.gutenberg.org/policy/license.html
Languages
No language added.