menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
Cangarejo {{ icon }} keyboard_arrow_right

Profile

keyboard_arrow_right

Sentences

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Favorites

keyboard_arrow_right

Comments

keyboard_arrow_right

Comments on Cangarejo's sentences

keyboard_arrow_right

Wall messages

keyboard_arrow_right

Logs

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate Cangarejo's sentences

email

Contact Cangarejo

Stats

Comments posted
3,172
Sentences owned
7,867
Audio recordings
0
Sentences favorited
1
Contributions
27,829
Show latest activity

Settings

  • Email notifications are ENABLED.
  • Access to this profile is PUBLIC. All the information can be seen by everyone.
Cangarejo

Cangarejo

Member since
February 13, 2022
advanced contributor
Name
-
Country
-
Birthday
-
Homepage
-
All my “original” sentences are licensed under CC0.
My translations of sentences under CC0 are also under CC0, even though Tatoeba doesn’t currently allow me to switch their license.

I wrote some scripts that try to generate lists of random sentences that are more random than the ones generated by Tatoeba.
The lists are in text file format.
https://github.com/CangarejoAsu...main/sentences (January 14, 2023)

I also have word frequency lists.
https://github.com/CangarejoAsu...ree/main/words (January 14, 2023)

And character frequency lists.
https://github.com/CangarejoAsu...ain/characters (January 14, 2023)

I often use Tatominer to find out which words have few translations into Portuguese.
Don’t forget to change the page to your language pair.
https://tatominer.netlify.app/eng-por

I use the Google query below to find public domain sentences from VOA articles containing given words.
Articles and image captions by AFP, AP, Reuters, RFA, and RFE are not in the public domain.
https://www.google.com/search?q...%22+-%22RFE%22

I use the following Google query to find public domain sentences from Gutenberg books containing given words.
Most books here, but not all, are in the public domain.
https://www.google.com/search?q...+-filetype:txt

I have a GitHub repository with scripts for processing Tatoeba dump files.
https://github.com/CangarejoAsul/tatoeba-tools

The dump files can be downloaded here.
https://tatoeba.org/downloads

Dictionaries:
https://www.infopedia.pt/
https://dicionario.priberam.org/
https://michaelis.uol.com.br/
https://www.dicio.com.br/
https://dictionary.cambridge.org/
https://www.dictionary.com/

Encyclopedias:
https://pt.wikipedia.org/
https://en.wikipedia.org/
https://www.britannica.com/

Thesauruses:
https://www.sinonimos.com.br/
https://www.wordhippo.com/

Etymology dictionaries:
https://www.etymonline.com/
https://www.arcanum.com/hu/onli...-szotar-F14D3/

Translators:
https://www.deepl.com/translator
https://translate.google.com/

Corpora:
https://www.corpusdoportugues.org/
https://www.english-corpora.org/
https://www.corpusdelespanol.org/

Translation corpora:
https://www.linguee.pt/ingles-portugues/
https://context.reverso.net/tra...les-portugues/

Frequency comparison:
https://books.google.com/ngrams/

Random name generator:
https://www.behindthename.com/random/

Symbols: — ₂

VOA terms of use:
https://www.voanews.com/p/5338.html

Gutenberg terms of use:
https://www.gutenberg.org/policy/license.html

Languages

No language added.

TIP: Encourage this user to indicate the languages he or she knows.

{{lang.name}}

{{lang.details}}