Note

The data you will find here will NOT be useful unless you are coding a language tool or processing data.

If you simply want sentences that you can use to learn a language, check out the sentence lists. You can build your own, or view the ones that others have created. The lists can be downloaded and printed.

General information about the files

Many of the Japanese and English sentences are from the Tanaka Corpus, which belongs to the public domain.

Creative Commons

These files are released under CC BY 2.0 FR.

Some of our sentences are also available under CC0 1.0.

Licenses covering audio

The license covering an audio file is chosen by its contributor and is indicated on the page that lists that contributor's audio files.

Questions?

If you have questions or requests, feel free to contact us. In general, we answer quickly.

Downloads

Custom exports

Use this tool to generate and download customized exports on demand.

Sentence pairs

Download all sentences in language A that are translated into language B, along with the translations.

Weekly exports

The files provided below are updated every Saturday at 6:30 a.m. (UTC).

Sentences

Filename

{{sentences | filename}}

All languages, or only sentences in a single selected language.
File description
Contains all the sentences in the selected language. Each sentence is associated with a unique id and an ISO 639-3 language code.
Fields and structure
Sentence id [tab] Language [tab] Text
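
As a rough illustration of the three tab-separated fields described above, the following minimal sketch parses lines of this shape in Python. The `Sentence` type, the `parse_sentences` helper, and the sample lines are hypothetical and only illustrate the layout; they are not part of the export.

```python
from typing import Iterable, Iterator, NamedTuple


class Sentence(NamedTuple):
    sentence_id: int
    lang: str  # ISO 639-3 language code
    text: str


def parse_sentences(lines: Iterable[str]) -> Iterator[Sentence]:
    """Yield one Sentence per non-empty tab-separated line."""
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        # Split on the first two tabs only, so the text field is kept whole.
        sentence_id, lang, text = line.split("\t", 2)
        yield Sentence(int(sentence_id), lang, text)


# Illustrative sample lines (made up for this sketch, not taken from the file).
sample = [
    "101\teng\tThis is an example sentence.\n",
    "102\tdan\tDette er en eksempelsætning.\n",
]
sentences = list(parse_sentences(sample))
```

In practice the lines would come from the extracted file opened with UTF-8 encoding; the file name inside the archive is not stated here, so it is left out of the sketch.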

Detailed Sentences

Filename

{{sentencesDetailed | filename}}

All languages, or only sentences in a single selected language.
File description
Contains additional fields for each sentence (owner name, date created/modified).
Fields and structure
Sentence id [tab] Language [tab] Text [tab] Username [tab] Date added [tab] Date last modified

Original and Translated Sentences

Filename
sentences_base.tar.bz2
File description
Each sentence is listed as original or a translation of another. The "base" field can have the following values:
  • zero: The sentence is original, not a translation of another.
  • greater than zero: The id of the sentence from which it was translated.
  • \N: Unknown (rare).
Fields and structure
Sentence id [tab] Base field
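
The three possible values of the "base" field can be told apart with a small helper. This is only a sketch under the rules listed above; the function name `classify_base` and the returned labels are assumptions made for illustration.

```python
from typing import Optional, Tuple


def classify_base(field: str) -> Tuple[str, Optional[int]]:
    """Interpret the raw "base" field of one line.

    Returns a (label, base_id) pair, where base_id is only set
    when the sentence is a translation of another sentence.
    """
    if field == r"\N":
        # Literal backslash followed by N: the base is unknown.
        return ("unknown", None)
    value = int(field)
    if value == 0:
        return ("original", None)
    return ("translation", value)


# Illustrative lines in the "sentence id [tab] base" layout.
for raw in ["5\t0", "6\t5", "7\t\\N"]:
    sentence_id, base = raw.split("\t")
    print(sentence_id, classify_base(base))
```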

Sentences (CC0)

Filename

{{sentencesCC0 | filename}}

All languages
Only sentences in: Algerian Arabic, Ancient Hebrew, Arabic, Bengali, Berber, Catalan, Danish, English, Esperanto, Finnish, French, Phoenician, Hebrew, Hindi, Ho, Dutch, Belarusian, Ido, Interlingua, Interlingue, Italian, Japanese, Jewish Babylonian Aramaic, Jewish Palestinian Aramaic, Yiddish, Kabyle, Cantonese, Karelian, Chinese, Klingon, Konkani (Goan), Kven Finnish, Láadan, Ladino, Latin, Ligurian, Literary Chinese, Middle English, Norwegian Bokmål, Nyungar, Odia (Oriya), Old Aramaic, Old Frisian, Ancient Greek, Old Norse, Polish, Portuguese, Russian, Santali, Spanish, Swedish, Sylheti, Tachawit, Tamazight, Czech, Toki Pona, German, Ukrainian, Hungarian, Volapük, Welsh, Unknown language
File description
Contains all the sentences available under CC0.
Fields and structure
Sentence id [tab] Language [tab] Text [tab] Date last modified

Links

Filename
links.tar.bz2
File description
Contains the links between the sentences. 1 [tab] 77 means that sentence #77 is the translation of sentence #1. The reciprocal link is also present, so the file will also contain a line that says 77 [tab] 1.
Fields and structure
Sentence id [tab] Translation id
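
Because every link appears in both directions, a consumer may want either a lookup table (sentence id to translation ids) or a set of unique unordered pairs. The sketch below shows both, assuming lines shaped like the `1 [tab] 77` example; `load_links` and `unique_pairs` are hypothetical helper names.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple


def load_links(lines: Iterable[str]) -> Dict[int, Set[int]]:
    """Map each sentence id to the ids of its direct translations."""
    translations: Dict[int, Set[int]] = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        sentence_id, translation_id = line.split("\t")
        translations[int(sentence_id)].add(int(translation_id))
    return translations


def unique_pairs(translations: Dict[int, Set[int]]) -> Set[Tuple[int, int]]:
    """Collapse reciprocal links (1, 77) and (77, 1) into one sorted pair."""
    return {
        (min(a, b), max(a, b))
        for a, linked in translations.items()
        for b in linked
    }


# The example from the description: both directions are present.
links = load_links(["1\t77\n", "77\t1\n"])
pairs = unique_pairs(links)
```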

Tags

Filename
tags.tar.bz2
File description
Contains the list of tags associated with each sentence. 381279 [tab] proverb means that sentence #381279 has been assigned the "proverb" tag.
Fields and structure
Sentence id [tab] Tag name

Lists

Filename
user_lists.tar.bz2
File description
Contains the list of sentence lists.
Fields and structure
List id [tab] Username [tab] Date created [tab] Date last modified [tab] List name [tab] Editable by

Sentences in lists

Filename
sentences_in_lists.tar.bz2
File description
Indicates which sentences belong to which lists. 13 [tab] 381279 means that sentence #381279 is contained in the list with id 13.
Fields and structure
List id [tab] Sentence id

Japanese indices

Filename
jpn_indices.tar.bz2
File description
Contains the equivalent of the "B lines" in the Tanaka Corpus file distributed by Jim Breen. See this page for the format. Each entry is associated with a pair of Japanese/English sentences. Sentence id refers to the id of the Japanese sentence. Meaning id refers to the id of the English sentence.
Fields and structure
Sentence id [tab] Meaning id [tab] Text

Sentences with audio

Filename
sentences_with_audio.tar.bz2
File description
Contains the ids of the sentences, in all languages, for which audio is available. Other fields indicate who recorded the audio, its license and a URL to attribute the author. If the license field is empty, you may not reuse the audio outside the Tatoeba project.
Downloading audio
A single sentence can have one or more audio recordings, each from a different voice. To download a particular recording, use its audio id to compute the download URL. For example, to download the audio with the id 1234, the URL is https://tatoeba.org/audio/download/1234.
Fields and structure
Sentence id [tab] Audio id [tab] Username [tab] License [tab] Attribution URL
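
The URL rule and the empty-license rule above can be sketched as follows. The `AudioEntry` type, the sample line, and its username are made up for illustration. Treating a literal `\N` like an empty license is an assumption borrowed from the "unknown" marker used in the base field, not something this description states.

```python
from typing import NamedTuple

AUDIO_URL_TEMPLATE = "https://tatoeba.org/audio/download/{audio_id}"


class AudioEntry(NamedTuple):
    sentence_id: int
    audio_id: int
    username: str
    license: str
    attribution_url: str

    @property
    def download_url(self) -> str:
        # Built from the audio id, as in the 1234 example.
        return AUDIO_URL_TEMPLATE.format(audio_id=self.audio_id)

    @property
    def reusable_outside_tatoeba(self) -> bool:
        # Empty license: no reuse outside the project.
        # Assumption: a literal \N is treated as empty too.
        return self.license.strip() not in ("", r"\N")


def parse_audio_line(line: str) -> AudioEntry:
    fields = line.rstrip("\n").split("\t")
    sentence_id, audio_id, username, license_, attribution_url = fields
    return AudioEntry(int(sentence_id), int(audio_id), username,
                      license_, attribution_url)


# Illustrative line with an empty license and empty attribution URL.
entry = parse_audio_line("42\t1234\texample_user\t\t\n")
print(entry.download_url)
```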

User skill level per language

Filename
user_languages.tar.bz2
File description
Indicates the self-reported skill levels of members in individual languages.
Fields and structure
Language [tab] Skill level [tab] Username [tab] Details

Users' sentence reviews

Filename
users_sentences.csv
File description
Contains sentences reviewed by users. The value of the review can be -1 (sentence not OK), 0 (undecided or unsure), or 1 (sentence OK). Warning: this data is still experimental.
Fields and structure
Username [tab] Sentence id [tab] Review [tab] Date added [tab] Date last modified
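
One way to use the review values is to tally them per sentence. The sketch below assumes the field order given above and ignores the two date fields; `tally_reviews`, the usernames, and the sample lines are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable

# Meaning of each review value, as described above.
REVIEW_LABELS = {-1: "not OK", 0: "undecided or unsure", 1: "OK"}


def tally_reviews(lines: Iterable[str]) -> Dict[int, Counter]:
    """Count review labels per sentence id."""
    tallies: Dict[int, Counter] = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        fields = line.split("\t")
        sentence_id, review = int(fields[1]), int(fields[2])
        if review not in REVIEW_LABELS:
            raise ValueError(f"unexpected review value: {review}")
        tallies.setdefault(sentence_id, Counter())[REVIEW_LABELS[review]] += 1
    return tallies


# Illustrative lines; the date fields are left empty in this sketch.
sample_reviews = [
    "user_a\t381279\t1\t\t\n",
    "user_b\t381279\t1\t\t\n",
    "user_c\t381279\t-1\t\t\n",
]
review_tallies = tally_reviews(sample_reviews)
```

Since the data is flagged as experimental, any threshold for accepting or rejecting a sentence based on these counts is a choice left to the consumer.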

Transcriptions

Filename

{{transcriptions | filename}}

All languages
Only sentences in: Japanese, Cantonese, Chinese, Uzbek
File description
Contains all transcriptions in auxiliary or alternative scripts. A username associated with a transcription indicates the user who last reviewed and possibly modified it. A transcription without a username has not been marked as reviewed. The script name is defined according to the ISO 15924 standard.
Fields and structure
Sentence id [tab] Language [tab] Script name [tab] Username [tab] Transcription
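
The reviewed/unreviewed distinction can be derived from the username field. This sketch assumes that "without a username" means the field is an empty string; the `Transcription` type and the sample lines are hypothetical.

```python
from typing import Iterable, List, NamedTuple


class Transcription(NamedTuple):
    sentence_id: int
    lang: str
    script: str    # ISO 15924 script code, e.g. "Hrkt" or "Latn"
    username: str  # assumed empty when the transcription is unreviewed
    text: str

    @property
    def reviewed(self) -> bool:
        return bool(self.username)


def parse_transcriptions(lines: Iterable[str]) -> List[Transcription]:
    entries = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        # Limit the split so the transcription text is kept whole.
        sentence_id, lang, script, username, text = line.split("\t", 4)
        entries.append(
            Transcription(int(sentence_id), lang, script, username, text)
        )
    return entries


# Illustrative lines: the first has no username, the second has one.
transcriptions = parse_transcriptions([
    "10\tjpn\tHrkt\t\tれい\n",
    "11\tcmn\tLatn\texample_user\tlizi\n",
])
unreviewed = [t for t in transcriptions if not t.reviewed]
```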