Edit product


English-Korean CDC COVID-19 Website Corpus v1

Parallel Corpus of the Centers for Disease Control and Prevention (CDC) website on COVID-19 captured on June 24, 2020. The download contains the corpus in two formats:

  • TMX including non-deduplicated segments in origin order with meta-data on source documents and alignment quality
  • TSV: UTF-8 encoded text file containing tab-separated segments deduplicated in randomized order


This translation memory of CDC COVID-19 translations by Polyglot Technology LLC is made available under the Open Data Commons Attribution License: http://opendatacommons.org/licenses/by/1.0. Individual contents of the database are in the public domain.

Source: CDC; Reference to specific commercial products, manufacturers, companies, or trademarks does not constitute its endorsement or recommendation by the U.S. Government, Department of Health and Human Services, or Centers for Disease Control and Prevention; The material is available on the agency website https://www.cdc.gov/ for no charge.

You've purchased this product

See it in your library

View in Library
Sorry, this item is not available in your location.
Sold out, please go back and pick another option.

Name a fair price:

  • Size5.44 MB
  • English source words537,204
  • English source words (deduplicated)262,393


English-Korean CDC COVID-19 Website Corpus v1

Enter your info to complete your purchase


···· ···· ···· 4242
Test card



Use a different card?


pp paypal

or pay with

We do not keep any of your sensitive credit card information on file with us unless you ask us to after this purchase is complete.

or pay with

Your purchase was successful!

We charged your card and sent you a receipt


    Gumroad Library

    Download from the App Store or text yourself a link to the app

    Good news! Since you already have a Gumroad account, it's also been added to your library.

    Powered by Gumroad