University of Pretoria
Browse

South African multilingual lexicons

Download (3.9 MB)
Version 2 2025-08-26, 14:20
Version 1 2024-09-23, 17:45
dataset
posted on 2025-08-26, 14:20 authored by Thapelo SindaneThapelo Sindane, Vukosi MarivateVukosi Marivate, Abiodun ModupeAbiodun Modupe
<p dir="ltr">This dataset contains a list of paired words for South Africa languages: Sepedi (nso), Sesotho(st), Tshivenda(ven), Xitsonga(tso), Setswana(tsn), IsiXhosa(xho), Isizulu(zul), Afrikaans(af), Isiswati(ssw), IsiNdebele(nr), and English(en). The paired words are stored in a json file with keys dict_keys(['en-af', 'en-zul', 'en-xho', 'en-ssw', 'en-nr', 'en-nso', 'en-tsn', 'en-st', 'en-ven', 'en-tso']) for ease of use and accessibility. For each key (E.g en-xho) retrieves a list of paired words between English (en) and IsiXhosa(xho). </p>

Funding

MasterCard Scholarship Foundation

ABSA Chair for Data Science

History

Department/Unit/School/Center

Computer Science

Sustainable Development Goals

  • 4 Quality Education

Usage metrics

    Engineering, Built Environment and Information Technology

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC