A Report for the SADiLaR-Wikipedia-PanSALB Project for South African Languages

Authors

  • Muzi Matfunjwa

    South African Centre for Digital Language, North-West University, Potchefstroom 2531, South Africa

  • Nomsa Skosana

    South African Centre for Digital Language, North-West University, Potchefstroom 2531, South Africa

  • Lebogang Boemo

    South African Centre for Digital Language, North-West University, Potchefstroom 2531, South Africa

DOI:

https://doi.org/10.30564/fls.v7i5.9068
Received: 12 March 2025 | Revised: 20 April 2025 | Accepted: 24 April 2025 | Published Online: 7 May 2025

Abstract

South African languages are underrepresented in online encyclopaedias, which limits access to free and open information in these languages. This report provides an overview of the SADiLaR-Wikipedia-PanSALB (SWiP) project and how it was utilised to create content in Wikipedia for South African Languages. The creation of Wikipedia content was facilitated through training participants across 11 public universities in South Africa wherein they were taught how to contribute to Wikipedia. The participants mainly consisted of students, lecturers, language practitioners, and community members. They learned how to activate Wikipedia translation tools on their Wikipedia accounts, search for their preferred language and translate selected articles. These participants were also trained to edit the translated and published articles, create new ones in their respective languages, and structure them according to Wikipedia's guidelines for article creation. The training also included explanations on how to add references and images, as well as how to link articles to other Wikipedia pages. The SWiP project resulted in the creation of 737 articles, 160 images and 1960 references. The new corpora developed on Wikipedia are currently used by human language technology developers to create and improve tools for South African languages. Therefore, the SWiP project has significantly enhanced the visibility of South African languages on Wikipedia and improved access to information in these languages.

Keywords:

SWiP; Corpora; Wikipedia; South African languages

References

[1] SWiP Project, n.d. SWiP Project. Available from: https://sadilar.org/en/swip/ (cited 12 November 2024).

[2] SADiLaR, n.d. SADiLaR. Available from: https://sadilar.org/en/ (cited 20 August 2024).

[3] Wikipedia Foundation, n.d. Wikipedia. Available from: https://www.wikipedia.org/ (cited 6 October 2024).

[4] PanSALB, n.d. PanSALB. Available from: https://www.pansalb.org/ (cited 12 September 2024).

[5] McDonough, D.J., 2017. Expanding the sum of all human knowledge: Wikipedia, translation and linguistic justice. The Translator. 23(2), 143–157. DOI: https://doi.org/10.1080/13556509.2017.1321519

[6] Five pillars of Wikipedia, n.d. Five pillars of Wikipedia. Available from: https://en.wikipedia.org/wiki/Wikipedia:Five_pillars (cited 15 December 2024).

[7] SWiP project dashboard, n.d. SWiP project dashboard. Available from: https://outreachdashboard.wmflabs.org/courses/SADiLaR,_Wikipedia,_PanSALB/SWiP_Workshops (cited 4 February 2025).

[8] SWiP Resource page, n.d. SWiP Resource page. Available from: https://meta.wikimedia.org/wiki/SWiP_Resource_Page (cited 4 February 2025).

[9] SWiP writing competition dashboard, n.d. SWiP writing competition dashboard. Available from: https://outreachdashboard.wmflabs.org/courses/SADiLaR,_Wikipedia,_PanSALB/SWiP_Writing_Competition_(2024)/ (cited 5 September 2024).

Downloads

How to Cite

Matfunjwa, M., Skosana, N., & Boemo, L. (2025). A Report for the SADiLaR-Wikipedia-PanSALB Project for South African Languages. Forum for Linguistic Studies, 7(5), 598–603. https://doi.org/10.30564/fls.v7i5.9068

Issue

Article Type

Short Communications