Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

portfolio

projects

Medical Machine Translation

EPSRC GCRF-funded project in collaboration with the University of Cape Town to build a machine translation system to facilitate communication in the medical domain between isiXhosa-speaking patients and English-speaking doctors in health centres in South Africa. I participated as a research assistant at the University of Edinburgh.

ParaCrawl

Long-term EU(CEF)-funded project to collect parallel corpora from large-scale web crawls. I participated during my time as a data engineer at one of the project partners — TAUS.

publications

talks

Xhosa-English Machine Translation for the Medical Domain

Published:

As part of a project to develop medical domain machine translation systems between isiXhosa and English (see project page for more details), the University of Cape Town hosted a workshop, where I presented the challenges of building such a system in low-resource languages, some low-resource techniques to build better MT systems, the data used for building the medical domain isiXhosa-English MT models, and some preliminary results.

teaching

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.