I'm a software engineer and Dartmouth alum in the Seattle area. I love studying computational linguistics and compilers. Check out my work on Github.
News:
- Updated the website with a fresh, moblie-friendlier look!
Posts:
Publications:
NMT Models with Back-Translation for the Extremely Low-Resource Indigenous Language Bribri (2020)
[pdf (300KB)] [acl]
Abstract: The paper presents a neural machine translation model and dataset for the Chibchan language Bribri, with an average performance of BLEU 16.9±1.7. This was trained on an extremely small dataset (5923 Bribri-Spanish pairs), providing evidence for the applicability of NMT in extremely low-resource environments. We discuss the challenges entailed in managing training input from languages without standard orthographies, we provide evidence of successful learning of Bribri grammar, and also examine the translations of structures that are infrequent in major Indo-European languages, such as positional verbs, ergative markers, numerical classifiers and complex demonstrative systems.