Li Nguyen


Xin chào! 你好! Yaama!

I am a linguist specialising in language variation and change, contact linguistics, corpus linguistics. Much of my work is corpora-based and focuses on contact phenomena in bilingual settings. My interest in Linguistics is however diverse, and I have written on a wide range of topics, including variationist linguistics, discourse analysis, heritage language, ethical data collection methods, multilingual NLP and NLP for spoken low-resource varieties.

I completed my PhD at the University of Cambridge in 2021 and now work as a Research Associate at the Cambridge Department of Computer Science and Technology. I am also a Linguistics Consultant at the Vietnamese Lexicography Centre. Before that, I was a co-editor for Cambridge Occasional Papers in Linguistics (COPiL) and a Visitor and a Research Assistant at ANU School of Literature, Languages and Linguistics, where I worked on an IAFPA-funded project on forensic voice comparison.

My PhD, fully funded by the Cambridge Trust, explored language variation and change across generations in the Canberra Vietnamese community, using an original, natural corpus of spontaneous speech (the Canberra Vietnamese-English Corpus/CanVEC). My research specifically considered the extent to which we may automate the processing of multilingual corpora, and examined how this data allowed us to characterise speakers’ linguistic variation and change across both their monolingual and bilingual discourse. Prior to Cambridge, I gained a Master of General and Applied Linguistics (Advanced) at the Australian National University (ANU), and also a Bachelor of Arts (Honours I) at the University of Canberra.

I am particularly interested in projects on language contact, language evolution, heritage speakers, linguistically and socially informed NLP, responsible research methods, and innovative technologies that can accommodate multilingual speakers.