I am a software development engineer, I have been working with linguists for more than 15 years now.
My activity is mainly focused on textual data, not or weakly structured. I use, and if necessary adapt, natural language processing (NLP) tools to process these data. I develop or participate in the development of a website to expose the results.
Between linguistics, NLP and digital humanities the border is sometimes tenuous. The nature of the data can tip projects into one discipline or the other. Working with literary data, for example, requires to adapt NLP tools whose models have not generally been trained on literature.
The question of visualization is also often at the heart of digital humanities projects. NLP tools are efficient but never 100%. Errors in automatic analysis are often an issue for humanities researchers; they must be identified and updated in order to be discussed and, if possible, corrected.
I’m part of the ‘Groupe de travail humanités numériques’ (GTHN) of the EUR Translitterae.