Paul Kervegan

Computer Science
Social Sciences
text mining, OCR
, updated on
16 May 2022
Paul Kervegan
Observatoire des humanités numériques de l'ENS-PSL / EUR Translitterae

Intern working on optical character recognition (OCR), text mining and web development in the MSS / Katabase project (École normale supérieure / Unige / École nationale des chartes).

Field of research

École du Louvre alumni and student in « Digital technologies applied to history » at the École nationale des chartes, I develop methods to extract and make sense of textual data in the MSS project. I aim to enhance the OCR model and to automate the processing, normalisation and enrichment of the extracted data. The goal is to share the processed data online through the Katabase platform and to bring forward « distant reading » methods for sale catalogues. I am interested in the econometric study of the catalogues in order to identify the key factors in the manuscripts’ price fluctuations. Using graphic design and data visualisation to make meaning out of textual data, I am also enhancing the structure and design of the Katabase website.

My other research interests include contemporary art and anthropology of experimental and underground music.