This is a book that contains instructional materials for our workshops.

Held at the Center for Digital Humanities at Princeton, this Institute is a collaboration with Haverford College, the Library of Congress Labs, and DARIAH, the European Digital Research Infrastructure for the Arts and Humanities.

Participants will work over the course of a year—between June 2021 and May 2022— and will meet for three intensive workshops where they will learn how to annotate linguistic data and train statistical language models using cutting-edge natural language processing (NLP) tools. They will learn best practices in project and research data management. They will join discussions with leaders in the fields of multilingual NLP and DH. They will advance their own research projects by creating, employing and interrogating text-analysis tools and methods, while increasing much-needed linguistic diversity in the field of NLP.

Further information on the project can be found here.

Please feel free to contact the project directors with questions: Natalia Ermolaev (nataliae@princeton.edu) Andrew Janco (ajanco@haverford.edu)