Machine-Learning Research Associate – NLP/Data Extraction


A 2-year research associated position is available from September 2024 in the School of Physics and the CRANN Institute. Sponsored by Enterprise Ireland, this role will play a significant part in a commercialization project, whose final aim is the creation of a Start-up company. The project will be hosted by the Computational Spintronics Group (www.spincomp.com), headed by Prof. Sanvito.

The project focusses of the use of Natural Language Processing (NLP) to extract accurate materials information from published literature, either in the form of text, tables or pictures. Our final goal is to extract data on-demand to support the R&D and/or the marketing departments of manufacturing, chemical, materials and pharmaceutical companies in the development of new products and/or business opportunities. The project will establish the use of large language models for various extraction and classification tasks and will run 5 extraction pilots with 5 different companies across multiple technology sectors. The successful candidate will be part of the team that will establish the new startup company and will have the possibility to acquire equities in the new company. They will work closely with members of a world-leading group in materials science who have experience with NLP extraction workflows.