University of Turku

Postdoctoral Researcher to work on large language model pre-training

2024-01-11 (Europe/Helsinki)
Save job

About the employer

The University of Turku has a unique, creative and inspirational work environment. Here you will work with top experts, pedagogues and researchers.

Visit the employer page

TurkuNLP (https://turkunlp.org) is an internationally established multidisciplinary research group that specializes in the development of machine learning -based resources, models, and tools for natural language processing (NLP) using web-scale datasets. In recent years TurkuNLP has created a number of openly available Transformer-based language models, including e.g. FinBERT and FinGPT, the leading Finnish BERT and GPT-3 models. TurkuNLP was also selected as the pilot user of the Europe’s largest supercomputer LUMI and has a long tradition in the use of high performance computing in NLP.

We welcome applications for 1 fixed-term position of a Postdoctoral Researcher to work on large language model pre-training. The position is for the period of February 1, 2024 (a more precise date can be agreed upon) to August 31, 2025, and can be either part-time or full-time accommodating the applicants’ wishes and availability. 

The position is part of the research project High Performance Language Models (HPLT) funded by Horizon Europe. The project will create next-generation large language models covering all European languages and more by training on terabytes of data using the largest supercomputer in Europe, LUMI. For more information, please visit https://hplt-project.org/ .

We offer you

The position offers an excellent opportunity to participate in the development of large language models in a team which has the experience, data, and computational resources. The HPLT project is set apart from many similar efforts by (1) having a very large allotment of computational time at its disposal, on the order of 15+ million GPU hours on Europe’s largest supercomputer, (2) having secured access to very large textual datasets, including partial dumps of the Internet Archive, and (3) being a multinational collaboration, bringing together NLP expertise from several well established labs and data companies. The project is well networked and regularly interacts with the key players in the field.

Key tasks and responsibilities

You will work as part of the HPLT project team in TurkuNLP. The key tasks will revolve around the steps needed to pre-train, from scratch, very large multilingual language models. A particular focus will be on training large multi-lingual models on the AMD architecture of the LUMI supercomputer. Post-doctoral researchers are additionally expected to participate in the supervision of more junior researchers and other similar tasks typically associated with a post-doctoral position in academia.


Who we are looking for

Qualifications

The qualification requirements of the positions are stated in the University of Turku Rules of Procedure: https://www.utu.fi/en/university/organisation.
A person selected for the position of Postdoctoral researcher is required to have a doctoral degree relevant for the position and the ability to do independent scientific work as well as having the necessary teaching skills. 

 

Our expectations

The project is technical in its nature and we are primarily favoring candidates with prior experience, enabling a fast start in the project.

  • We expect good programming and scripting skills. Especially an intermediate-to-advanced knowledge of python as well as some knowledge of bash scripting and the command line environment are crucial for a successful contribution to the project given its computing environment. A prior experience with deep learning, ideally in the pytorch framework, is also expected.
  • Any prior experience in training generative language models, even if small in size, is considered a distinct advantage.
  • Any experience with large-scale, distributed training of deep learning models in a cluster computing environment is considered a substantial advantage.
  • Any experience with large textual dataset processing is considered a distinct advantage.

Salary and trial period

The salaries are determined in accordance with the university salary system for teaching and research personnel. For a postdoctoral researcher the salary will be in the beginning of the employment around 3400-3600 euros/month, depending on previous experience and skills.

The amount of the salary is defined when the employment contract is prepared.

The salaried position includes extensive occupational healthcare, affordable sports services and a holiday allowance.

A six (6) month trial period applies to the position. 

Application guidelines

Applications must be submitted by Thursday January 11, 2024 at 16:00 (UTC+3) using the University's electronic application form. The link to the system is provided at the beginning of this announcement ("Apply for this position") and can also be found at utu.fi/career.

The application should include:
- a motivation letter
- CV
- copy of degree certificate(s) with translation in English or Finnish
- list of publications
Additional information

For further information about the position and the project, kindly contact Sampo Pyysalo, sampo.pyysalo@gmail.com. For questions related to the application process, contact HR specialist Nina Reini, nina.reini@utu.fi.
University of Turku reserves the right for justified reasons to leave the position open, to extend the application period, reopen the application process, and to consider candidates who have not submitted applications during the application period.

We value equality and diversity in our work community and encourage qualified applicants, regardless of background, to apply for our open positions.

Please read more about University of Turku as an employer on the Come work with us! page and the welcome services for newcomers

The European Commission has awarded the University of Turku the right to use the HR Excellence in Research logo. The logo is a token of the University's commitment to continuous development of the position and working conditions of researchers according to the guidelines set forth in the European Charter for Researchers. 

The university offers good support and orientation for international hires. Please learn more about the Finnish culture and relocation to Finland:

 


Department of Computing

The Department of Computing educates professionals for the modern digitalised society. Central themes of teaching and research include Digitalisation, Machine Learning, Autonomous and Intelligent Systems, Cybersecurity and Health Technology. 

The Faculty of Technology is the newest faculty of the University of Turku, it was established in the beginning of 2021. The Faculty consists of the Department of Computing, Department of Biotechnology and the Department of Mechanical and Materials Engineering. We have more than 2,000 graduate and postgraduate students and circa 400 employees, 50 of whom are professors. Our faculty is an international, multicultural and diverse work community.

The University of Turku is an inspiring and international academic community of 25,000 students and staff in Southwest Finland. We build a sustainable future with multidisciplinary research, education, and collaboration. With us, your work will have a significant impact and relevance in the changing world.

See our open vacancies

HR Excellence in Research Logo

Job details

Title
Postdoctoral Researcher to work on large language model pre-training
Location
Vesilinnantie 5 Turku, Finland
Published
2023-12-20
Application deadline
2024-01-11 16:00 (Europe/Helsinki)
2024-01-11 15:00 (CET)
Job type
Save job

More jobs from this employer

About the employer

The University of Turku has a unique, creative and inspirational work environment. Here you will work with top experts, pedagogues and researchers.

Visit the employer page

This might interest you

...
Deciphering the Gut’s Clues to Our Health University of Turku 5 min read
...
Understanding Users to Optimise 3D Experiences Centrum Wiskunde & Informatica (CWI) 5 min read
...
Control Systems: The Key to Our Automated Future? Max Planck Institute for Software Systems (MPI-SWS) 5 min read
More stories