Domain-oriented LLM development (RE3) - AI4S

Job Reference

565_24_ES_CES_RE3

Position

Domain-oriented LLM development (RE3) - AI4S

Data de tancament

Dissabte, 31 Agost, 2024
Reference: 565_24_ES_CES_RE3
Job title: Domain-oriented LLM development (RE3) - AI4S

 

About BSC
 
The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, was a founding and hosting member of the former European HPC infrastructure PRACE (Partnership for Advanced Computing in Europe), and is now hosting entity for EuroHPC JU, the Joint Undertaking that leads large-scale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 1000 staff from 60 countries.

Look at the BSC experience:
BSC-CNS YouTube Channel
Let's stay connected with BSC Folks!

We are particularly interested for this role in the strengths and lived experiences of women and underrepresented groups to help us avoid perpetuating biases and oversights in science and IT research. In instances of equal merit, the incorporation of the under-represented sex will be favoured.

We promote Equity, Diversity and Inclusion, fostering an environment where each and every one of us is appreciated for who we are, regardless of our differences.

If you consider that you do not meet all the requirements, we encourage you to continue applying for the job offer. We value diversity of experiences and skills, and you could bring unique perspectives to our team.
 
Context And Mission
 
The successful candidate will join the Computational Earth Sciences group to lead the development and fine-tuning of domain-specific large language models (LLMs) tailored for weather and climate applications. This role involves creating and curating specialized corpora and training LLMs to improve model predictions in these scientific domains. Additionally, the candidate will coordinate the development of AI-based atmospheric modeling emulators, enhancing the capabilities of existing models.

The candidate will also oversee activities related to porting Earth Science model modules to GPU architectures. This includes contributing to the development of GPU-optimized kernels and ensuring efficient integration with CPU computations, resulting in heterogeneous applications. The successful candidate will be expected to advance research into new computational architectures, supporting the group’s mission to innovate in high-performance computing for Earth Sciences.

The funding for these actions/fellowships and contracts comes from the European Union Recovery and Resilience Facility - Next Generation, within the framework of the General Invitation by the public business entity Red.es to participate in the talent attraction and retention programs within Investment 4 of Component 19 of the Recovery, Transformation, and Resilience Plan.
For more information, please check: https://www.bsc.es/join-us/excellence-career-opportunities/ai4s

"La financiación de estas actuaciones/becas y contratos, procede del Mecanismo de Recuperación y Resiliencia de la Unión Europea-Next Generation, en el marco de la Invitación General de la entidad pública empresarial Red.es para participar en los programas de atracción y retención del talento dentro de la Inversión 4 del Componente 19 del Plan de Recuperación, Transformación y Resiliencia.
Para más información: https://www.bsc.es/join-us/excellence-career-opportunities/ai4s "

 

 

Key Duties
 
  • Lead the development and fine-tuning of domain-specific LLMs for weather and climate applications.
  • Develop and manage specialized corpora to improve LLM accuracy and relevance for atmospheric science.
  • Coordinate the development of AI-based atmospheric modeling emulators.
  • Contribute to integrating heterogeneous computing applications, leveraging both CPUs and GPUs.
  • Support the design and optimization of GPU kernels for efficient execution and integration with CPU-based computations.
    Conduct and publish research on new computational architectures and techniques.
 
Requirements
 
  • Education
    • Bachelor’s degree in Computer Science, Physics, or a related discipline.
    • A Master’s degree or Ph.D. in a relevant field will be highly valued.
  • Essential Knowledge and Professional Experience
    • Strong programming skills in high-level languages (e.g., Python, C/C++).
    • Experience with AI/ML frameworks (e.g., TensorFlow, PyTorch) and LLM fine-tuning.
    • Proven experience in developing and fine-tuning LLMs, focusing on scientific applications.
    • Experience with UNIX/LINUX environments and scripting languages.
    • Familiarity with HPC systems, parallel programming, and heterogeneous computing.
  • Additional Knowledge and Professional Experience
    • Fluency in English is essential. Proficiency in Spanish and other European languages would be advantageous.
    • Experience with GPU programming (CUDA, OpenCL, etc.).
    • Previous experience in the development and application of AI-based modeling emulators.
    • Expertise in porting and optimizing computational models for GPU architectures.
    • Proven ability to manage large-scale, collaborative projects, including version control (Git, SVN).
    • Previous experience in supervising PhD, Masters, or undergraduate students.
  • Competences
    • Strong communication and interpersonal skills to facilitate collaboration within a multidisciplinary team.
    • Proven leadership and management abilities to coordinate complex projects.
    • Ability to interact with domain and computer scientists to drive collaborative research.
 
Conditions
 
  • The position will be located at BSC within the Earth Sciences Department
  • We offer a full-time contract (37.5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance
  • Duration: 4 years
  • Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement
  • Salary: 50.00,00€
  • Additional Expenses Grant: Each fellowship will be associated with a grant for additional expenses, such as IT equipment, travel, training, stays, etc.
  • Starting date: asap - the incorporation for this vacancy must be before the 16th of December 2024
 
Applications procedure and process
 

All applications must be submitted via the BSC website and contain:

  • A full CV in English, including contact details.
  • A cover/motivation letter with a statement of interest in English, clearly specifying for which specific area and topics the applicant wishes to be considered. Additionally, two references for further contacts must be included. Applications without this document will not be considered.

 

Development of the recruitment process

The selection will be carried out through a competitive examination system ("Concurso-Oposición"). The recruitment process consists of two phases:

  1. Curriculum Analysis: Evaluation of previous experience and/or scientific history, degree, training, and other professional information relevant to the position. - 40 points
  2. Interview phase: The highest-rated candidates at the curriculum level will be invited to the interview phase, conducted by the corresponding department and Human Resources. In this phase, technical competencies, knowledge, skills, and professional experience related to the position, as well as the required personal competencies, will be evaluated. - 60 points. A minimum of 30 points out of 60 must be obtained to be eligible for the position.

The recruitment panel will be composed of at least three people, ensuring at least 25% representation of women.

In accordance with OTM-R principles, a gender-balanced recruitment panel is formed for each vacancy at the beginning of the process. After reviewing the content of the applications, the panel will begin the interviews, with at least one technical and one administrative interview. At a minimum, a personality questionnaire as well as a technical exercise will be conducted during the process.

The panel will make a final decision, and all individuals who participated in the interview phase will receive feedback with details on the acceptance or rejection of their profile.

 

At BSC, we seek continuous improvement in our recruitment processes. For any suggestions or comments/complaints about our recruitment processes, please contact recruitment [at] bsc [dot] es.

For more information, please follow this link.

 
Deadline
 
The vacancy will remain open until a suitable candidate has been hired. Applications will be regularly reviewed and potential candidates will be contacted.
 
OTM-R principles for selection processes
 
BSC-CNS is committed to the principles of the Code of Conduct for the Recruitment of Researchers of the European Commission and the Open, Transparent and Merit-based Recruitment principles (OTM-R). This is applied for any potential candidate in all our processes, for example by creating gender-balanced recruitment panels and recognizing career breaks etc.
BSC-CNS is an equal opportunity employer committed to diversity and inclusion. We are pleased to consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or any other basis protected by applicable state or local law.
For more information follow this link

 

Application Form

please choose one of this and if needed describe the option : - BSC Website - Euraxess - Spotify - HiPeac - LinkedIn - Networking/Referral: include who and how - Events (Forum, career fairs): include who and how - Through University: include the university name - Specialized website (Metjobs, BIB, other): include which one - Other social Networks: (Twitter, Facebook, Instagram, Youtube): include which one - Other (Glassdoor, ResearchGate, job search website and other cases): include which one
Please, upload your CV document using the following name structure: Name_Surname_CV
Els fitxers han de ser de menys de 3 MB.
Tipus de fitxers permesos: txt rtf pdf doc docx.
Please, upload your CV document using the following name structure: Name_Surname_CoverLetter
Els fitxers han de ser de menys de 3 MB.
Tipus de fitxers permesos: txt rtf pdf doc docx zip.
Please, upload your CV document using the following name structure: Name_Surname_OtherDocument
Els fitxers han de ser de menys de 10 MB.
Tipus de fitxers permesos: txt rtf pdf doc docx rar tar zip.
** Consider that the information provided in relation to gender and nationality will be used solely for statistical purposes.