MLOps Engineer RE2

Job Reference

279_24_LS_LT_RE2

Position

MLOps Engineer RE2

Closing Date

Thursday, 01 August, 2024
Reference: 279_24_LS_LT_RE2
Job title: MLOps Engineer RE2

About BSC

The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, was a founding and hosting member of the former European HPC infrastructure PRACE (Partnership for Advanced Computing in Europe), and is now hosting entity for EuroHPC JU, the Joint Undertaking that leads large-scale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 1000 staff from 60 countries.

Look at the BSC experience:
BSC-CNS YouTube Channel
Let's stay connected with BSC Folks!

We are particularly interested for this role in the strengths and lived experiences of women and underrepresented groups to help us avoid perpetuating biases and oversights in science and IT research. In instances of equal merit, the incorporation of the under-represented sex will be favoured.

Context And Mission

The Language Technologies Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan government, with the mission to develop fundamental open-source resources and technologies for Spanish and Catalan. In connection with this, the LT Unit is currently in charge of three flagship projects at the national and regional level: the Spanish National Language Technology Plan and ILENIA funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence; and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department. In addition, the Unit participates in various EU funded international projects.

The LT Unit at BSC is looking for an MLOps engineer with experience in automating the development, deployment and monitoring of ML models. The MLOps engineer will develop, maintain and optimize ML pipelines to ensure high performance and reliability of ML models, furthermore ensure their effective deployment to the production. The successful candidate will work in a highly sophisticated HPC environment, will have access to state-of-the-art large models and computational infrastructures.

Key Duties

  • Design and implement ML model continuous integration, continuous deployment and continuous monitoring processes for production.
  • Develop and maintain CI/CD pipelines for ML models.
  • Configure and automate ML model deployment, monitoring and scaling in production.
  • Identify areas of improvement in ML model lifecycle.
  • Collaborate with data engineers and ML engineers to select best ML model architectures and hyperparameters.
  • Work closely with the ML engineers, specifically on Large Language Models, to develop internal tools to facilitate the development and evaluation processes.
  • Ensure the accessibility of the open software, and the replicability of their deployment via containers based architectures.
  • Maintain the code infrastructure as open source repositories.
  • Auditing the quality of the work submitted by providers, external collaborators.
  • Mentoring junior members of the team

Requirements

  • Education
    • Bachelor’s degree in Computer Science, Information Technology or related field.
    • Master’s degree will be valued
  • Essential Knowledge and Professional Experience
    • In-depth knowledge of Linux/Unix systems.
    • Demonstrable experience with Kubernetes, Docker, or other comparable containerization
    • Demonstrable experience of automating tasks via Python, bash or other scripting tools
    • Hands-on experience with Automation/DevOps tools (such as Github Actions, Gitlab, Jenkins, Ansible etc.)
    • Strong knowledge of provisioning methods via tools such as Terraform and Helm
    • Strong knowledge of ML model development and deployment tools
    • Experience with cloud technology; AWS, Azure/GCP or OpenStack.
    • Experience with monitoring and logging tools such as ELK, Prometheus, Grafana, etc.
    • Good knowledge on security best practices
  • Competences
    • Ability to work independently and in a team to complete tasks on schedule.
    • Ability to work under set deadlines

Conditions

  • The position will be located at BSC within the Life Sciences Department
  • We offer a full-time contract (37.5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance, support to the relocation procedures
  • Duration: Open-ended contract due to technical and scientific activities linked to the project and budget duration
  • Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement
  • Salary: we offer a competitive salary commensurate with the qualifications and experience of the candidate and according to the cost of living in Barcelona
  • Starting date: asap

Applications procedure and process

All applications must be made through BSC website and contain:

  • A full CV in English including contact details
  • A Cover Letter with a statement of interest in English, including two contacts for further references - Applications without this document will not be considered

    In accordance with the OTM-R principles, a gender-balanced recruitment panel is formed for every vacancy at the beginning of the process. After reviewing the content of the applications, the panel will start the interviews, with at least one technical and one administrative interview. A profile questionnaire as well as a technical exercise may be required during the process.

    The panel will make a final decision and all candidates who had contacts with them will receive a feedback with details on the acceptance or rejection of their profile.

    At BSC we are seeking continuous improvement in our recruitment processes, for any suggestions or feedback/complaints about our Recruitment Processes, please contact recruitment [at] bsc [dot] es.

    For more information follow this link

  • Deadline

    The vacancy will remain open until a suitable candidate has been hired. Applications will be regularly reviewed and potential candidates will be contacted.

    OTM-R principles for selection processes

    BSC-CNS is committed to the principles of the Code of Conduct for the Recruitment of Researchers of the European Commission and the Open, Transparent and Merit-based Recruitment principles (OTM-R). This is applied for any potential candidate in all our processes, for example by creating gender-balanced recruitment panels and recognizing career breaks etc.
    BSC-CNS is an equal opportunity employer committed to diversity and inclusion. We are pleased to consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or any other basis protected by applicable state or local law.
    For more information follow this link
    This position is reserved for candidates who meet the requirements and have the legal status of disabled persons with a degree of disability equal to or greater than 33%. In case there are no applicants with disabilities that meet the requirements, the rest of the candidates without declared disability will be evaluated.

    Application Form

    please choose one of this and if needed describe the option : - BSC Website - Euraxess - Spotify - HiPeac - LinkedIn - Networking/Referral: include who and how - Events (Forum, career fairs): include who and how - Through University: include the university name - Specialized website (Metjobs, BIB, other): include which one - Other social Networks: (Twitter, Facebook, Instagram, Youtube): include which one - Other (Glassdoor, ResearchGate, job search website and other cases): include which one
    Please, upload your CV document using the following name structure: Name_Surname_CV
    Files must be less than 3 MB.
    Allowed file types: txt rtf pdf doc docx.
    Please, upload your CV document using the following name structure: Name_Surname_CoverLetter
    Files must be less than 3 MB.
    Allowed file types: txt rtf pdf doc docx zip.
    Please, upload your CV document using the following name structure: Name_Surname_OtherDocument
    Files must be less than 10 MB.
    Allowed file types: txt rtf pdf doc docx rar tar zip.
    ** Consider that the information provided in relation to gender and nationality will be used solely for statistical purposes.