InstructLab

A new community-based approach to building truly open-source LLMs

Tag: principal

  • Principal Software Engineer (CLI Specialist)

    Boston, Massachusetts
    InstructLab Engine & Runtime Team

    This role reports to the Engine & Runtime Team and is responsible for a usable, accessible InstructLab workflow through the Python-based CLI. The focus is on user experience, which requires working from the CLI down into the logic of the core engine/runtime and supporting libraries. You will be expected to follow coding best practices and standards, write well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies. The ideal candidate will have experience building usable and accessible CLIs for Linux and/or Mac command lines.

    Note: “Apply Now” job descriptions are the official job postings.

  • Principal Software Quality Engineer

    Boston, Massachusetts
    Preference Tuning Team

    This quality engineer will focus on the quality of the post-training components of InstructLab, including preference tuning and tool use. They will be responsible for building a test suite and test automation for preference tuning and tool use, and for evaluating how changes to the post-training components affect model quality and performance. Quality engineers with an interest in or familiarity with data science and/or machine learning would be amazing candidates for this role.

    Note: “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer

    Waterford City, Ireland
    InstructLab RAG & Agents Team

    This is a software engineering role that involves designing, improving, and maintaining open source codebases for the InstructLab project that provide RAG and software agent capabilities. The role is both upstream- and downstream-focused, so you will need experience working in upstream, community-based projects, ideally as a project maintainer. Responsibilities include following coding best practices and standards and writing well-documented, scalable, and efficient code. Experience with ML/AI tooling such as LangChain, LlamaIndex, openrag, and Haystack, and with vector databases, is required. Experience with prompt engineering, Hugging Face Transformers, and frontend tooling such as Gradio, Streamlit, and FastAPI will be helpful.

    Note: “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer

    Waterford City, Ireland
    InstructLab Data Ingestion Team

    This is a software engineering role that involves designing, extending, improving, and maintaining open source codebases for the InstructLab project that assist in preprocessing data for InstructLab. A key project for this role is Docling. Experience working in upstream, open source, community-based projects, ideally as a project maintainer, is key. Engineers in this role will need to follow coding best practices and standards and write well-documented, scalable, and efficient code. Experience with tooling for data collection, data streaming APIs, data preprocessing, data cleansing and formatting, working with large datasets, and distributed and cloud-based data processing will be ideal. Experience with ML/AI frameworks and vector databases will be helpful.

    Note: “Apply Now” job descriptions are the official job postings.

  • Principal Software Quality Engineer

    Boston, Massachusetts
    InstructLab Core Engine & Runtime Team

    This role is both upstream- and downstream-focused; an ideal candidate will have experience working in public upstream open source codebases. They will be responsible for building a test suite and test automation for the core engine & runtime codebase on both Linux and Mac platforms, and for evaluating how updates to the codebase affect the quality of models built with the tooling. An interest in or familiarity with large language models and/or GPUs or other hardware accelerators will be great for this role. Experience working with Linux on the command line is important.

    Note: “Apply Now” job descriptions are the official job postings.