InstructLab

A new community-based approach to build truly open-source LLMs

Category: Jobs

  • Senior Software Engineer (Frontend)

    Boston, Massachusetts
    InstructLab Frontend Team

    This role reports to the Frontend Team and ensures a usable and accessible InstructLab workflow through front-end user interfaces, primarily React-based graphical web user interfaces. The focus of this role is on the user experience and requires working from the user interface (Python-based CLI) into the core engine/runtime and supporting libraries’ logic. As part of these responsibilities, you will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies.

    Note “Apply Now” job descriptions are the official job postings.

  • Senior Data Scientist

    Waterford City, Ireland
    InstructLab Models Team

    You will be responsible for the investigation, evaluation, integration, and development of open source AI/ML systems and functionality to improve the overall development and operations of both Red Hat’s downstream AI products and upstream open source AI projects. In this role, you’ll specialize in ensuring the quality of models produced by the InstructLab toolset. Qualified data scientists in this role regularly participate in design reviews and productization for major product features and bug fixes, and are experienced and comfortable working with Linux-based operating systems. Candidates must have strong experience with Python development.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer (CLI Specialist)

    Boston, Massachusetts
    InstructLab Engine & Runtime Team

    This role reports to the Engine & Runtime Team and ensures a usable and accessible InstructLab workflow through the Python-based CLI. The focus of this role is on the user experience and requires working from the CLI user interface into the core engine/runtime and supporting libraries’ logic. As part of these responsibilities, you will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies. The ideal candidate for this role will have experience building usable and accessible CLIs for Linux and/or Mac-based command lines.

    Note “Apply Now” job descriptions are the official job postings.

  • Manager, Machine Learning Engineering

    Boston, Massachusetts
    InstructLab Training Team

    A manager in this role will be an experienced, open, and transparent leader with 5+ years of experience as a software developer, software engineer, or similar roles that establish a strong understanding of software development practices. The core responsibility is developing and growing a high-performing team of software engineers through coaching, mentoring, and ongoing feedback. This manager will have strong planning, coordination, and problem-solving skills to ensure alignment within the broader InstructLab team and be familiar with upstream open source community software development practices and “the open source way.” Deep experience with Python and familiarity with AI and ML platforms, tooling, and frameworks are important for this role. The ability to quickly learn and use new tools and technologies will be critical in this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Senior Machine Learning Engineer

    Raleigh, North Carolina
    Preference Tuning Team

    This machine learning engineer will be focused on developing post-training components of InstructLab, including preference tuning and tool use. Engineers in this role will run experiments, tests, and large-scale distributed jobs in support of post-model training AI product features. They will lead a variety of coding projects in different programming languages (primarily Python), helping transition software components from research into product. Engineers in this role will also participate in and lead upstream communities with a focus on preference tuning and tool use. They will also promote machine learning and data science technologies and ongoing machine learning projects to a variety of technical and non-technical stakeholders.

    Note “Apply Now” job descriptions are the official job postings.

  • Software Quality Engineer

    Waterford City, Ireland
    InstructLab Data Ingestion Team

    This quality engineer will be focused on the quality of software that handles data preprocessing and data ingestion for InstructLab. They will be responsible for building a test suite and test automation for data ingestion tooling, and for evaluating how the quality of the synthetic data output is impacted by changes to data-ingestion-related codebases. Quality engineers who have an interest in or familiarity with data science and/or machine learning would be amazing candidates for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Taxonomy Engineer

    Boston, Massachusetts
    InstructLab Taxonomy Team

    As a taxonomy engineer, you will be responsible for developing, maintaining, and optimizing classification systems that support our AI models and data organization efforts. In this role, you will be responsible for reviewing and categorizing incoming data, issues, or tasks and prioritizing these items based on urgency, importance, and relevance. Effective taxonomy engineers will collaborate closely with data scientists, machine learning engineers, and domain experts to align taxonomies with project requirements, and to understand the effectiveness of taxonomies in supporting AI model performance. You may build tooling to optimize and enable taxonomy contributions and organization.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Quality Engineer

    Boston, Massachusetts
    Preference Tuning Team

    This quality engineer will be focused on the quality of post-training components of InstructLab, including preference tuning and tool use. They will be responsible for building a test suite and test automation for preference tuning and tool use, and for evaluating how the quality and performance of models are impacted by changes to the post-training components. Quality engineers who have an interest in or familiarity with data science and/or machine learning would be amazing candidates for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Software Quality Engineer

    Waterford City, Ireland
    InstructLab Models Team

    This quality engineer will be focused on the quality of the synthetic data output of the SDG codebase. They will be responsible for building a test suite and test automation for the SDG Python API, and for evaluating how the quality of the synthetic data output is impacted by SDG library upgrades. Quality engineers who have an interest in or familiarity with data science and/or machine learning would be amazing candidates for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer

    Waterford City, Ireland
    InstructLab RAG & Agents Team

    This is a software engineering role that involves designing, improving, and maintaining open source codebases for the InstructLab project that provide RAG and software agent capabilities. This role is both upstream and downstream focused, so you will need experience working in upstream, community-based projects, ideally as a project maintainer. Responsibilities include adhering to coding best practices and standards, including writing well-documented, scalable, and efficient code. Experience working with ML/AI tooling such as LangChain, LlamaIndex, openrag, and Haystack, as well as vector databases, will be needed for this role. Experience with prompt engineering, Hugging Face Transformers, and frontend tooling such as Gradio, Streamlit, and FastAPI will be helpful.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer

    Waterford City, Ireland
    InstructLab Data Ingestion Team

    This is a software engineering role that involves designing, extending, improving, and maintaining open source codebases for the InstructLab project that assist in the preprocessing of data for InstructLab. A key project for this role is the Docling project. Experience working in upstream, open source, community-based projects, ideally as a project maintainer, is key. Engineers in this role will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code. Experience with tooling for data collection, data streaming APIs, data preprocessing, data cleansing and formatting, working with large datasets, and using distributed and cloud-based processing of data will be ideal. Experience with ML/AI frameworks and vector databases will be helpful for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Senior AI Taxonomy Engineer

    Boston, Massachusetts
    InstructLab Taxonomy Team

    In this role, you’ll help maintain upstream and product taxonomies, help triage and maintain taxonomy contributions to the InstructLab community model, develop and maintain tooling for users to build and work with their taxonomies, and help support issues regarding the processing of taxonomy data for model training. Experience contributing to a public open source community is important in this role. A data engineering background is ideal. You can learn more about the upstream InstructLab community model taxonomy at https://github.com/instructlab/taxonomy.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Quality Engineer

    Boston, Massachusetts
    InstructLab Core Engine & Runtime Team

    This is both an upstream- and downstream-focused role; an ideal candidate will have experience working in public upstream open source codebases. They will be responsible for building a test suite and test automation for the core engine & runtime codebase on both Linux and Mac platforms, and for evaluating how the quality of models built with the tooling is impacted by updates to the codebase. An interest in or familiarity with large language models and/or GPUs or other hardware accelerators will be great for this role. Experience working with Linux on the command line is important for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Senior Software Engineer (Full Stack)

    Boston, Massachusetts
    InstructLab End-to-End Experience Team

    This role reports to the End-to-End Experience Team and ensures the full InstructLab workflow is smooth at a holistic, cross-component level. The focus of this role is on the user experience and requires working from the user interface (primarily the Python-based CLI, but also various web-based interfaces, including React) into the core engine/runtime and supporting libraries’ logic. As part of these responsibilities, you will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer (Full Stack)

    Boston, Massachusetts
    InstructLab Frontend Team

    This role reports to the Frontend Team and ensures a usable and accessible InstructLab workflow through front-end user interfaces, primarily React-based graphical web user interfaces. The focus of this role is on the user experience and requires working from the user interface (Python-based CLI) into the core engine/runtime and supporting libraries’ logic. As part of these responsibilities, you will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Quality Engineer

    Boston, Massachusetts
    InstructLab Synthetic Data Generation Team

    This quality engineer will be focused on the quality of the synthetic data output of the Synthetic Data Generation (SDG) codebase. They will be responsible for building a test suite and test automation for the SDG Python API, and for evaluating how the quality of the synthetic data output is impacted by SDG library upgrades. Quality engineers who have an interest in or familiarity with data science and/or machine learning would be amazing candidates for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer

    Boston, Massachusetts
    InstructLab API & Integration Team

    This role reports to the API & Integration Team. It is a software engineering role that involves designing, extending, improving, and maintaining an API for the InstructLab project with a great developer experience. The role also involves ensuring that developers of third-party and ecosystem tooling can easily integrate InstructLab into their projects. As part of these responsibilities, you will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies. Experience designing and maintaining APIs and other integrations is key for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Machine Learning Engineer

    Boston, Massachusetts
    InstructLab Model Evaluation Team

    This role involves evaluating the quality and performance of models produced by InstructLab. This includes model evaluation against community and industry benchmarks as well as tracking improvements to the model over time. Engineers in this role will run experiments, tests, and large-scale distributed jobs in support of evaluation-related AI product features. They will lead a variety of coding projects in different programming languages (primarily Python), helping transition software components from research into product. Engineers in this role will also participate in and lead upstream communities with a focus on model evaluation projects. They will also promote machine learning and data science technologies and ongoing machine learning projects to a variety of technical and non-technical stakeholders.

    Note “Apply Now” job descriptions are the official job postings.

  • Principal Software Engineer

    Boston, Massachusetts
    InstructLab Model Training Team

    This role reports to the Model Training Team. It is a software engineering role that involves designing, extending, improving, and maintaining the InstructLab training library and codebase. This role focuses both on the upstream training codebase and on integrating the training library downstream into Red Hat products. As part of these responsibilities, you will need to adhere to coding best practices and standards, including writing well-documented, scalable, and efficient code, and stay up to date with software engineering technologies, frameworks, and methodologies. Experience maintaining upstream open source community projects is key for this role.

    Note “Apply Now” job descriptions are the official job postings.

  • Manager, Full Stack Engineering

    Boston, Massachusetts
    InstructLab Frontend Team

    A manager in this role will be an experienced, open, and transparent leader with 5+ years of experience as a software developer, software engineer, or similar roles that establish a strong understanding of software development practices. The core responsibility is developing and growing a high-performing team of software engineers through coaching, mentoring, and ongoing feedback. This manager will have strong planning, coordination, and problem-solving skills to ensure alignment within the broader InstructLab team and be familiar with upstream open source community software development practices and “the open source way.” Deep experience with full stack software development, which may include LAMP, JAM, or MEAN stack experience, is important for this role. Ideally, this candidate will have some familiarity with AI and machine learning platforms, tools, and frameworks. The ability to quickly learn and use new tools and technologies will be critical in this role.

    Note “Apply Now” job descriptions are the official job postings.