Skip to content
@MachineLearningLifeScience

Machine Learning in Life Science

Welcome to the github page for the Center for Basic Machine Learning Research in Life Science

We conduct the basic machine learning research needed to estimate representations of biomedical data that are

  • Robust
  • Interpretable
  • Data efficient
  • Reflective of inherent data uncertainty
  • Able to leverage existing knowledge

These representations are both predictive and knowledge discovery tasks.

Research

Our research focuses on four themes, and each theme advances different aspects of representation learning for life science and support each other:

  1. Meaningful representation of data and computational and mathematical tools development to realize the answer.
  2. Geometric constructions to incorporate existing knowledge into representations and ensure that the result is understandable by humans.
  3. Representation of data often appearing within life science, such as trees, graphs, and sequences.
  4. Inclusion of real data that is “noisy” and investigation of how associated uncertainty is best encoded.

Pinned Loading

  1. meaningful-protein-representations meaningful-protein-representations Public

    Jupyter Notebook 110 7

  2. stochman stochman Public

    Algorithms for computations on random manifolds made easier

    Python 94 10

  3. BEND BEND Public

    Forked from frederikkemarin/BEND

    Benchmarking DNA Language Models on Biologically Meaningful Tasks

    Python 1

  4. poli poli Public

    A library of discrete objectives

    Python 24 1

  5. hdbo_benchmark hdbo_benchmark Public

    Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"

    Python 16 1

  6. torchplot torchplot Public

    Plotting pytorch tensors made easy!

    Python 15 1

Repositories

Showing 10 of 14 repositories
  • inverse_folding_free_energies Public

    Code for the "Zero-shot protein stability prediction by inverse folding models: a free energy interpretation" paper.

    MachineLearningLifeScience/inverse_folding_free_energies’s past year of commit activity
    Jupyter Notebook 1 BSD-3-Clause 0 0 0 Updated Jan 16, 2026
  • HackathonEBM Public
    MachineLearningLifeScience/HackathonEBM’s past year of commit activity
    Jupyter Notebook 4 1 0 0 Updated Nov 24, 2025
  • poli Public

    A library of discrete objectives

    MachineLearningLifeScience/poli’s past year of commit activity
    Python 24 MIT 1 55 4 Updated Oct 5, 2025
  • poli-docs Public

    Documentation for poli and poli-baselines

    MachineLearningLifeScience/poli-docs’s past year of commit activity
    5 0 6 2 Updated Sep 28, 2025
  • poli-baselines Public

    A collection of objective functions and black box optimization algorithms related to proteins and small molecules

    MachineLearningLifeScience/poli-baselines’s past year of commit activity
    Python 7 MIT 5 16 (1 issue needs help) 4 Updated Mar 3, 2025
  • hdbo_benchmark Public

    Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"

    MachineLearningLifeScience/hdbo_benchmark’s past year of commit activity
    Python 16 1 3 1 Updated Feb 24, 2025
  • BEND Public Forked from frederikkemarin/BEND

    Benchmarking DNA Language Models on Biologically Meaningful Tasks

    MachineLearningLifeScience/BEND’s past year of commit activity
    Python 1 BSD-3-Clause 20 1 0 Updated Oct 31, 2024
  • poli-assets Public

    Assets and datasets for `poli` and `poli-baselines`

    MachineLearningLifeScience/poli-assets’s past year of commit activity
    0 0 0 0 Updated Sep 12, 2024
  • protein_regression Public

    The codebase to replicate the analysis of "A systematic analysis of regression models for protein engineering" (2024).

    MachineLearningLifeScience/protein_regression’s past year of commit activity
    Jupyter Notebook 5 MIT 2 0 0 Updated Jun 12, 2024
  • corel Public
    MachineLearningLifeScience/corel’s past year of commit activity
    Python 3 MIT 1 4 0 Updated Apr 12, 2024

Most used topics

Loading…