Skip to content
@MachineLearningLifeScience

Machine Learning in Life Science

Welcome to the github page for the Center for Basic Machine Learning Research in Life Science

We conduct the basic machine learning research needed to estimate representations of biomedical data that are

  • Robust
  • Interpretable
  • Data efficient
  • Reflective of inherent data uncertainty
  • Able to leverage existing knowledge

These representations are both predictive and knowledge discovery tasks.

Research

Our research focuses on four themes, and each theme advances different aspects of representation learning for life science and support each other:

  1. Meaningful representation of data and computational and mathematical tools development to realize the answer.
  2. Geometric constructions to incorporate existing knowledge into representations and ensure that the result is understandable by humans.
  3. Representation of data often appearing within life science, such as trees, graphs, and sequences.
  4. Inclusion of real data that is “noisy” and investigation of how associated uncertainty is best encoded.

Pinned Loading

  1. meaningful-protein-representations meaningful-protein-representations Public

    Jupyter Notebook 103 8

  2. stochman stochman Public

    Algorithms for computations on random manifolds made easier

    Python 85 11

  3. BEND BEND Public

    BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks

    Python 1

  4. torchplot torchplot Public

    Plotting pytorch tensors made easy!

    Python 14 1

  5. poli poli Public

    A library of discrete objectives

    Python 14 1

Repositories

Showing 10 of 12 repositories
  • poli Public

    A library of discrete objectives

    MachineLearningLifeScience/poli’s past year of commit activity
    Python 14 MIT 1 45 6 Updated Oct 23, 2024
  • hdbo_benchmark Public

    Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"

    MachineLearningLifeScience/hdbo_benchmark’s past year of commit activity
    Python 7 0 1 0 Updated Oct 22, 2024
  • poli-baselines Public

    A collection of objective functions and black box optimization algorithms related to proteins and small molecules

    MachineLearningLifeScience/poli-baselines’s past year of commit activity
    Python 5 MIT 2 12 (1 issue needs help) 6 Updated Oct 16, 2024
  • poli-docs Public

    Documentation for poli and poli-baselines

    MachineLearningLifeScience/poli-docs’s past year of commit activity
    5 0 4 2 Updated Sep 25, 2024
  • poli-assets Public

    Assets and datasets for `poli` and `poli-baselines`

    MachineLearningLifeScience/poli-assets’s past year of commit activity
    0 0 0 0 Updated Sep 12, 2024
  • protein_regression Public

    The codebase to replicate the analysis of "A systematic analysis of regression models for protein engineering" (2024).

    MachineLearningLifeScience/protein_regression’s past year of commit activity
    Jupyter Notebook 2 MIT 1 0 0 Updated Jun 12, 2024
  • corel Public
    MachineLearningLifeScience/corel’s past year of commit activity
    Python 2 MIT 1 4 0 Updated Apr 12, 2024
  • stochman Public

    Algorithms for computations on random manifolds made easier

    MachineLearningLifeScience/stochman’s past year of commit activity
    Python 85 Apache-2.0 11 10 0 Updated Dec 4, 2023
  • BEND Public

    BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks

    MachineLearningLifeScience/BEND’s past year of commit activity
    Python 0 BSD-3-Clause 1 1 0 Updated Nov 24, 2023
  • .github Public
    MachineLearningLifeScience/.github’s past year of commit activity
    1 0 0 0 Updated Aug 18, 2023

Top languages

Loading…

Most used topics

Loading…