Skip to content

MEDomics-UdeS/MEDomicsLab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

⚠️ Important Notice: This platform, formerly known as MEDomicsLab, is now called MEDomics.

MEDomics platform: Integrative Modeling of Heterogeneous Data in Medicine

MEDomics logo white text

GitHub forks GitHub stars GitHub release GitHub license Gitbook docs YouTube Channel Discord server

Table of Contents

What is MEDomics?

MEDomics is an open-source platform designed to advance medical AI by bridging clinical and computational research. It empowers clinicians to design experiments, enables computer scientists to customize and adapt them, and refines processes with a flexible backend. The platform offers multiple usage levels: intuitive built-in pipelines, customizable settings, automatic code generation, and modular Python libraries—enhancing collaboration and innovation across healthcare professionals and data scientists.

MEDomics Overview

The MEDomics platform is focused on integrative modeling of heterogeneous medical data to support personalized treatments. The platform offers an environment for manipulating heterogeneous medical data and developing predictive models to enable the characterization of biological processes.

MEDomics overview

The end-goal is to give researchers the ability to leverage diverse data types contributing to more personalized and effective treatment models by enhancing the synergy between computer scientists and clinical researchers, in particular, through the code generation feature that allows for the automatic generation of Python code from the graphically created pipelines as depicted below.

MEDomics code generation

MEDomics Components

MEDomics workflow

MEDomics uses a modular workflow, detailing how patient data is processed to develop and test predictive models for clinical use. The workflow consists of the following layers:

  • Design layer: In this layer, various types of input data are integrated to support the analytical process. This includes DICOM images and tabular data in CSV format, such as time series and clinical text data from notes or reports. This layer uses the following modules:

    • Extraction Module: performs feature extraction to identify critical characteristics—like radiomic features from medical images using the MEDimage package.
    • Input Module: used to process extracted data. It provides extensive functionalities including cleaning empty lines, merging tables, and removing correlated features. It also offers a tagging tool that allows users to organize dataset columns, and offers a graphical implementation of the MEDprofiles package for patient data treatment and visualization.
    • Exploratory Module: utilized to assess data characteristics visually or statistically, with tools like Sweetviz, D-Tale, and ydata-profiling.
  • Development Layer: Once features have been extracted and treated, it enters the Development Layer, which includes key modules for model training and testing:

    • Learning Module: allows users to experiment with different machine learning (ML) models to determine the best-performing algorithms for a given clinical endpoint. After model experimentation, the ML module can help finalize a model ready for evaluation and saves it under the MEDomics format: ".medmodel".
    • Evaluation Module: enables model validation by testing it on new data and explaining its performance and outcomes using the explainerdashboard package. For models that require distributed data sources for training, a graphical implementation of the MEDfl package for simulated Federated Learning (FL) is included, allowing users to simulate federated learning experiments.
  • Deployment Layer : After finalizing and evaluating a model, it’s ready to be used in the Application Module. This module enables users to apply the trained model to new datasets or patient data, making it accessible for clinical use. Additionally, the standardized ".medmodel" format plays a crucial role in facilitating collaboration, allowing models to be shared seamlessly across teams and ensuring compatibility and ease of integration in diverse environments.

MEDomics in Action

Explore MEDomics's powerful features through the part two of the machine learning module tutorial. It highlights the intuitive interface and versatile tools available for data analysis and how it can be utilized.

Installation and Usage

Get started here.

Operating systems supported by MEDomics

Explore Documentation

Detailed documentation for MEDomics can be found here.

Engage in MEDomics

Explore our contribution page for resources on contributing to the MEDomics platform and setting up your development environment.

Acknowledgement

This project relies on the following open-source packages, and we are grateful to their developers:

MEDomics workflow