Graphium: Scaling molecular GNNs to infinity.
A deep learning library focused on graph representation learning for real-world chemical tasks.
- ✅ State-of-the-art GNN architectures.
- 🐍 Extensible API: build your own GNN model and train it with ease.
- ⚗️ Rich featurization: powerful and flexible built-in molecular featurization.
- 🧠 Pretrained models: for fast and easy inference or transfer learning.
- ⮔ Ready-to-use training loop based on PyTorch Lightning.
- 🔌 Have a new dataset? Graphium provides a simple plug-and-play interface. Change the path, the name of the columns to predict, the atomic featurization, and you’re ready to play!
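As a sketch of what that plug-and-play setup can look like, here is a minimal config fragment. The key names below are illustrative assumptions, not Graphium's exact schema; check the shipped Hydra configs under `expts/` for the real layout.

```yaml
# Illustrative only — key names are assumptions, see the configs under expts/ for the real schema
datamodule:
  df_path: data/my_dataset.csv   # path to your dataset file
  smiles_col: smiles             # column containing the molecules
  label_cols: [solubility]       # column(s) to predict
  featurization:
    atom_property_list_onehot: [atomic-number, degree]
```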
Documentation
Visit https://graphium-docs.datamol.io/.
You can try running Graphium on Graphcore IPUs for free on Gradient.
Installation for developers
For CPU and GPU developers
Use mamba:
# Install Graphium's dependencies in a new environment named `graphium`
mamba env create -f env.yml -n graphium
# Install Graphium in dev mode
mamba activate graphium
pip install --no-deps -e .
For IPU developers
# Install Graphcore's SDK and Graphium dependencies in a new environment called `.graphium_ipu`
./install_ipu.sh .graphium_ipu
The above step needs to be done once. After that, enable the SDK and the environment as follows:
source enable_ipu.sh .graphium_ipu
Training a model
To learn how to train a model, we invite you to look at the documentation or the Jupyter notebooks available in the repository.
If you are not familiar with PyTorch or PyTorch Lightning, we highly recommend going through their tutorials first.
Running an experiment
We have set up Graphium with Hydra for managing config files. To run an experiment, go to the expts/ folder. For example, to benchmark a GCN on the ToyMix dataset, run
graphium-train dataset=toymix model=gcn
To change parameters specific to this experiment, such as switching from fp16 to fp32 precision, you can either override them directly on the CLI via
graphium-train dataset=toymix model=gcn trainer.trainer.precision=32
or change them permanently in the dedicated experiment config under expts/hydra-configs/toymix_gcn.yaml.
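The dotted override syntax comes from Hydra: each dot descends one level into the nested config, so `trainer.trainer.precision=32` updates the `precision` key two levels down. As a rough stdlib-only illustration of that mechanism (this is not Graphium or Hydra code), a single override can be applied to a nested dictionary like so:

```python
def apply_override(config: dict, override: str) -> dict:
    """Apply one Hydra-style dotted override (e.g. 'a.b.c=32') to a nested dict."""
    dotted_key, _, raw_value = override.partition("=")
    *parents, leaf = dotted_key.split(".")
    node = config
    for key in parents:
        node = node.setdefault(key, {})  # descend, creating intermediate levels if needed
    # Hydra infers value types; here we only try int and fall back to the raw string
    try:
        node[leaf] = int(raw_value)
    except ValueError:
        node[leaf] = raw_value
    return config

# Mirrors `graphium-train ... trainer.trainer.precision=32`
cfg = {"trainer": {"trainer": {"precision": 16}}}
apply_override(cfg, "trainer.trainer.precision=32")
print(cfg["trainer"]["trainer"]["precision"])  # 32
```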
Integrating Hydra also allows you to quickly switch between accelerators. E.g., running
graphium-train dataset=toymix model=gcn accelerator=gpu
automatically selects the correct configs to run the experiment on GPU. Finally, you can also run a fine-tuning loop:
graphium-train +finetuning=admet
To use a config file you built from scratch you can run
graphium-train --config-path [PATH] --config-name [CONFIG]
Thanks to the modular nature of Hydra, you can reuse many of our config settings for your own experiments with Graphium.
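One way such reuse works is through Hydra's defaults list: a top-level config composes other config groups by name. The sketch below uses standard Hydra syntax and assumes config groups named `dataset/toymix` and `model/gcn` exist, matching the `dataset=toymix model=gcn` overrides shown above.

```yaml
# Hypothetical top-level config composing existing Graphium config groups via Hydra
defaults:
  - dataset: toymix   # reuse the ToyMix dataset settings
  - model: gcn        # reuse the GCN model settings
  - _self_            # apply this file's own keys last
```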
License
Under the Apache-2.0 license. See LICENSE.
Documentation
- Diagram for data processing in molGPS.
- Diagram for Multi-task network in molGPS.