Hierarchical autoencoder fusion for improved multi-omics integration in cancer survival models

This repository repackages this work as a library so that it can be used more easily from another repository of mine. For the original code and structure, please refer to the original repository at: https://github.com/BoevaLab/Hierarchical-autoencoder-fusion-for-improved-multi-omics-integration-in-cancer-survival-models

Reproduction guide

To reproduce any of the results, all you have to do is run the requisite bash scripts.

You may obtain a copy of all input data files on Polybox for reproduction.

Setup

bash setup.sh

Please note that you might have to run the setup script with sudo depending on your setup.

Recreating data and splits (optional, since the preprocessed data is also included above)

Please note that to recreate the data and splits, you might need to adjust the path to the Python installation that is used when creating the splits. I attempted to make this as reproducible as possible, but unfortunately, the exact path can depend on your OS and conda version.

To change the path, first run `which python` in your terminal after activating the requisite conda env via `conda activate hierarchical_fusion`. Afterward, change the following line in the `src/chores/get_new_cancers.R` script:

```R
Sys.setenv(PATH = paste(c(paste0("/Users/", Sys.info()[["user"]], "/miniforge3/envs/hierarchical_fusion/bin"), Sys.getenv("PATH"),
  collapse = .Platform$path.sep
), collapse = ":"))
```

to the following:

```R
Sys.setenv(PATH = paste(c("your output from which python goes here", Sys.getenv("PATH"),
  collapse = .Platform$path.sep
), collapse = ":"))
```

Please note that this might also work by default if you use miniforge (i.e., you might not have to change it).
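
Since `which python` prints the full path of the interpreter while the default value in the script ends in `/bin`, it is probably the containing directory that belongs in `PATH`. The following is a minimal sketch of how to print that directory; the helper name `python_bin_dir` is hypothetical and not part of this repository.

```shell
# Hypothetical helper (not part of this repository): print the directory
# that contains an interpreter found on the current PATH.
python_bin_dir() {
  interpreter="${1:-python}"
  # command -v prints the resolved path and fails if nothing is found.
  resolved="$(command -v "$interpreter")" || {
    echo "Interpreter not found on PATH: $interpreter" >&2
    return 1
  }
  dirname "$resolved"
}
```

After `conda activate hierarchical_fusion`, calling `python_bin_dir python` should print a path ending in `envs/hierarchical_fusion/bin`, assuming a standard conda layout.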

To recreate the data and splits, simply run `bash recreate_data_and_splits.sh`.

Re-running benchmarks

To re-run all benchmarks, run `bash reproduce_benchmarks.sh`. Of course, you can also just re-run single scripts by manually calling the requisite R or Python script.

Please note that you do not need to re-run the benchmarks in order to reproduce figures and tables, since the benchmark results are included with the data.

Reproducing figures

bash reproduce_figures.sh

Figures can be reproduced without re-running any benchmarks, as the requisite benchmark results are included in the data folder obtainable from Kaggle as above.

Reproducing tables

bash reproduce_tables.sh

Tables can be reproduced without re-running any benchmarks, as the requisite benchmark results are included in the data folder obtainable from Kaggle as above.

Re-running everything

To re-run everything, simply call `bash reproduce_all.sh`. This will run, in order, the setup script, the data creation script, the benchmarks, miscellaneous tasks, and the reproduction of all plots and tables.
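
If you only want some of these stages, for example skipping data recreation, the same idea of running scripts in order can be sketched as follows. This is an illustrative sketch and not the contents of `reproduce_all.sh`; the function name `run_steps` is hypothetical.

```shell
# Hypothetical helper (not part of this repository): run the given scripts
# in order and stop at the first one that fails.
run_steps() {
  for step in "$@"; do
    echo "Running ${step}"
    bash "${step}" || {
      echo "Step failed: ${step}" >&2
      return 1
    }
  done
}
```

For instance, `run_steps setup.sh reproduce_figures.sh reproduce_tables.sh` would run only the setup, figure, and table scripts named in this guide, assuming it is called from the repository root.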

In case of any problems, feel free to open an issue or email us at [email protected].
