C-EDGE and PGA

Code for the paper: "Discovering Essential Multiple Gene Effects through Large Scale Optimization: an Application to Human Cancer Metabolism"

Initial settings

Download the COBRA toolbox for MATLAB from http://opencobra.github.io/ and add the local COBRA folder to the MATLAB path with

addpath(genpath('path_to_COBRA_toolbox'));

The code is fully parallelised and requires the Parallel Computing Toolbox for MATLAB.
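Before starting a long run, it can help to confirm that both requirements are visible to MATLAB. The following is a minimal sketch, not part of the repository: it assumes that finding optimizeCbModel (a core COBRA function) on the path means COBRA is installed, and that finding parpool means the parallel toolbox is available.

```matlab
% Illustrative sanity check (not part of the repository).
% Assumption: optimizeCbModel on the path implies the COBRA toolbox is set up.
hasCobra = ~isempty(which('optimizeCbModel'));
% Assumption: parpool on the path implies the parallel toolbox is installed.
hasParallel = ~isempty(which('parpool'));

if ~hasCobra
    warning('COBRA toolbox not found on the path; check the addpath call.');
end
if ~hasParallel
    warning('parpool not found; the parallel toolbox seems to be missing.');
end
```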

Multi-objective optimization of gene expression

Load the model (e.g. the one with biomass and phosphoglycerate dehydrogenase set as objectives)

load('recon2_merged_bio_PHGDH.mat')

To start the optimization, run

NUMBER_OF_CORES = 4  % please change depending on the number of cores available
RUN(128,384,NUMBER_OF_CORES)

The default population has 128 individuals, and 384 populations (generations) are produced by default. We suggest keeping this 1:3 ratio between population size and number of generations. This runs the optimization with biomass as the first objective and PHGDH as the second objective. To change the objective reactions, modify the following variables:
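When changing the population size, the suggested 1:3 ratio can be kept by deriving the second argument from the first. This is a small sketch: the variable names POPULATION_SIZE and NUMBER_OF_GENERATIONS are illustrative, and the argument order follows the RUN(128,384,NUMBER_OF_CORES) call above. The call is guarded so that the snippet also runs outside the repository folder.

```matlab
% Illustrative helper variables (names are not from the repository).
POPULATION_SIZE = 128;                          % individuals per population
NUMBER_OF_GENERATIONS = 3 * POPULATION_SIZE;    % keeps the suggested 1:3 ratio
NUMBER_OF_CORES = 4;                            % change to match your machine

% Only call RUN when RUN.m is actually on the path.
if exist('RUN', 'file') == 2
    RUN(POPULATION_SIZE, NUMBER_OF_GENERATIONS, NUMBER_OF_CORES);
else
    fprintf('RUN.m not found; would call RUN(%d,%d,%d)\n', ...
        POPULATION_SIZE, NUMBER_OF_GENERATIONS, NUMBER_OF_CORES);
end
```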

a) fbarecon.f selects the first objective (default: biomass)

b) fbarecon.g selects the second objective (default: phosphoglycerate dehydrogenase)
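As a rough illustration of changing an objective, the sketch below works on a toy structure rather than the real model. It assumes that f and g are 0/1 indicator vectors with one entry per reaction (as the c vector is in COBRA models); this is an assumption, so inspect fbarecon.f and fbarecon.g before adapting it. The reaction names 'biomass_reaction' and 'PHGDH' in the toy model are placeholders.

```matlab
% Toy stand-in for fbarecon (assumption: f and g are indicator vectors).
toy.rxns = {'biomass_reaction'; 'PHGDH'; 'EX_succ(e)'};  % placeholder names
toy.f = [1; 0; 0];    % first objective: biomass
toy.g = [0; 1; 0];    % second objective: PHGDH

% Switch the second objective to another reaction, selected by name.
newSecondObjective = 'EX_succ(e)';
ix = find(strcmp(toy.rxns, newSecondObjective));
if numel(ix) ~= 1
    error('Reaction %s not found exactly once.', newSecondObjective);
end
toy.g = zeros(numel(toy.rxns), 1);   % clear the previous objective
toy.g(ix) = 1;                       % mark the new one
```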

After the optimization, append_and_plot_solutions.m computes the Pareto front. The file non_dominated.mat contains all the Pareto-optimal points, while others.mat contains all the remaining points. The first two columns hold the values of the two objective functions, the 4th column is the number of the generation (that is, the file solutionX) in which the solution was found, and the 5th column is the position of the solution within that generation.
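To make the notion of a Pareto-optimal (non-dominated) point concrete, here is a minimal sketch on made-up data. It assumes both objectives are maximised and uses a simple pairwise comparison; it is not the repository's non_domination.m, only an illustration of the idea.

```matlab
% Made-up objective values: one row per solution, one column per objective.
P = [1 5; 2 4; 3 3; 2 2; 1 1; 3 1];
n = size(P, 1);
isNonDominated = true(n, 1);

% Assumption: both objectives are maximised.
% Point i is dominated if some other point j is at least as good in
% both objectives and strictly better in at least one.
for i = 1:n
    for j = 1:n
        if j ~= i && all(P(j,:) >= P(i,:)) && any(P(j,:) > P(i,:))
            isNonDominated(i) = false;
            break
        end
    end
end

front  = P(isNonDominated, :);    % analogous to the points in non_dominated.mat
others = P(~isNonDominated, :);   % analogous to the points in others.mat
```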

Finally, plot_and_export_color.m plots the final version of the Pareto front, and statistics_on_genes.m generates the clustering and multidimensional scaling plots.

To find the index of a reaction and then change its lower and upper bounds (fbarecon.lb and fbarecon.ub), type

n = find(ismember(fbarecon.rxns, 'EX_succ(e)')==1)  % e.g. for succinate
fbarecon.lb(n) = NEW_LOWER_BOUND
fbarecon.ub(n) = NEW_UPPER_BOUND
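The same lookup can be wrapped with a couple of checks so that a misspelled reaction name or inverted bounds fail early. The sketch below uses a toy structure with made-up bounds in place of fbarecon; the reaction list and numeric values are illustrative only.

```matlab
% Toy stand-in for fbarecon with made-up bounds.
toy.rxns = {'EX_glc(e)'; 'EX_succ(e)'; 'biomass_reaction'};
toy.lb   = [-10; -1; 0];
toy.ub   = [1000; 1000; 1000];

rxnName = 'EX_succ(e)';          % e.g. for succinate
NEW_LOWER_BOUND = 0;             % illustrative values
NEW_UPPER_BOUND = 5;

n = find(strcmp(toy.rxns, rxnName));
if isempty(n)
    error('Reaction %s not found in the model.', rxnName);
end
if NEW_LOWER_BOUND > NEW_UPPER_BOUND
    error('Lower bound must not exceed upper bound.');
end
toy.lb(n) = NEW_LOWER_BOUND;
toy.ub(n) = NEW_UPPER_BOUND;
```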

C-EDGE algorithm

To run C-EDGE_1 and C-EDGE_2, run the compute_EDGE.m MATLAB script. It loads the recon2_merged_bio_PHGDH.mat metabolic model and runs the C-EDGE algorithm. The script saves the resulting vector as "c-edge_scores.mat", which reports the C-EDGE score of each gene in the model.
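Once the scores are saved, a quick way to inspect them is to sort the genes by score. This is a sketch under assumptions: the name of the variable stored inside c-edge_scores.mat is not documented here, so the code simply takes the first variable in the file and expects it to be numeric, and it falls back to made-up scores when the file is absent. Sorting from largest to smallest is purely illustrative.

```matlab
% Assumption: the first variable in the file is the numeric score vector.
if exist('c-edge_scores.mat', 'file') == 2
    S = load('c-edge_scores.mat');
    names = fieldnames(S);
    scores = S.(names{1});
else
    scores = [0.2 0 0.9 0.1];     % made-up scores for illustration
end
scores = scores(:);               % force a column vector

% Gene indices ordered from the largest score to the smallest.
[sortedScores, order] = sort(scores, 'descend');
topN = min(3, numel(order));
for r = 1:topN
    fprintf('rank %d: gene index %d, score %g\n', r, order(r), sortedScores(r));
end
```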

To run C-EDGE_k in general, run the RUN_CEDGEk.m MATLAB script in the subfolder C-EDGE_k:

RUN_CEDGEk(4) % please change depending on the number of cores available

In this case, the single-objective PGA (soPGA) is run. We are interested in finding the largest possible subsets of k genes for which computing the EDGE gives interesting results, i.e. for which EDGE(k) differs from the EDGE(k-1) of every (k-1)-element subset of those k genes. For this reason, genetic_operator.m maximises the objective EDGE_diff = abs(EDGE_k - EDGE_kminus1).
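The comparison between a k-gene subset and its (k-1)-element subsets can be sketched on toy data as follows. Everything here is illustrative: toyEdge is a hypothetical stand-in for the real EDGE computation, the weights are made up, and taking the minimum over all subsets is one possible way to require a difference from every EDGE(k-1); it is not necessarily how genetic_operator.m aggregates the values.

```matlab
% Toy illustration only; toyEdge is NOT the repository's EDGE computation.
w = [0.1 0.9 0.2 0.3 0.5 0.4 0.8];    % made-up per-gene weights
toyEdge = @(S) prod(w(S));             % hypothetical score of a gene subset S

genes = [2 5 7];                       % candidate subset, here k = 3
k = numel(genes);
EDGE_k = toyEdge(genes);

% Enumerate all (k-1)-element subsets, one per row.
subsets = nchoosek(genes, k - 1);
EDGE_kminus1 = zeros(size(subsets, 1), 1);
for i = 1:size(subsets, 1)
    EDGE_kminus1(i) = toyEdge(subsets(i, :));
end

% Assumption: use the smallest gap, so the value is large only when
% EDGE_k differs from the EDGE of every (k-1)-element subset.
EDGE_diff = min(abs(EDGE_k - EDGE_kminus1));
```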


claudioangione/PGA_and_C-EDGE
