Characterization of Core Components
This notebook demonstrates the tutorial workflow for MultiModulon analysis.
[1]:
# Import required libraries
from multimodulon import MultiModulon
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
# Set display options
pd.set_option('display.max_columns', None)
pd.set_option('display.max_rows', 50)
Step 1: Load MultiModulon object
Load the saved MultiModulon object from Tutorial_1_Create_multiModulon_object.ipynb
[2]:
multiModulon = MultiModulon.load_json_multimodulon("./multiModulon_E_coli_comparison_demo.json.gz")
[3]:
multiModulon.species_palette
[3]:
{'BL21': '#B24C4C', 'MG1655': '#4C6AB2', 'W3110': '#88B24C'}
Step 2: Visualize the core components (Available Functions)
Visualize the core and unique components by view_iModulon_weights and view_core_iModulon_weights
[4]:
multiModulon.view_iModulon_weights(
species="MG1655",
component="Core_1",
fig_size=(8, 5),
font_path = "/usr/share/fonts/truetype/msttcorefonts/Arial.ttf",
show_COG=True,
show_gene_names=True
)
[5]:
multiModulon.view_iModulon_genes(
species="MG1655",
component="Core_1"
)
[5]:
| M_weight | Preferred_name | gene_name | accession | start | end | strand | old_locus_tag | gene_product | ncbi_protein | seed_ortholog | evalue | score | eggNOG_OGs | max_annot_lvl | COG_category | Description | GOs | EC | KEGG_ko | KEGG_Pathway | KEGG_Module | KEGG_Reaction | KEGG_rclass | BRITE | KEGG_TC | CAZy | BiGG_Reaction | PFAMs | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| b0583 | 0.128547 | entD | entD | U00096.3 | 609459 | 610079 | - | NaN | phosphopantetheinyl transferase EntD | AAC73684.2 | 316407.85674704 | 1.240000e-150 | 423.0 | COG2977@1|root,COG2977@2|Bacteria,1MZK2@1224|P... | 2|Bacteria | H | Involved in the biosynthesis of the siderophor... | GO:0003674,GO:0003824,GO:0005575,GO:0005622,GO... | 6.3.2.14 | ko:K02362 | ko01053,ko01110,ko01130,map01053,map01110,map0... | - | R07644 | RC00162,RC03046 | ko00000,ko00001,ko01000 | - | - | iAPECO1_1312.APECO1_1466,iECO111_1330.ECO111_0... | ACPS |
| b0584 | 0.170337 | fepA | fepA | U00096.3 | 610254 | 612494 | - | NaN | ferric enterobactin outer membrane transporter | AAC73685.1 | 316407.4062212 | 0.000000e+00 | 1519.0 | COG1629@1|root,COG4771@2|Bacteria,1MUC1@1224|P... | 2|Bacteria | P | This protein is involved in the initial step o... | GO:0000041,GO:0003674,GO:0005215,GO:0005488,GO... | - | ko:K16089,ko:K19611 | ko02020,map02020 | - | - | - | ko00000,ko00001,ko02000 | 1.B.14.1,1.B.14.1.22,1.B.14.1.3,1.B.14.1.5,1.B... | - | iECABU_c1320.ECABU_c06330,iECO103_1326.ECO103_... | Plug,TonB_dep_Rec |
| b0585 | 0.163971 | fes | fes | U00096.3 | 612737 | 613939 | + | NaN | ferric enterobactin esterase | AAC73686.2 | 511145.b0585 | 1.160000e-313 | 851.0 | COG2382@1|root,COG2382@2|Bacteria,1MXJN@1224|P... | 2|Bacteria | P | esterase | GO:0003674,GO:0003824,GO:0005575,GO:0005622,GO... | - | ko:K07214 | - | - | - | - | ko00000 | - | - | iECNA114_1301.ECNA114_0527,iSbBS512_1146.SbBS5... | DUF3327,Esterase |
| b4511 | 0.163414 | ybdZ | ybdZ | U00096.3 | 613942 | 614160 | + | NaN | enterobactin biosynthesis protein YbdZ | ABD18641.1 | 316407.85674706 | 5.410000e-51 | 160.0 | COG3251@1|root,COG3251@2|Bacteria,1N87J@1224|P... | 2|Bacteria | S | Involved in the biosynthesis of the siderophor... | GO:0005575,GO:0005622,GO:0005623,GO:0005737,GO... | - | ko:K05375,ko:K07214 | ko00261,ko01130,map00261,map01130 | M00736 | R10880 | RC00064,RC00141,RC03296,RC03297,RC03298 | ko00000,ko00001,ko00002 | - | - | - | MbtH |
| b0586 | 0.151540 | entF | entF | U00096.3 | 614157 | 618038 | + | NaN | apo-serine activating enzyme | AAC73687.1 | 316407.85674707 | 0.000000e+00 | 2562.0 | COG1020@1|root,COG3319@1|root,COG1020@2|Bacter... | 2|Bacteria | Q | Activates the carboxylate group of L-serine vi... | GO:0003674,GO:0003824,GO:0005488,GO:0005575,GO... | 6.3.2.14 | ko:K02364 | ko01053,ko01110,ko01130,map01053,map01110,map0... | - | R07644 | RC00162,RC03046 | ko00000,ko00001,ko01000,ko01008 | - | - | iECO111_1330.ECO111_0616,iECO26_1355.ECO26_066... | AMP-binding,AMP-binding_C,Condensation,PP-bind... |
| b0588 | 0.094940 | fepC | fepC | U00096.3 | 619384 | 620199 | - | NaN | ferric enterobactin ABC transporter ATP bindin... | AAC73689.1 | 316407.85674709 | 7.490000e-195 | 540.0 | COG1120@1|root,COG1120@2|Bacteria,1MUNG@1224|P... | 2|Bacteria | P | Probably responsible for energy coupling to th... | GO:0000041,GO:0000166,GO:0003674,GO:0005215,GO... | 3.6.3.34 | ko:K02013 | ko02010,map02010 | M00240 | - | - | ko00000,ko00001,ko00002,ko01000,ko02000 | 3.A.1.14 | - | iAF1260.b0588,iB21_1397.B21_00544,iBWG_1329.BW... | ABC_tran |
| b0589 | 0.081723 | fepG | fepG | U00096.3 | 620196 | 621188 | - | NaN | ferric enterobactin ABC transporter membrane s... | AAC73690.1 | 316407.85674710 | 3.590000e-218 | 604.0 | COG4779@1|root,COG4779@2|Bacteria,1NXD0@1224|P... | 2|Bacteria | P | Belongs to the binding-protein-dependent trans... | GO:0000041,GO:0003674,GO:0005215,GO:0005575,GO... | - | ko:K02015 | ko02010,map02010 | M00240 | - | - | ko00000,ko00001,ko00002,ko02000 | 3.A.1.14 | - | iAPECO1_1312.APECO1_1460,iEC55989_1330.EC55989... | FecCD |
| b0590 | 0.093200 | fepD | fepD | U00096.3 | 621185 | 622189 | - | NaN | ferric enterobactin ABC transporter membrane s... | AAC73691.1 | 511145.b0590 | 5.450000e-215 | 596.0 | COG0609@1|root,COG0609@2|Bacteria,1MYYG@1224|P... | 2|Bacteria | P | Belongs to the binding-protein-dependent trans... | GO:0000041,GO:0003674,GO:0005215,GO:0005575,GO... | - | ko:K02015 | ko02010,map02010 | M00240 | - | - | ko00000,ko00001,ko00002,ko02000 | 3.A.1.14 | - | iB21_1397.B21_04117,iEC042_1314.EC042_4777,iEC... | FecCD |
| b0591 | 0.096287 | entS | entS | U00096.3 | 622300 | 623550 | + | NaN | enterobactin exporter EntS | AAC73692.1 | 316407.85674712 | 9.810000e-280 | 767.0 | COG0477@1|root,COG0477@2|Bacteria,1MXZ3@1224|P... | 2|Bacteria | EGP | Exports the siderophore enterobactin out of th... | GO:0005575,GO:0005623,GO:0005886,GO:0006810,GO... | - | ko:K08225 | - | - | - | - | ko00000,ko02000 | 2.A.1.38 | - | iECABU_c1320.ECABU_c06410,iSBO_1134.SBO_0452 | MFS_1,MFS_3 |
| b0592 | 0.086034 | fepB | fepB | U00096.3 | 623554 | 624510 | - | NaN | ferric enterobactin ABC transporter periplasmi... | AAC73693.1 | 316407.85674713 | 2.450000e-219 | 606.0 | COG4592@1|root,COG4592@2|Bacteria,1NPCN@1224|P... | 2|Bacteria | P | part of the binding-protein- dependent transpo... | GO:0000041,GO:0005575,GO:0005623,GO:0006810,GO... | - | ko:K02016 | ko02010,map02010 | M00240 | - | - | ko00000,ko00001,ko00002,ko02000 | 3.A.1.14 | - | iECNA114_1301.ECNA114_0535 | Peripla_BP_2 |
| b0593 | 0.179651 | entC | entC | U00096.3 | 624885 | 626060 | + | NaN | isochorismate synthase EntC | AAC73694.1 | 316407.85674714 | 6.640000e-279 | 763.0 | COG1169@1|root,COG1169@2|Bacteria,1MVB7@1224|P... | 2|Bacteria | HQ | Involved in the biosynthesis of the siderophor... | GO:0000162,GO:0000287,GO:0003674,GO:0003824,GO... | 5.4.4.2 | ko:K01851,ko:K02361,ko:K02552 | ko00130,ko01053,ko01100,ko01110,ko01130,map001... | M00116 | R01717 | RC00588 | ko00000,ko00001,ko00002,ko01000 | - | - | iEC55989_1330.EC55989_0585,iECABU_c1320.ECABU_... | Chorismate_bind |
| b0594 | 0.173730 | entE | entE | U00096.3 | 626070 | 627680 | + | NaN | 2%2C3-dihydroxybenzoate-AMP ligase | AAC73695.1 | 316407.85674715 | 0.000000e+00 | 1068.0 | COG1021@1|root,COG1021@2|Bacteria,1NSN8@1224|P... | 2|Bacteria | Q | Involved in the biosynthesis of the siderophor... | GO:0003674,GO:0003824,GO:0005575,GO:0005622,GO... | 2.7.7.58,6.3.2.14 | ko:K02363,ko:K12238 | ko01053,ko01110,ko01130,map01053,map01110,map0... | - | R07644 | RC00162,RC03046 | ko00000,ko00001,ko01000,ko01008 | - | - | iSF_1195.SF0508,iSSON_1240.SSON_0545,iS_1188.S... | AMP-binding,AMP-binding_C |
| b0595 | 0.157571 | entB | entB | U00096.3 | 627694 | 628551 | + | NaN | enterobactin synthase component B | AAC73696.1 | 316407.85674716 | 3.090000e-210 | 580.0 | COG1535@1|root,COG3433@1|root,COG1535@2|Bacter... | 2|Bacteria | Q | Involved in the biosynthesis of the siderophor... | GO:0000287,GO:0003674,GO:0003824,GO:0004463,GO... | 3.3.2.1,6.3.2.14 | ko:K01252 | ko01053,ko01110,ko01130,map01053,map01110,map0... | - | R03037,R07644 | RC00162,RC00350,RC02148,RC03046 | ko00000,ko00001,ko01000,ko01008 | - | - | iSFV_1184.SFV_0543 | Isochorismatase,PP-binding |
| b0596 | 0.146970 | entA | entA | U00096.3 | 628551 | 629297 | + | NaN | 2%2C3-dihydro-2%2C3-dihydroxybenzoate dehydrog... | AAC73697.1 | 316407.85674717 | 1.270000e-170 | 477.0 | COG1028@1|root,COG1028@2|Bacteria,1MUD2@1224|P... | 2|Bacteria | IQ | Involved in the biosynthesis of the siderophor... | GO:0003674,GO:0003824,GO:0005488,GO:0005515,GO... | 1.3.1.28 | ko:K00216 | ko01053,ko01110,ko01130,map01053,map01110,map0... | - | R01505 | RC00534 | ko00000,ko00001,ko01000 | - | - | iECNA114_1301.ECNA114_0539,iECSF_1327.ECSF_053... | adh_short_C2 |
| b0597 | 0.146853 | entH | entH | U00096.3 | 629300 | 629713 | + | NaN | proofreading thioesterase in enterobactin bios... | AAC73698.1 | 316407.85674718 | 5.720000e-99 | 286.0 | COG2050@1|root,COG2050@2|Bacteria,1MYE0@1224|P... | 2|Bacteria | Q | Required for optimal enterobactin synthesis. A... | GO:0003674,GO:0003824,GO:0005488,GO:0005515,GO... | 3.1.2.28 | ko:K02614,ko:K19222 | ko00130,ko00360,ko01100,ko01110,map00130,map00... | M00116 | R07262,R09840 | RC00004,RC00014,RC00174 | ko00000,ko00001,ko00002,ko01000 | - | - | - | 4HBT,Hydrolase_3 |
| b0683 | -0.106083 | fur | fur | U00096.3 | 710200 | 710646 | - | NaN | DNA-binding transcriptional dual regulator Fur | AAC73777.1 | 316407.1651290 | 2.650000e-107 | 308.0 | COG0735@1|root,COG0735@2|Bacteria,1RDWJ@1224|P... | 2|Bacteria | K | Acts as a global negative controlling element,... | GO:0000976,GO:0001067,GO:0001130,GO:0001216,GO... | - | ko:K03711,ko:K09823 | ko02024,map02024 | - | - | - | ko00000,ko00001,ko03000 | - | - | - | FUR |
| b0803 | 0.076955 | ybiI | ybiI | U00096.3 | 838190 | 838456 | - | NaN | zinc finger domain-containing protein YbiI | AAC73890.1 | 316407.4062364 | 4.120000e-60 | 184.0 | COG1734@1|root,COG1734@2|Bacteria,1MZIB@1224|P... | 2|Bacteria | T | zinc ion binding | - | - | ko:K06204 | ko02026,map02026 | - | - | - | ko00000,ko00001,ko03000,ko03009,ko03021 | - | - | - | zf-dskA_traR |
| b0804 | 0.135791 | ybiX | ybiX | U00096.3 | 838530 | 839207 | - | NaN | PKHD-type hydroxylase YbiX | AAC73891.2 | 316407.85674765 | 1.830000e-163 | 457.0 | COG3128@1|root,COG3128@2|Bacteria,1MUI7@1224|P... | 2|Bacteria | S | PKHD-type hydroxylase YbiX | GO:0006873,GO:0006875,GO:0006879,GO:0006950,GO... | - | ko:K07336 | - | - | - | - | ko00000,ko01000 | - | - | - | 2OG-FeII_Oxy_3 |
| b0805 | 0.169354 | fiu | fiu | U00096.3 | 839249 | 841531 | - | NaN | Fe catecholate outer membrane transporter Fiu | AAC73892.1 | 316407.4062366 | 0.000000e+00 | 1514.0 | COG4774@1|root,COG4774@2|Bacteria,1MV0X@1224|P... | 2|Bacteria | P | Involved in the active transport across the ou... | GO:0005575,GO:0005623,GO:0006810,GO:0008150,GO... | - | ko:K16090 | - | - | - | - | ko00000,ko02000 | 1.B.14.1.11 | - | iLF82_1304.LF82_0668,iNRG857_1313.NRG857_03600... | Plug,TonB_dep_Rec |
| b1102 | 0.130277 | fhuE | fhuE | U00096.3 | 1159362 | 1161551 | - | NaN | ferric coprogen/ferric rhodotorulic acid outer... | AAC74186.1 | 316407.1651542 | 0.000000e+00 | 1473.0 | COG4773@1|root,COG4773@2|Bacteria,1MW5E@1224|P... | 2|Bacteria | P | Outer-membrane receptor for Fe(III)-coprogen, ... | GO:0005575,GO:0005623,GO:0009279,GO:0016020,GO... | - | ko:K16088 | - | - | - | - | ko00000,ko02000 | 1.B.14.1.10,1.B.14.1.3,1.B.14.1.8 | - | iSDY_1059.SDY_2048 | Plug,STN,TonB_dep_Rec |
| b1452 | 0.101367 | yncE | yncE | U00096.3 | 1523307 | 1524368 | + | NaN | PQQ-like domain-containing protein YncE | AAC74534.1 | 316407.85674972 | 2.170000e-226 | 627.0 | COG3391@1|root,COG3391@2|Bacteria,1PDS2@1224|P... | 2|Bacteria | S | DNA binding | GO:0003674,GO:0003676,GO:0003677,GO:0005488,GO... | - | - | - | - | - | - | - | - | - | - | PQQ_2 |
| b1656 | -0.107184 | sodB | sodB | U00096.3 | 1735378 | 1735959 | + | NaN | superoxide dismutase (Fe) | AAC74728.1 | 316407.1742729 | 8.250000e-142 | 399.0 | COG0605@1|root,COG0605@2|Bacteria,1MVW2@1224|P... | 2|Bacteria | C | radicals which are normally produced within th... | GO:0000302,GO:0000303,GO:0000305,GO:0003674,GO... | 1.15.1.1,3.1.11.6 | ko:K03601,ko:K04564 | ko03430,ko04013,ko04068,ko04146,ko04211,ko0421... | - | - | - | ko00000,ko00001,ko01000,ko03400 | - | - | iAF1260.b1656,iAPECO1_1312.APECO1_736,iB21_139... | Sod_Fe_C,Sod_Fe_N |
| b1705 | 0.102622 | NaN | ydiE | U00096.3 | 1789613 | 1789804 | + | NaN | PF10636 family protein YdiE | AAC74775.1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| b2155 | 0.168631 | cirA | cirA | U00096.3 | 2244778 | 2246769 | - | NaN | Fe-catecholate outer membrane transporter CirA | AAC75216.1 | 511145.b2155 | 0.000000e+00 | 1360.0 | COG1629@1|root,COG4771@2|Bacteria,1MUC1@1224|P... | 2|Bacteria | P | Colicin I receptor | GO:0000041,GO:0003674,GO:0005215,GO:0005488,GO... | - | ko:K16089,ko:K19611 | ko02020,map02020 | - | - | - | ko00000,ko00001,ko02000 | 1.B.14.1,1.B.14.1.22,1.B.14.1.3,1.B.14.1.5,1.B... | - | iECO103_1326.ECO103_2630,iSDY_1059.SDY_0512 | Plug,TonB_dep_Rec |
| b2673 | 0.116194 | nrdH | nrdH | U00096.3 | 2800723 | 2800968 | + | NaN | glutaredoxin-like protein NrdH | AAC75720.1 | 316407.85675518 | 1.570000e-54 | 170.0 | COG0695@1|root,COG0695@2|Bacteria,1N7YD@1224|P... | 2|Bacteria | O | Electron transport system for the ribonucleoti... | GO:0003674,GO:0003824,GO:0006091,GO:0008150,GO... | - | ko:K06191 | - | - | - | - | ko00000 | - | - | - | Glutaredoxin |
| b2674 | 0.110272 | nrdI | nrdI | U00096.3 | 2800965 | 2801375 | + | NaN | dimanganese-tyrosyl radical cofactor maintenan... | AAC75721.1 | 316407.85675519 | 5.970000e-96 | 279.0 | COG1780@1|root,COG1780@2|Bacteria,1RGYF@1224|P... | 2|Bacteria | F | Probably involved in ribonucleotide reductase ... | GO:0000166,GO:0003674,GO:0005488,GO:0006464,GO... | - | ko:K03647 | - | - | - | - | ko00000 | - | - | - | Flavodoxin_NdrI |
| b2675 | 0.108159 | nrdE | nrdE | U00096.3 | 2801348 | 2803492 | + | NaN | ribonucleoside-diphosphate reductase 2 subunit... | AAC75722.1 | 316407.85675520 | 0.000000e+00 | 1424.0 | COG0209@1|root,COG0209@2|Bacteria,1MUJ8@1224|P... | 2|Bacteria | F | Provides the precursors necessary for DNA synt... | GO:0000166,GO:0003674,GO:0003824,GO:0004748,GO... | 1.17.4.1 | ko:K00525 | ko00230,ko00240,ko01100,map00230,map00240,map0... | M00053 | R02017,R02018,R02019,R02024 | RC00613 | ko00000,ko00001,ko00002,ko01000,ko03400 | - | - | iAPECO1_1312.APECO1_3846,iSDY_1059.SDY_2428,iY... | RNR_N,Ribonuc_red_lgC,Ribonuc_red_lgN |
| b2676 | 0.100969 | nrdF | nrdF | U00096.3 | 2803502 | 2804461 | + | NaN | ribonucleoside-diphosphate reductase 2 subunit... | AAC75723.1 | 316407.1800064 | 3.560000e-233 | 641.0 | COG0208@1|root,COG0208@2|Bacteria,1MWUS@1224|P... | 2|Bacteria | F | Provides the precursors necessary for DNA synt... | GO:0003674,GO:0005488,GO:0005506,GO:0005515,GO... | 1.17.4.1 | ko:K00526 | ko00230,ko00240,ko01100,map00230,map00240,map0... | M00053 | R02017,R02018,R02019,R02024 | RC00613 | ko00000,ko00001,ko00002,ko01000,ko03400 | - | - | iAPECO1_1312.APECO1_4325,iECNA114_1301.ECNA114... | Ribonuc_red_sm |
| b4291 | 0.081580 | fecA | fecA | U00096.3 | 4514353 | 4516677 | - | NaN | ferric citrate outer membrane transporter | AAC77247.1 | 316407.85677032 | 0.000000e+00 | 1573.0 | COG4772@1|root,COG4772@2|Bacteria,1MWDG@1224|P... | 2|Bacteria | P | receptor | GO:0000041,GO:0003674,GO:0005215,GO:0005381,GO... | - | ko:K16091 | - | - | - | - | ko00000,ko02000 | 1.B.14.1.14 | - | iEC042_1314.EC042_4780 | Plug,STN,TonB_dep_Rec |
| b4292 | 0.098783 | fecR | fecR | U00096.3 | 4516764 | 4517717 | - | NaN | ferric citrate regulator FecR | AAC77248.1 | 316407.85677033 | 2.210000e-226 | 624.0 | COG3712@1|root,COG3712@2|Bacteria,1N28G@1224|P... | 2|Bacteria | PT | Fe2 -dicitrate sensor, membrane component | GO:0005575,GO:0005623,GO:0042597,GO:0044464 | - | ko:K07165 | - | - | - | - | ko00000 | - | - | - | DUF4880,DUF4974,FecR |
| b4293 | 0.096825 | fecI | fecI | U00096.3 | 4517714 | 4518235 | - | NaN | RNA polymerase sigma factor FecI | AAC77249.1 | 316407.85677034 | 4.510000e-118 | 338.0 | COG1595@1|root,COG1595@2|Bacteria,1RHRW@1224|P... | 2|Bacteria | K | Belongs to the sigma-70 factor family. ECF sub... | - | - | ko:K03088 | - | - | - | - | ko00000,ko03021 | - | - | - | Sigma70_r2,Sigma70_r4_2 |
| b4367 | 0.098091 | fhuF | fhuF | U00096.3 | 4604875 | 4605663 | - | NaN | ferric-siderophore reductase FhuF | AAC77323.1 | 316407.85677107 | 5.510000e-197 | 545.0 | COG4114@1|root,COG4114@2|Bacteria,1MWSR@1224|P... | 2|Bacteria | S | ferric iron reductase | GO:0003674,GO:0005488,GO:0005575,GO:0005622,GO... | - | ko:K13255 | - | - | - | - | ko00000 | - | - | iE2348C_1286.E2348C_4666,iECH74115_1262.ECH741... | FhuF,FhuF_C |
| b4567 | 0.192312 | yjjZ | yjjZ | U00096.3 | 4605804 | 4606040 | + | NaN | DUF1435 domain-containing protein YjjZ | ABD18719.1 | 511145.b4567 | 6.590000e-48 | 153.0 | 2CF73@1|root,32RSY@2|Bacteria,1N033@1224|Prote... | 2|Bacteria | S | Protein of unknown function (DUF1435) | - | - | - | - | - | - | - | - | - | - | - | DUF1435 |
[6]:
multiModulon['MG1655'].presence_matrix
[6]:
| Core_1 | Core_2 | Core_3 | Core_4 | Core_5 | Core_6 | Core_7 | Core_8 | Core_9 | Core_10 | Core_11 | Core_12 | Core_13 | Core_14 | Core_15 | Core_16 | Core_17 | Core_18 | Core_19 | Core_20 | Core_21 | Core_22 | Core_23 | Unique_1 | Unique_2 | Unique_3 | Unique_4 | Unique_5 | Unique_6 | Unique_7 | Unique_8 | Unique_9 | Unique_10 | Unique_11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| b0001 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b0002 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b0003 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b0004 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b0005 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| b4400 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b4401 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b4402 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b4824 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| b4403 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
4305 rows × 34 columns
[7]:
multiModulon['MG1655'].sample_sheet
[7]:
| Run | ReleaseDate | LoadDate | spots | bases | spots_with_mates | avgLength | size_MB | AssemblyName | download_path | LibraryName | LibraryStrategy | LibrarySelection | LibrarySource | LibraryLayout | InsertSize | InsertDev | Platform | Model | SRAStudy | BioProject | Study_Pubmed_id | ProjectID | Sample | BioSample | SampleType | TaxID | ScientificName | SampleName | g1k_pop_code | source | g1k_analysis_group | Subject_ID | Sex | Disease | Tumor | Affection_Status | Analyte_Type | Histological_Type | Body_Site | CenterName | Submission | dbgap_study_accession | Consent | RunHash | ReadHash | R1 | R2 | Strain | biosample_genotype | reference_sequence | project | condition | reference_condition | strain_description | base_media | carbon_source | nitrogen_source | aerobicity | supplement | temperature | pH | growth_phase | OD | additional_notes | knockout | biological_replicate | technical replicates | sampleID | culture_type | Swift | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SRX8684872 | SRR12170059 | 2022-09-25 23:05:26 | 2022-09-25 23:01:55 | 5516734.0 | 1.641596e+09 | 5516734.0 | 297.0 | 407.0 | NC_000913 | https://sra-downloadb.be-md.ncbi.nlm.nih.gov/s... | MG1655_F133_MID_1 | RNA-Seq | cDNA | TRANSCRIPTOMIC | PAIRED | 0.0 | 0.0 | ILLUMINA | Illumina HiSeq 4000 | SRP270849 | PRJNA644668 | NaN | 644668.0 | SRS6963815 | SAMN15470650 | simple | 511145.0 | Escherichia coli str. K-12 substr. MG1655 | MG1655_F133_MID | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | UNIVERSITY OF CALIFORNIA- SAN DIEGO | SRA1096109 | NaN | public | A2811478FA75C0725C9C8ABAC24F6CE5 | 2D157C081C3A732986637377ACA14205 | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | MG1655 | F133_MID | U00096.3 | multi_ALE | F133_MID | WT | F133_MID | M9 | glucose | NH4Cl | Aerobic | NaN | 37 | NaN | NaN | NaN | NaN | WT | 1 | 1 | NaN | Batch | 0 |
| SRX8684873 | SRR12170058 | 2022-09-25 23:05:26 | 2022-09-25 23:02:21 | 6904986.0 | 2.054524e+09 | 6904986.0 | 297.0 | 510.0 | NC_000913 | https://sra-downloadb.be-md.ncbi.nlm.nih.gov/s... | MG1655_F133_MID_2 | RNA-Seq | cDNA | TRANSCRIPTOMIC | PAIRED | 0.0 | 0.0 | ILLUMINA | Illumina HiSeq 4000 | SRP270849 | PRJNA644668 | NaN | 644668.0 | SRS6963815 | SAMN15470650 | simple | 511145.0 | Escherichia coli str. K-12 substr. MG1655 | MG1655_F133_MID | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | UNIVERSITY OF CALIFORNIA- SAN DIEGO | SRA1096109 | NaN | public | 6F1624C35020E2950400EAA843FBE4A1 | B224CA3555520A964DE7D3934CAD3D73 | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | MG1655 | F133_MID | U00096.3 | multi_ALE | F133_MID | WT | F133_MID | M9 | glucose | NH4Cl | Aerobic | NaN | 37 | NaN | NaN | NaN | NaN | WT | 2 | 1 | NaN | Batch | 0 |
| SRX8684874 | SRR12170057 | 2022-09-25 23:07:04 | 2022-09-25 23:04:27 | 16293212.0 | 2.205682e+09 | 14341258.0 | 135.0 | 426.0 | GCA_000005845.2 | https://sra-downloadb.be-md.ncbi.nlm.nih.gov/s... | MG1655_F001_MID_1 | RNA-Seq | cDNA | TRANSCRIPTOMIC | PAIRED | 0.0 | 0.0 | ILLUMINA | Illumina HiSeq 4000 | SRP270849 | PRJNA644668 | NaN | 644668.0 | SRS6963816 | SAMN15470648 | simple | 511145.0 | Escherichia coli str. K-12 substr. MG1655 | MG1655_F001_MID | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | UNIVERSITY OF CALIFORNIA- SAN DIEGO | SRA1096109 | NaN | public | CBDA566CCB1AC327D08A7D29E5D0E5B9 | EBA2468F6BD5786912663646206B0606 | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | MG1655 | F001_MID | U00096.3 | multi_ALE | F001_MID | WT | F001_MID | M9 | glucose | NH4Cl | Aerobic | NaN | 37 | NaN | NaN | NaN | NaN | WT | 1 | 1 | NaN | Batch | 0 |
| SRX8684875 | SRR12170056 | 2022-09-25 23:05:26 | 2022-09-25 23:01:48 | 7318390.0 | 9.946296e+08 | 6495910.0 | 135.0 | 185.0 | GCA_000005845.2 | https://sra-downloadb.be-md.ncbi.nlm.nih.gov/s... | MG1655_F001_MID_2 | RNA-Seq | cDNA | TRANSCRIPTOMIC | PAIRED | 0.0 | 0.0 | ILLUMINA | Illumina HiSeq 4000 | SRP270849 | PRJNA644668 | NaN | 644668.0 | SRS6963816 | SAMN15470648 | simple | 511145.0 | Escherichia coli str. K-12 substr. MG1655 | MG1655_F001_MID | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | UNIVERSITY OF CALIFORNIA- SAN DIEGO | SRA1096109 | NaN | public | 643DCE4D30004666B1915811EC00AB96 | B9AEE80BABB7F175FDE99637F60849F9 | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | MG1655 | F001_MID | U00096.3 | multi_ALE | F001_MID | WT | F001_MID | M9 | glucose | NH4Cl | Aerobic | NaN | 37 | NaN | NaN | NaN | NaN | WT | 2 | 1 | NaN | Batch | 0 |
| SRX8684876 | SRR12170055 | 2020-07-08 01:02:36 | 2020-07-08 01:00:21 | 4835538.0 | 1.450661e+09 | 4835538.0 | 300.0 | 392.0 | NC_000913 | https://sra-downloadb.be-md.ncbi.nlm.nih.gov/s... | MG1655_F057_MID_1 | RNA-Seq | cDNA | TRANSCRIPTOMIC | PAIRED | 0.0 | 0.0 | ILLUMINA | Illumina HiSeq 4000 | SRP270849 | PRJNA644668 | NaN | 644668.0 | SRS6963817 | SAMN15470649 | simple | 511145.0 | Escherichia coli str. K-12 substr. MG1655 | MG1655_F057_MID | NaN | NaN | NaN | NaN | NaN | NaN | no | NaN | NaN | NaN | NaN | UNIVERSITY OF CALIFORNIA- SAN DIEGO | SRA1096109 | NaN | public | 38EF20DF6206A513E0C21E0370DCFCA0 | BD4EAEB626E9C690E0051523F7DFCC1D | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | s3://modulome/multi_ecoli/MG1655/raw_rna/SRX86... | MG1655 | F057_MID | U00096.3 | multi_ALE | F057_MID | WT | F057_MID | M9 | glucose | NH4Cl | Aerobic | NaN | 37 | NaN | NaN | NaN | NaN | WT | 1 | 1 | NaN | Batch | 0 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| p1k_00506 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PAIRED | NaN | NaN | ILLUMINA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | MG1655 | WT | U00096.3 | P1K | m9p | M9_glucose | WT | M9-P | glucose | NH4Cl | Aerobic | NaN | 37 | 7.0 | NaN | NaN | NaN | WT | 2 | 1 | NaN | Batch | True |
| p1k_00507 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PAIRED | NaN | NaN | ILLUMINA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | MG1655 | WT | U00096.3 | P1K | 01xLB_pyruvate | M9_glucose | WT | 01xLB | NaN | NaN | Aerobic | 100uM Pyruvate | 37 | 7.0 | NaN | NaN | NaN | WT | 1 | 1 | NaN | Batch | True |
| p1k_00508 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PAIRED | NaN | NaN | ILLUMINA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | MG1655 | WT | U00096.3 | P1K | 01xLB_pyruvate | M9_glucose | WT | 01xLB | NaN | NaN | Aerobic | 100uM Pyruvate | 37 | 7.0 | NaN | NaN | NaN | WT | 2 | 1 | NaN | Batch | True |
| p1k_00509 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PAIRED | NaN | NaN | ILLUMINA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | MG1655 | WT | U00096.3 | P1K | ZnCl2_LB | M9_glucose | WT | LB | NaN | NaN | Aerobic | 1mM ZnCl2 | 37 | 7.0 | NaN | NaN | NaN | WT | 1 | 1 | NaN | Batch | True |
| p1k_00510 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PAIRED | NaN | NaN | ILLUMINA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | MG1655 | WT | U00096.3 | P1K | ZnCl2_LB | M9_glucose | WT | LB | NaN | NaN | Aerobic | 1mM ZnCl2 | 37 | 7.0 | NaN | NaN | NaN | WT | 2 | 1 | NaN | Batch | True |
69 rows × 71 columns
[8]:
multiModulon.view_iModulon_activities(
species="MG1655",
component="Core_1",
fig_size=(6, 4),
font_path = "/usr/share/fonts/truetype/msttcorefonts/Arial.ttf"
)
[9]:
multiModulon.compare_core_iModulon_activity(
component='Core_1',
fig_size=(4, 3),
species_in_comparison=['MG1655','BL21', 'W3110'],
font_path = "/usr/share/fonts/truetype/msttcorefonts/Arial.ttf",
condition_list=['M9_glucose:in_house_2', 'LiAcet:in_house_2', 'acetate:in_house_2'],
legend_title='Strain',
title=None
)
[10]:
multiModulon.show_iModulon_activity_change(
species='MG1655',
condition_1='acetate:in_house_2',
condition_2='LiAcet:in_house_2',
fig_size=(5, 5),
font_path = "/usr/share/fonts/truetype/msttcorefonts/Arial.ttf"
)
[11]:
multiModulon.show_gene_iModulon_correlation(
gene='b0583',
component='Core_1',
fig_size=(4, 3),
font_path = "/usr/share/fonts/truetype/msttcorefonts/Arial.ttf"
)
Step 3: Compare Gene Membership for Core components
Visualize the core and unique components by view_iModulon_weights and view_core_iModulon_weights
[12]:
multiModulon.view_core_iModulon_weights(
component="Core_1",
fig_size=(6, 4),
font_path = "/usr/share/fonts/truetype/msttcorefonts/Arial.ttf",
reference_order=['MG1655', 'BL21', 'W3110'],
show_COG=True
)
[13]:
Core_1_comparison = multiModulon.compare_core_iModulon(
component='Core_1',
y_label='Strains',
reference_order=['MG1655', 'BL21', 'W3110'],
fig_size=(20, 6),
font_path="/usr/share/fonts/truetype/msttcorefonts/Arial.ttf",
show_gene_names=False
# save_path="./Output_iModulon_Figures/"
)
[14]:
Core_1_comparison
[14]:
| b0583 | b0584 | b0585 | b0586 | b0588 | b0589 | b0590 | b0591 | b0592 | b0593 | b0594 | b0595 | b0596 | b0597 | b0805 | b1102 | b1452 | b1656 | b1705 | b2155 | b2673 | b2674 | b2675 | b2676 | b4292 | b4293 | b4367 | b4511 | b4567 | b0803 | b0804 | b1018 | b1679 | b1680 | b1681 | b1682 | b1683 | b3337 | b4291 | b0150 | b0151 | b0152 | b0153 | b0468 | b0587 | b0683 | b1019 | b1252 | b1494 | b1495 | b1496 | b1905 | b2211 | b3005 | b3006 | b3070 | ECD_01019 | JW5326 | b1684 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MG1655 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | -1 | -1 | 0 |
| BL21 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | -1 | -1 |
| W3110 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | -1 | 1 | 1 |