.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "_examples/matplotlib/plot_main06_c_collateral_sensitivity.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        Click :ref:`here <sphx_glr_download__examples_matplotlib_plot_main06_c_collateral_sensitivity.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr__examples_matplotlib_plot_main06_c_collateral_sensitivity.py:


06.c Collateral Sensitivity Index (CRI)
----------------------------------------------

Since the computation of the Collateral Sensitivity Index is quite
computationally expensive, the results are saved into a .csv file
so that can be easily loaded and displayed. This script shows a
very basic graph of such information.

The generates a heatmap visualization for a dataset related to collateral
sensitivity. It uses the Seaborn library to plot the rectangular data as
a color-encoded matrix. The code loads the data from a CSV file, creates
mappings for categories and colors, and then plots the heatmap using the
loaded data and color maps. It also includes annotations, colorbar axes,
category patches, legend elements, and formatting options to enhance the
visualization.

.. GENERATED FROM PYTHON SOURCE LINES 19-74

.. code-block:: default
   :lineno-start: 19

    # Libraries
    import numpy as np
    import pandas as pd
    import seaborn as sns
    import matplotlib as mpl
    import matplotlib.pyplot as plt

    #
    from pathlib import Path
    from itertools import combinations
    from matplotlib.colors import LogNorm, Normalize

    # See https://matplotlib.org/devdocs/users/explain/customizing.html
    mpl.rcParams['axes.titlesize'] = 8
    mpl.rcParams['axes.labelsize'] = 8
    mpl.rcParams['xtick.labelsize'] = 8
    mpl.rcParams['ytick.labelsize'] = 8

    try:
        __file__
        TERMINAL = True
    except:
        TERMINAL = False


    # --------------------------------
    # Methods
    # --------------------------------
    def _check_ax_ay_equal(ax, ay):
        return ax==ay

    def _check_ax_ay_greater(ax, ay):
        return  ax>ay

    # --------------------------------
    # Constants
    # --------------------------------
    # Possible cmaps
    # https://r02b.github.io/seaborn_palettes/
    # Diverging: coolwarm, RdBu_r, vlag
    # Others: bone, gray, pink, twilight
    cmap0 = 'coolwarm'
    cmap1 = sns.light_palette("seagreen", as_cmap=True)
    cmap2 = sns.color_palette("light:b", as_cmap=True)
    cmap3 = sns.color_palette("vlag", as_cmap=True)
    cmap4 = sns.diverging_palette(220, 20, as_cmap=True)
    cmap5 = sns.color_palette("ch:s=.25,rot=-.25", as_cmap=True) # no
    cmap6 = sns.color_palette("light:#5A9", as_cmap=True)
    cmap7 = sns.light_palette("#9dedcc", as_cmap=True)
    cmap8 = sns.color_palette("YlGn", as_cmap=True)

    # Figure size
    figsize = (17, 4)


.. GENERATED FROM PYTHON SOURCE LINES 75-76

Let's load the data

.. GENERATED FROM PYTHON SOURCE LINES 76-112

.. code-block:: default
   :lineno-start: 77


    # Load data
    path = Path('../../datasets/collateral-sensitivity/20230525-135511')
    data = pd.read_csv(path / 'contingency.csv')
    abxs = pd.read_csv(path / 'categories.csv')

    # Format data
    data = data.set_index(['specimen', 'o', 'ax', 'ay'])
    data.RR = data.RR.fillna(0).astype(int)
    data.RS = data.RS.fillna(0).astype(int)
    data.SR = data.SR.fillna(0).astype(int)
    data.SS = data.SS.fillna(0).astype(int)
    data['samples'] = data.RR + data.RS + data.SR + data.SS

    #data['samples'] = data.iloc[:, :4].sum(axis=1)

    def filter_top_pairs(df, n=5):
        """Filter top n (Specimen, Organism) pairs."""
        # Find top
        top = df.groupby(level=[0, 1]) \
            .samples.sum() \
            .sort_values(ascending=False) \
            .head(n)

        # Filter
        idx = pd.IndexSlice
        a = top.index.get_level_values(0).unique()
        b = top.index.get_level_values(1).unique()

        # Return
        return df.loc[idx[a, b, :, :]]

    # Filter
    data = filter_top_pairs(data, n=2)
    data = data[data.samples > 500]


.. GENERATED FROM PYTHON SOURCE LINES 113-114

Lets see the data

.. GENERATED FROM PYTHON SOURCE LINES 114-123

.. code-block:: default
   :lineno-start: 114

    if TERMINAL:
        print("\n")
        print("Number of samples: %s" % data.samples.sum())
        print("Number of pairs: %s" % data.shape[0])
        print("Data:")
        print(data)
    data.iloc[:7,:].dropna(axis=1, how='all')


.. raw:: html

    <div class="output_subarea output_html rendered_html output_result">
    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }

        .dataframe tbody tr th {
            vertical-align: top;
        }

        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th></th>
          <th></th>
          <th></th>
          <th>RR</th>
          <th>RS</th>
          <th>SI</th>
          <th>SR</th>
          <th>SS</th>
          <th>MIS</th>
          <th>samples</th>
        </tr>
        <tr>
          <th>specimen</th>
          <th>o</th>
          <th>ax</th>
          <th>ay</th>
          <th></th>
          <th></th>
          <th></th>
          <th></th>
          <th></th>
          <th></th>
          <th></th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th rowspan="7" valign="top">WOUCUL</th>
          <th rowspan="7" valign="top">SAUR</th>
          <th rowspan="7" valign="top">ACHL</th>
          <th>ACLI</th>
          <td>2</td>
          <td>10</td>
          <td>3.0</td>
          <td>165</td>
          <td>585</td>
          <td>-0.003227</td>
          <td>762</td>
        </tr>
        <tr>
          <th>AERY</th>
          <td>2</td>
          <td>10</td>
          <td>4.0</td>
          <td>213</td>
          <td>536</td>
          <td>-0.007012</td>
          <td>761</td>
        </tr>
        <tr>
          <th>AFUS</th>
          <td>2</td>
          <td>10</td>
          <td>NaN</td>
          <td>86</td>
          <td>667</td>
          <td>0.003373</td>
          <td>765</td>
        </tr>
        <tr>
          <th>ALIN</th>
          <td>0</td>
          <td>10</td>
          <td>NaN</td>
          <td>0</td>
          <td>523</td>
          <td>0.000000</td>
          <td>533</td>
        </tr>
        <tr>
          <th>AMET</th>
          <td>3</td>
          <td>9</td>
          <td>NaN</td>
          <td>71</td>
          <td>679</td>
          <td>0.010702</td>
          <td>762</td>
        </tr>
        <tr>
          <th>AMUP</th>
          <td>0</td>
          <td>10</td>
          <td>6.0</td>
          <td>3</td>
          <td>690</td>
          <td>-0.000183</td>
          <td>703</td>
        </tr>
        <tr>
          <th>ANEO</th>
          <td>1</td>
          <td>9</td>
          <td>NaN</td>
          <td>27</td>
          <td>622</td>
          <td>0.003881</td>
          <td>659</td>
        </tr>
      </tbody>
    </table>
    </div>
    </div>
    <br />
    <br />

.. GENERATED FROM PYTHON SOURCE LINES 124-125

Lets load the antimicrobial data and create color mapping variables

.. GENERATED FROM PYTHON SOURCE LINES 125-137

.. code-block:: default
   :lineno-start: 126


    # Create dictionary to map category to color
    labels = abxs.category
    palette = sns.color_palette('Spectral', labels.nunique())
    palette = sns.cubehelix_palette(labels.nunique(),
        light=.9, dark=.1, reverse=True, start=1, rot=-2)
    lookup = dict(zip(labels.unique(), palette))

    # Create dictionary to map code to category
    code2cat = dict(zip(abxs.antimicrobial_code, abxs.category))


.. GENERATED FROM PYTHON SOURCE LINES 138-139

Let's display the information

.. GENERATED FROM PYTHON SOURCE LINES 139-276

.. code-block:: default
   :lineno-start: 140


    # Loop
    for i, df in data.groupby(level=[0, 1]):

        # Drop level
        df = df.droplevel(level=[0, 1])

        # Check possible issues.
        ax = df.index.get_level_values(0)
        ay = df.index.get_level_values(1)
        idx1 = _check_ax_ay_equal(ax, ay)
        idx2 = _check_ax_ay_greater(ax, ay)

        # Show
        print("%25s. ax==ay => %5s | ax>ay => %5s" % \
              (i, idx1.sum(), idx2.sum()))

        # Re-index to have square matrix
        abxs = set(ax) | set(ay)
        index = pd.MultiIndex.from_product([abxs, abxs])

        # Reformat MIS
        mis = df['MIS'] \
            .reindex(index, fill_value=np.nan) \
            .unstack()

        # Reformat samples
        freq = df['samples'] \
            .reindex(index, fill_value=0) \
            .unstack()

        # Combine in square matrix
        m1 = mis.copy(deep=True).to_numpy()
        m2 = freq.to_numpy()
        il1 = np.tril_indices(mis.shape[1])
        m1[il1] = m2.T[il1]
        m = pd.DataFrame(m1,
            index=mis.index, columns=mis.columns)

        # .. note: This is the matrix that is used in previous
        #          samples to display the CRI and the count using
        #          the sns.heatmap function
        # Save
        #m.to_csv('%s'%str(i))

        # Add frequency
        top_n = df \
            .sort_values('samples', ascending=False) \
            .head(20).drop(columns='MIS') \
            .dropna(axis=1, how='all')

        # Draw
        fig, axs = plt.subplots(nrows=1, ncols=4,
            sharey=False, sharex=False, figsize=figsize,
            gridspec_kw={'width_ratios': [2, 3, 3, 3.5]})

        sns.heatmap(data=mis * 100, annot=False, linewidth=.5,
                    cmap='coolwarm', vmin=-70, vmax=70, center=0,
                    annot_kws={"size": 8}, square=True,
                    ax=axs[2], xticklabels=True, yticklabels=True)

        sns.heatmap(data=freq, annot=False, linewidth=.5,
                    cmap='Blues', norm=LogNorm(),
                    annot_kws={"size": 8}, square=True,
                    ax=axs[1], xticklabels=True, yticklabels=True)

        sns.heatmap(top_n,
                    annot=False, linewidth=0.5,
                    cmap='Blues', ax=axs[0], zorder=1,
                    vmin=None, vmax=None, center=None, robust=True,
                    square=False, xticklabels=True, yticklabels=True,
                    cbar_kws={
                        'use_gridspec': True,
                        'location': 'right'
                    }
        )

        # Display
        masku = np.triu(np.ones_like(m))
        maskl = np.tril(np.ones_like(m))
        sns.heatmap(data=m, cmap=cmap8, mask=masku, ax=axs[3],
                    annot=False, linewidth=0.5, norm=LogNorm(),
                    annot_kws={"size": 8}, square=True, vmin=0)
        sns.heatmap(data=m, cmap=cmap4, mask=maskl, ax=axs[3],
                    annot=False, linewidth=0.5, vmin=-0.7, vmax=0.7,
                    center=0, annot_kws={"size": 8}, square=True,
                    xticklabels=True, yticklabels=True)

        # Configure axes
        axs[0].set_title('Contingency')
        axs[1].set_title('Number of samples')
        axs[2].set_title('Collateral Sensitivity Index')
        axs[3].set_title('Samples / Collateral Sensitivity')

        # Add colors to xticklabels

        #abxs = pd.read_csv('../../datasets/susceptibility-nhs/susceptibility-v0.0.1/antimicrobials.csv')##

        #groups = dict(zip(abxs.antimicrobial_code, abxs.category))
        #cmap = sns.color_palette("Spectral", abxs.category.nunique())
        #colors = dict(zip(abxs.category, cmap))

        # ------------------------------------------
        # Add category colors on xtick labels
        # ------------------------------------------
        # Create colors
        colors = m.columns.to_series().map(code2cat).map(lookup)

        # Loop
        for lbl in axs[3].get_xticklabels():
            try:
                x, y = lbl.get_position()
                c = colors.to_dict().get(lbl.get_text(), 'k')
                lbl.set_color(c)
                lbl.set_weight('bold')

                """
                axs[3].annotate('', xy=(2000, 0),
                    #xytext=(0, -15 - axs[3].xaxis.labelpad),
                    xytext=(i.x, y)
                    xycoords=('data', 'axes fraction'),
                    textcoords='offset points',
                    ha='center', va='top',
                    bbox=dict(boxstyle='round', fc='none', ec='red'))
                """
            except Exception as e:
                print(lbl.get_text(), e)

        # Configure plot
        plt.suptitle('%s - %s' % (i[0], i[1]))
        plt.tight_layout()
        plt.subplots_adjust(wspace=0.1)

        # Exit loop
        break

    # Show
    plt.show()


.. image-sg:: /_examples/matplotlib/images/sphx_glr_plot_main06_c_collateral_sensitivity_001.png
   :alt: URICUL - ECOL, Contingency, Number of samples, Collateral Sensitivity Index, Samples / Collateral Sensitivity
   :srcset: /_examples/matplotlib/images/sphx_glr_plot_main06_c_collateral_sensitivity_001.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 Out:

 .. code-block:: none

           ('URICUL', 'ECOL'). ax==ay =>    18 | ax>ay =>     0


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes  2.476 seconds)


.. _sphx_glr_download__examples_matplotlib_plot_main06_c_collateral_sensitivity.py:


.. only :: html

 .. container:: sphx-glr-footer
    :class: sphx-glr-footer-example


  .. container:: sphx-glr-download sphx-glr-download-python

     :download:`Download Python source code: plot_main06_c_collateral_sensitivity.py <plot_main06_c_collateral_sensitivity.py>`


  .. container:: sphx-glr-download sphx-glr-download-jupyter

     :download:`Download Jupyter notebook: plot_main06_c_collateral_sensitivity.ipynb <plot_main06_c_collateral_sensitivity.ipynb>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_