{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "edcb3b82",
   "metadata": {},
   "source": [
    "## Analyse Binette results\n",
    "\n",
    "Let's visualize the results from Binette and compare them to the initial bin sets used as input. \n",
    "\n",
    "To explore these results interactively, you can open the Jupyter notebook via Binder by following this link: [![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/genotoul-bioinfo/Binette/binder_tutorial_env?urlpath=git-pull%3Frepo%3Dhttps%253A%252F%252Fgithub.com%252Fgenotoul-bioinfo%252FBinette%26urlpath%3Dtree%252FBinette%252Fdocs%252Ftutorial%252Fanalyse_binette_result.ipynb%26branch%3Dmain)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "dbe1d73b",
   "metadata": {},
   "source": [
    "### Import Necessary Libraries\n",
    "\n",
    "First, we'll need to import the necessary libraries for our analysis and plotting:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "id": "9e9153ef",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:20.824074Z",
     "iopub.status.busy": "2025-10-14T08:42:20.823897Z",
     "iopub.status.idle": "2025-10-14T08:42:22.884995Z",
     "shell.execute_reply": "2025-10-14T08:42:22.884454Z"
    }
   },
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "from pathlib import Path\n",
    "import plotly.express as px\n",
    "\n",
    "# The following two lines are needed to properly display Plotly graphs in the documentation\n",
    "# However you may need to remove these lines and restart the kernel to visualise the graph in another context\n",
    "import plotly.io as pio\n",
    "pio.renderers.default = \"sphinx_gallery\""
   ]
  },
  {
   "cell_type": "markdown",
   "id": "b93e8a0e",
   "metadata": {},
   "source": [
    "### Load Binette Results\n",
    "\n",
    "Now, let's load the final Binette quality report into a Pandas DataFrame:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "id": "d95ad45c",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:22.887040Z",
     "iopub.status.busy": "2025-10-14T08:42:22.886826Z",
     "iopub.status.idle": "2025-10-14T08:42:22.922631Z",
     "shell.execute_reply": "2025-10-14T08:42:22.922125Z"
    },
    "lines_to_next_cell": 0
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>name</th>\n",
       "      <th>origin</th>\n",
       "      <th>is_original</th>\n",
       "      <th>original_name</th>\n",
       "      <th>completeness</th>\n",
       "      <th>contamination</th>\n",
       "      <th>score</th>\n",
       "      <th>checkm2_model</th>\n",
       "      <th>size</th>\n",
       "      <th>N50</th>\n",
       "      <th>coding_density</th>\n",
       "      <th>contig_count</th>\n",
       "      <th>tool</th>\n",
       "      <th>index</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>binette_bin1</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin1</td>\n",
       "      <td>100.00</td>\n",
       "      <td>0.10</td>\n",
       "      <td>99.80</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>4658605</td>\n",
       "      <td>82084</td>\n",
       "      <td>0.8803</td>\n",
       "      <td>91</td>\n",
       "      <td>binette</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>binette_bin2</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin2</td>\n",
       "      <td>99.94</td>\n",
       "      <td>0.23</td>\n",
       "      <td>99.48</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>2796059</td>\n",
       "      <td>41151</td>\n",
       "      <td>0.8882</td>\n",
       "      <td>98</td>\n",
       "      <td>binette</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>binette_bin3</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin3</td>\n",
       "      <td>96.10</td>\n",
       "      <td>0.27</td>\n",
       "      <td>95.56</td>\n",
       "      <td>Gradient Boost (General Model)</td>\n",
       "      <td>2559714</td>\n",
       "      <td>11656</td>\n",
       "      <td>0.8990</td>\n",
       "      <td>315</td>\n",
       "      <td>binette</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>binette_bin4</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin4</td>\n",
       "      <td>93.43</td>\n",
       "      <td>0.12</td>\n",
       "      <td>93.19</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>4229623</td>\n",
       "      <td>40395</td>\n",
       "      <td>0.9031</td>\n",
       "      <td>148</td>\n",
       "      <td>binette</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>binette_bin5</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin5</td>\n",
       "      <td>95.15</td>\n",
       "      <td>2.36</td>\n",
       "      <td>90.43</td>\n",
       "      <td>Gradient Boost (General Model)</td>\n",
       "      <td>1843697</td>\n",
       "      <td>10106</td>\n",
       "      <td>0.8835</td>\n",
       "      <td>266</td>\n",
       "      <td>binette</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>binette_bin6</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin6</td>\n",
       "      <td>91.50</td>\n",
       "      <td>2.21</td>\n",
       "      <td>87.08</td>\n",
       "      <td>Gradient Boost (General Model)</td>\n",
       "      <td>3543663</td>\n",
       "      <td>5964</td>\n",
       "      <td>0.8542</td>\n",
       "      <td>786</td>\n",
       "      <td>binette</td>\n",
       "      <td>5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>binette_bin7</td>\n",
       "      <td>semibin2_output/output_bins</td>\n",
       "      <td>True</td>\n",
       "      <td>SemiBin_23.fa</td>\n",
       "      <td>84.06</td>\n",
       "      <td>1.66</td>\n",
       "      <td>80.74</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>1689331</td>\n",
       "      <td>8389</td>\n",
       "      <td>0.8678</td>\n",
       "      <td>246</td>\n",
       "      <td>binette</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7</th>\n",
       "      <td>binette_bin8</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin8</td>\n",
       "      <td>74.32</td>\n",
       "      <td>2.17</td>\n",
       "      <td>69.98</td>\n",
       "      <td>Gradient Boost (General Model)</td>\n",
       "      <td>1257085</td>\n",
       "      <td>5017</td>\n",
       "      <td>0.8946</td>\n",
       "      <td>257</td>\n",
       "      <td>binette</td>\n",
       "      <td>7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>8</th>\n",
       "      <td>binette_bin9</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin9</td>\n",
       "      <td>74.08</td>\n",
       "      <td>3.82</td>\n",
       "      <td>66.44</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>3492747</td>\n",
       "      <td>3005</td>\n",
       "      <td>0.9218</td>\n",
       "      <td>1308</td>\n",
       "      <td>binette</td>\n",
       "      <td>8</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9</th>\n",
       "      <td>binette_bin10</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin10</td>\n",
       "      <td>64.49</td>\n",
       "      <td>1.79</td>\n",
       "      <td>60.91</td>\n",
       "      <td>Gradient Boost (General Model)</td>\n",
       "      <td>1266713</td>\n",
       "      <td>3796</td>\n",
       "      <td>0.9064</td>\n",
       "      <td>415</td>\n",
       "      <td>binette</td>\n",
       "      <td>9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>binette_bin11</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin11</td>\n",
       "      <td>60.27</td>\n",
       "      <td>1.85</td>\n",
       "      <td>56.57</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>2080860</td>\n",
       "      <td>4612</td>\n",
       "      <td>0.9044</td>\n",
       "      <td>519</td>\n",
       "      <td>binette</td>\n",
       "      <td>10</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>11</th>\n",
       "      <td>binette_bin12</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin12</td>\n",
       "      <td>52.00</td>\n",
       "      <td>1.07</td>\n",
       "      <td>49.86</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>2516999</td>\n",
       "      <td>5503</td>\n",
       "      <td>0.9092</td>\n",
       "      <td>482</td>\n",
       "      <td>binette</td>\n",
       "      <td>11</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>12</th>\n",
       "      <td>binette_bin13</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin13</td>\n",
       "      <td>48.86</td>\n",
       "      <td>4.50</td>\n",
       "      <td>39.86</td>\n",
       "      <td>Gradient Boost (General Model)</td>\n",
       "      <td>1119471</td>\n",
       "      <td>1517</td>\n",
       "      <td>0.8945</td>\n",
       "      <td>729</td>\n",
       "      <td>binette</td>\n",
       "      <td>12</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13</th>\n",
       "      <td>binette_bin14</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin14</td>\n",
       "      <td>43.66</td>\n",
       "      <td>5.11</td>\n",
       "      <td>33.44</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>2087483</td>\n",
       "      <td>4593</td>\n",
       "      <td>0.9248</td>\n",
       "      <td>476</td>\n",
       "      <td>binette</td>\n",
       "      <td>13</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14</th>\n",
       "      <td>binette_bin15</td>\n",
       "      <td>binette</td>\n",
       "      <td>False</td>\n",
       "      <td>binette_bin15</td>\n",
       "      <td>43.93</td>\n",
       "      <td>9.52</td>\n",
       "      <td>24.89</td>\n",
       "      <td>Neural Network (Specific Model)</td>\n",
       "      <td>2451217</td>\n",
       "      <td>1480</td>\n",
       "      <td>0.8544</td>\n",
       "      <td>1627</td>\n",
       "      <td>binette</td>\n",
       "      <td>14</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "             name                       origin  is_original  original_name  \\\n",
       "0    binette_bin1                      binette        False   binette_bin1   \n",
       "1    binette_bin2                      binette        False   binette_bin2   \n",
       "2    binette_bin3                      binette        False   binette_bin3   \n",
       "3    binette_bin4                      binette        False   binette_bin4   \n",
       "4    binette_bin5                      binette        False   binette_bin5   \n",
       "5    binette_bin6                      binette        False   binette_bin6   \n",
       "6    binette_bin7  semibin2_output/output_bins         True  SemiBin_23.fa   \n",
       "7    binette_bin8                      binette        False   binette_bin8   \n",
       "8    binette_bin9                      binette        False   binette_bin9   \n",
       "9   binette_bin10                      binette        False  binette_bin10   \n",
       "10  binette_bin11                      binette        False  binette_bin11   \n",
       "11  binette_bin12                      binette        False  binette_bin12   \n",
       "12  binette_bin13                      binette        False  binette_bin13   \n",
       "13  binette_bin14                      binette        False  binette_bin14   \n",
       "14  binette_bin15                      binette        False  binette_bin15   \n",
       "\n",
       "    completeness  contamination  score                    checkm2_model  \\\n",
       "0         100.00           0.10  99.80  Neural Network (Specific Model)   \n",
       "1          99.94           0.23  99.48  Neural Network (Specific Model)   \n",
       "2          96.10           0.27  95.56   Gradient Boost (General Model)   \n",
       "3          93.43           0.12  93.19  Neural Network (Specific Model)   \n",
       "4          95.15           2.36  90.43   Gradient Boost (General Model)   \n",
       "5          91.50           2.21  87.08   Gradient Boost (General Model)   \n",
       "6          84.06           1.66  80.74  Neural Network (Specific Model)   \n",
       "7          74.32           2.17  69.98   Gradient Boost (General Model)   \n",
       "8          74.08           3.82  66.44  Neural Network (Specific Model)   \n",
       "9          64.49           1.79  60.91   Gradient Boost (General Model)   \n",
       "10         60.27           1.85  56.57  Neural Network (Specific Model)   \n",
       "11         52.00           1.07  49.86  Neural Network (Specific Model)   \n",
       "12         48.86           4.50  39.86   Gradient Boost (General Model)   \n",
       "13         43.66           5.11  33.44  Neural Network (Specific Model)   \n",
       "14         43.93           9.52  24.89  Neural Network (Specific Model)   \n",
       "\n",
       "       size    N50  coding_density  contig_count     tool  index  \n",
       "0   4658605  82084          0.8803            91  binette      0  \n",
       "1   2796059  41151          0.8882            98  binette      1  \n",
       "2   2559714  11656          0.8990           315  binette      2  \n",
       "3   4229623  40395          0.9031           148  binette      3  \n",
       "4   1843697  10106          0.8835           266  binette      4  \n",
       "5   3543663   5964          0.8542           786  binette      5  \n",
       "6   1689331   8389          0.8678           246  binette      6  \n",
       "7   1257085   5017          0.8946           257  binette      7  \n",
       "8   3492747   3005          0.9218          1308  binette      8  \n",
       "9   1266713   3796          0.9064           415  binette      9  \n",
       "10  2080860   4612          0.9044           519  binette     10  \n",
       "11  2516999   5503          0.9092           482  binette     11  \n",
       "12  1119471   1517          0.8945           729  binette     12  \n",
       "13  2087483   4593          0.9248           476  binette     13  \n",
       "14  2451217   1480          0.8544          1627  binette     14  "
      ]
     },
     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "binette_result_file = \"./binette_output/final_bins_quality_reports.tsv\"\n",
    "df_binette = pd.read_csv(binette_result_file, sep='\\t')\n",
    "df_binette['tool'] = \"binette\"  # Add a column to label the tool\n",
    "df_binette['index'] = df_binette.index  # Add an index column\n",
    "df_binette"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c1372a73",
   "metadata": {},
   "source": [
    "### Load and Combine Input Bin Quality Reports\n",
    "\n",
    "Next, we will load the quality reports of the input bin sets, computed by various tools and saved by Binette. We’ll combine these into a single DataFrame and add a column to indicate high-quality bins. We define a high-quality bin as one with contamination ≤ 5% and completeness ≥ 90%."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "id": "fcb016f2",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:22.924325Z",
     "iopub.status.busy": "2025-10-14T08:42:22.924161Z",
     "iopub.status.idle": "2025-10-14T08:42:22.954467Z",
     "shell.execute_reply": "2025-10-14T08:42:22.954109Z"
    }
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>tool</th>\n",
       "      <th>completeness</th>\n",
       "      <th>contamination</th>\n",
       "      <th>size</th>\n",
       "      <th>N50</th>\n",
       "      <th>contig_count</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>binette</td>\n",
       "      <td>100.00</td>\n",
       "      <td>0.10</td>\n",
       "      <td>4658605</td>\n",
       "      <td>82084</td>\n",
       "      <td>91</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>binette</td>\n",
       "      <td>99.94</td>\n",
       "      <td>0.23</td>\n",
       "      <td>2796059</td>\n",
       "      <td>41151</td>\n",
       "      <td>98</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>binette</td>\n",
       "      <td>96.10</td>\n",
       "      <td>0.27</td>\n",
       "      <td>2559714</td>\n",
       "      <td>11656</td>\n",
       "      <td>315</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>binette</td>\n",
       "      <td>93.43</td>\n",
       "      <td>0.12</td>\n",
       "      <td>4229623</td>\n",
       "      <td>40395</td>\n",
       "      <td>148</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>binette</td>\n",
       "      <td>95.15</td>\n",
       "      <td>2.36</td>\n",
       "      <td>1843697</td>\n",
       "      <td>10106</td>\n",
       "      <td>266</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9</th>\n",
       "      <td>metabat2</td>\n",
       "      <td>44.85</td>\n",
       "      <td>0.79</td>\n",
       "      <td>987990</td>\n",
       "      <td>4743</td>\n",
       "      <td>220</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>metabat2</td>\n",
       "      <td>44.38</td>\n",
       "      <td>0.58</td>\n",
       "      <td>1745116</td>\n",
       "      <td>4265</td>\n",
       "      <td>420</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>11</th>\n",
       "      <td>metabat2</td>\n",
       "      <td>25.47</td>\n",
       "      <td>0.03</td>\n",
       "      <td>1077467</td>\n",
       "      <td>91995</td>\n",
       "      <td>14</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>12</th>\n",
       "      <td>metabat2</td>\n",
       "      <td>94.21</td>\n",
       "      <td>37.06</td>\n",
       "      <td>8631886</td>\n",
       "      <td>4347</td>\n",
       "      <td>1994</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13</th>\n",
       "      <td>metabat2</td>\n",
       "      <td>7.06</td>\n",
       "      <td>0.03</td>\n",
       "      <td>252404</td>\n",
       "      <td>64012</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>139 rows × 6 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "        tool  completeness  contamination     size    N50  contig_count\n",
       "0    binette        100.00           0.10  4658605  82084            91\n",
       "1    binette         99.94           0.23  2796059  41151            98\n",
       "2    binette         96.10           0.27  2559714  11656           315\n",
       "3    binette         93.43           0.12  4229623  40395           148\n",
       "4    binette         95.15           2.36  1843697  10106           266\n",
       "..       ...           ...            ...      ...    ...           ...\n",
       "9   metabat2         44.85           0.79   987990   4743           220\n",
       "10  metabat2         44.38           0.58  1745116   4265           420\n",
       "11  metabat2         25.47           0.03  1077467  91995            14\n",
       "12  metabat2         94.21          37.06  8631886   4347          1994\n",
       "13  metabat2          7.06           0.03   252404  64012             6\n",
       "\n",
       "[139 rows x 6 columns]"
      ]
     },
     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "from pathlib import Path\n",
    "\n",
    "input_bins_quality_reports_dir = Path(\"binette_output/input_bins_quality_reports/\")\n",
    "\n",
    "# Initialize the list with Binette results\n",
    "df_input_bin_list = [df_binette]\n",
    "\n",
    "# Load each input bin quality report\n",
    "for input_bin_metric_file in input_bins_quality_reports_dir.glob(\"*tsv\"):\n",
    "    tool = input_bin_metric_file.name.split('.')[1].split('_')[0]  # Extract tool name from file name\n",
    "    df_input = pd.read_csv(input_bin_metric_file, sep='\\t')\n",
    "    df_input['index'] = df_input.index\n",
    "    df_input['tool'] = tool\n",
    "    df_input_bin_list.append(df_input)\n",
    "\n",
    "# Combine all DataFrames into one\n",
    "df_bins = pd.concat(df_input_bin_list)\n",
    "\n",
    "# Add a column to indicate high-quality bins\n",
    "df_bins[\"High quality bin\"] = (df_bins['completeness'] >= 90) & (df_bins['contamination'] <= 5)\n",
    "\n",
    "# Display relevant columns\n",
    "df_bins[[ \"tool\", \"completeness\", \"contamination\", \"size\", \"N50\", \"contig_count\"]]\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "80ef2544",
   "metadata": {},
   "source": [
    "### Plot bin completeness and contamination\n",
    "With the DataFrame containing both Binette’s final bins and the input bins, we can now create a scatter plot to visualize the results:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "id": "277cb781",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:22.956317Z",
     "iopub.status.busy": "2025-10-14T08:42:22.956154Z",
     "iopub.status.idle": "2025-10-14T08:42:23.439658Z",
     "shell.execute_reply": "2025-10-14T08:42:23.439082Z"
    }
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>            <script src=\"https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.5/MathJax.js?config=TeX-AMS-MML_SVG\"></script><script type=\"text/javascript\">if (window.MathJax && window.MathJax.Hub && window.MathJax.Hub.Config) {window.MathJax.Hub.Config({SVG: {font: \"STIX-Web\"}});}</script>                <script type=\"text/javascript\">window.PlotlyConfig = {MathJaxConfig: 'local'};</script>\n",
       "        <script charset=\"utf-8\" src=\"https://cdn.plot.ly/plotly-2.35.2.min.js\"></script>                <div id=\"94a0b306-4afc-43fc-9ab1-fa396c2db75a\" class=\"plotly-graph-div\" style=\"height:800px; width:600px;\"></div>            <script type=\"text/javascript\">                                    window.PLOTLYENV=window.PLOTLYENV || {};                                    if (document.getElementById(\"94a0b306-4afc-43fc-9ab1-fa396c2db75a\")) {                    Plotly.newPlot(                        \"94a0b306-4afc-43fc-9ab1-fa396c2db75a\",                        [{\"hovertemplate\":\"High quality bin=True\\u003cbr\\u003etool=binette\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"True\",\"marker\":{\"color\":\"#636efa\",\"size\":[4658605,2796059,2559714,4229623,1843697,3543663],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"True\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[100.0,99.94,96.1,93.43,95.15,91.5],\"xaxis\":\"x5\",\"y\":[0.1,0.23,0.27,0.12,2.36,2.21],\"yaxis\":\"y5\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=True\\u003cbr\\u003etool=maxbin2\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"True\",\"marker\":{\"color\":\"#636efa\",\"size\":[4503016],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"True\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[95.16],\"xaxis\":\"x4\",\"y\":[0.48],\"yaxis\":\"y4\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=True\\u003cbr\\u003etool=concoct\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"True\",\"marker\":{\"color\":\"#636efa\",\"size\":[3026209,4765466,2277652,3710127],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"True\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[100.0,100.0,92.76,92.75],\"xaxis\":\"x3\",\"y\":[0.37,0.5,0.29,4.19],\"yaxis\":\"y3\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=True\\u003cbr\\u003etool=semibin2\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"True\",\"marker\":{\"color\":\"#636efa\",\"size\":[4665754,2912918,4107840,2110164],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"True\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[100.0,99.96,92.66,91.98],\"xaxis\":\"x2\",\"y\":[0.11,0.28,0.04,0.18],\"yaxis\":\"y2\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=True\\u003cbr\\u003etool=metabat2\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"True\",\"marker\":{\"color\":\"#636efa\",\"size\":[2799572,2119954,4269732],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"True\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[99.94,93.17,93.58],\"xaxis\":\"x\",\"y\":[0.24,0.17,0.96],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=False\\u003cbr\\u003etool=binette\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"False\",\"marker\":{\"color\":\"#EF553B\",\"size\":[1689331,1257085,3492747,1266713,2080860,2516999,1119471,2087483,2451217],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"False\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[84.06,74.32,74.08,64.49,60.27,52.0,48.86,43.66,43.93],\"xaxis\":\"x5\",\"y\":[1.66,2.17,3.82,1.79,1.85,1.07,4.5,5.11,9.52],\"yaxis\":\"y5\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=False\\u003cbr\\u003etool=maxbin2\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"False\",\"marker\":{\"color\":\"#EF553B\",\"size\":[3249408,3025929,2374963,2805633,3632688,1716259,5044849,5815088,1703900,2136732,2691298,1869482,3343065,1272133,631726,402991,2432011,4626721],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"False\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[100.0,77.56,63.07,59.62,67.61,71.26,77.58,61.76,42.1,38.16,40.36,51.06,46.9,22.54,12.58,10.11,67.21,92.25],\"xaxis\":\"x4\",\"y\":[7.67,10.57,9.84,9.06,15.28,17.59,21.99,17.39,8.2,6.61,8.39,13.8,12.51,2.0,0.02,0.15,31.26,47.17],\"yaxis\":\"y4\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=False\\u003cbr\\u003etool=concoct\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"False\",\"marker\":{\"color\":\"#EF553B\",\"size\":[1943868,7222837,3268395,1131941,828865,3987640,2611822,754033,5127864,1917855,6208056,408199,379824,3554673,8419694,912608,187478,302044,219238,223994,813241,4430,2419,22682,3216,3113,2161,21581,2475,1344,5320,2524,9498,4999,1240,11081,2363,1235,2310,7347,1160,2204,1058,1008,1037,9555,3622,7068,13077,5512,89935,31150,100818,133109,120657,106196,27566,24906,41153,43900,44907],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"False\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[88.02,100.0,72.7,48.81,35.51,99.99,45.65,33.36,47.24,39.86,85.75,22.18,19.73,90.21,100.0,18.34,15.25,13.07,15.1,9.37,15.35,6.98,6.6,6.42,6.4,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.05,5.66,5.63,5.47,5.09,5.03,4.31,3.86,3.74,3.69,3.68,2.73,2.62,2.53,2.48,2.4],\"xaxis\":\"x3\",\"y\":[2.42,12.49,3.8,6.56,2.03,36.32,10.31,4.27,11.26,8.23,31.45,0.41,0.05,36.15,41.64,0.87,0.1,0.09,2.61,0.0,3.6,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.01,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.01,0.0,0.0,0.0,0.0,0.0],\"yaxis\":\"y3\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=False\\u003cbr\\u003etool=semibin2\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"False\",\"marker\":{\"color\":\"#EF553B\",\"size\":[1689331,1791548,2995112,1260287,1640979,2532101,978913,1528708,2229295,574575,344167,884088,221029,334379,346613,425947,346400,220633,207387,248021,277958,227306],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"False\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[84.06,82.15,83.73,74.0,51.84,51.87,44.82,36.97,45.26,14.34,10.73,10.93,8.98,9.95,8.25,8.0,5.99,6.02,5.29,5.14,4.65,3.46],\"xaxis\":\"x2\",\"y\":[1.66,0.98,2.21,2.22,0.32,1.21,0.15,0.34,6.4,0.17,0.0,0.21,0.0,0.58,0.0,0.0,0.01,0.13,0.01,0.01,0.0,0.0],\"yaxis\":\"y2\",\"type\":\"scatter\"},{\"hovertemplate\":\"High quality bin=False\\u003cbr\\u003etool=metabat2\\u003cbr\\u003ecompleteness=%{x}\\u003cbr\\u003econtamination=%{y}\\u003cbr\\u003esize=%{marker.size}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"False\",\"marker\":{\"color\":\"#EF553B\",\"size\":[1902761,3015034,1840680,3477636,1401920,1673426,987990,1745116,1077467,8631886,252404],\"sizemode\":\"area\",\"sizeref\":21579.715,\"symbol\":\"circle\"},\"mode\":\"markers\",\"name\":\"False\",\"orientation\":\"v\",\"showlegend\":false,\"x\":[85.01,85.86,84.18,76.64,70.81,51.2,44.85,44.38,25.47,94.21,7.06],\"xaxis\":\"x\",\"y\":[1.83,2.76,3.72,0.13,5.63,0.43,0.79,0.58,0.03,37.06,0.03],\"yaxis\":\"y\",\"type\":\"scatter\"}],                        {\"template\":{\"data\":{\"histogram2dcontour\":[{\"type\":\"histogram2dcontour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"choropleth\":[{\"type\":\"choropleth\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"histogram2d\":[{\"type\":\"histogram2d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmap\":[{\"type\":\"heatmap\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmapgl\":[{\"type\":\"heatmapgl\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"contourcarpet\":[{\"type\":\"contourcarpet\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"contour\":[{\"type\":\"contour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"surface\":[{\"type\":\"surface\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"mesh3d\":[{\"type\":\"mesh3d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"scatter\":[{\"fillpattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2},\"type\":\"scatter\"}],\"parcoords\":[{\"type\":\"parcoords\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolargl\":[{\"type\":\"scatterpolargl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"bar\":[{\"error_x\":{\"color\":\"#2a3f5f\"},\"error_y\":{\"color\":\"#2a3f5f\"},\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"bar\"}],\"scattergeo\":[{\"type\":\"scattergeo\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolar\":[{\"type\":\"scatterpolar\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"histogram\":[{\"marker\":{\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"histogram\"}],\"scattergl\":[{\"type\":\"scattergl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatter3d\":[{\"type\":\"scatter3d\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattermapbox\":[{\"type\":\"scattermapbox\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterternary\":[{\"type\":\"scatterternary\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattercarpet\":[{\"type\":\"scattercarpet\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"carpet\":[{\"aaxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"baxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"type\":\"carpet\"}],\"table\":[{\"cells\":{\"fill\":{\"color\":\"#EBF0F8\"},\"line\":{\"color\":\"white\"}},\"header\":{\"fill\":{\"color\":\"#C8D4E3\"},\"line\":{\"color\":\"white\"}},\"type\":\"table\"}],\"barpolar\":[{\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"barpolar\"}],\"pie\":[{\"automargin\":true,\"type\":\"pie\"}]},\"layout\":{\"autotypenumbers\":\"strict\",\"colorway\":[\"#636efa\",\"#EF553B\",\"#00cc96\",\"#ab63fa\",\"#FFA15A\",\"#19d3f3\",\"#FF6692\",\"#B6E880\",\"#FF97FF\",\"#FECB52\"],\"font\":{\"color\":\"#2a3f5f\"},\"hovermode\":\"closest\",\"hoverlabel\":{\"align\":\"left\"},\"paper_bgcolor\":\"white\",\"plot_bgcolor\":\"#E5ECF6\",\"polar\":{\"bgcolor\":\"#E5ECF6\",\"angularaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"radialaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"ternary\":{\"bgcolor\":\"#E5ECF6\",\"aaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"baxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"caxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"coloraxis\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"colorscale\":{\"sequential\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"sequentialminus\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"diverging\":[[0,\"#8e0152\"],[0.1,\"#c51b7d\"],[0.2,\"#de77ae\"],[0.3,\"#f1b6da\"],[0.4,\"#fde0ef\"],[0.5,\"#f7f7f7\"],[0.6,\"#e6f5d0\"],[0.7,\"#b8e186\"],[0.8,\"#7fbc41\"],[0.9,\"#4d9221\"],[1,\"#276419\"]]},\"xaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"yaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"scene\":{\"xaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"yaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"zaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2}},\"shapedefaults\":{\"line\":{\"color\":\"#2a3f5f\"}},\"annotationdefaults\":{\"arrowcolor\":\"#2a3f5f\",\"arrowhead\":0,\"arrowwidth\":1},\"geo\":{\"bgcolor\":\"white\",\"landcolor\":\"#E5ECF6\",\"subunitcolor\":\"white\",\"showland\":true,\"showlakes\":true,\"lakecolor\":\"white\"},\"title\":{\"x\":0.05},\"mapbox\":{\"style\":\"light\"}}},\"xaxis\":{\"anchor\":\"y\",\"domain\":[0.0,0.98],\"title\":{\"text\":\"completeness\"}},\"yaxis\":{\"anchor\":\"x\",\"domain\":[0.0,0.17600000000000002],\"title\":{\"text\":\"contamination\"}},\"xaxis2\":{\"anchor\":\"y2\",\"domain\":[0.0,0.98],\"matches\":\"x\",\"showticklabels\":false},\"yaxis2\":{\"anchor\":\"x2\",\"domain\":[0.20600000000000002,0.382],\"matches\":\"y\",\"title\":{\"text\":\"contamination\"}},\"xaxis3\":{\"anchor\":\"y3\",\"domain\":[0.0,0.98],\"matches\":\"x\",\"showticklabels\":false},\"yaxis3\":{\"anchor\":\"x3\",\"domain\":[0.41200000000000003,0.5880000000000001],\"matches\":\"y\",\"title\":{\"text\":\"contamination\"}},\"xaxis4\":{\"anchor\":\"y4\",\"domain\":[0.0,0.98],\"matches\":\"x\",\"showticklabels\":false},\"yaxis4\":{\"anchor\":\"x4\",\"domain\":[0.618,0.794],\"matches\":\"y\",\"title\":{\"text\":\"contamination\"}},\"xaxis5\":{\"anchor\":\"y5\",\"domain\":[0.0,0.98],\"matches\":\"x\",\"showticklabels\":false},\"yaxis5\":{\"anchor\":\"x5\",\"domain\":[0.8240000000000001,1.0],\"matches\":\"y\",\"title\":{\"text\":\"contamination\"}},\"annotations\":[{\"font\":{},\"showarrow\":false,\"text\":\"tool=metabat2\",\"textangle\":90,\"x\":0.98,\"xanchor\":\"left\",\"xref\":\"paper\",\"y\":0.08800000000000001,\"yanchor\":\"middle\",\"yref\":\"paper\"},{\"font\":{},\"showarrow\":false,\"text\":\"tool=semibin2\",\"textangle\":90,\"x\":0.98,\"xanchor\":\"left\",\"xref\":\"paper\",\"y\":0.29400000000000004,\"yanchor\":\"middle\",\"yref\":\"paper\"},{\"font\":{},\"showarrow\":false,\"text\":\"tool=concoct\",\"textangle\":90,\"x\":0.98,\"xanchor\":\"left\",\"xref\":\"paper\",\"y\":0.5,\"yanchor\":\"middle\",\"yref\":\"paper\"},{\"font\":{},\"showarrow\":false,\"text\":\"tool=maxbin2\",\"textangle\":90,\"x\":0.98,\"xanchor\":\"left\",\"xref\":\"paper\",\"y\":0.706,\"yanchor\":\"middle\",\"yref\":\"paper\"},{\"font\":{},\"showarrow\":false,\"text\":\"tool=binette\",\"textangle\":90,\"x\":0.98,\"xanchor\":\"left\",\"xref\":\"paper\",\"y\":0.912,\"yanchor\":\"middle\",\"yref\":\"paper\"}],\"legend\":{\"title\":{\"text\":\"High Quality Bin\"},\"tracegroupgap\":0,\"itemsizing\":\"constant\"},\"title\":{\"text\":\"Comparison of Bin Quality Metrics\"},\"width\":600,\"height\":800},                        {\"responsive\": true}                    )                };                            </script>        </div>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "import plotly.express as px\n",
    "\n",
    "# Create a scatter plot to visualize completeness and contamination\n",
    "fig = px.scatter(df_bins, \n",
    "                 x=\"completeness\", \n",
    "                 y=\"contamination\", \n",
    "                 color=\"High quality bin\", \n",
    "                 size=\"size\",  \n",
    "                 facet_row=\"tool\",\n",
    "                 title=\"Bin Quality Comparison\",\n",
    "                )\n",
    "\n",
    "# Update layout for better visibility\n",
    "fig.update_layout(\n",
    "    width=600,\n",
    "    height=800,\n",
    "    legend_title=\"High Quality Bin\",\n",
    "    title=\"Comparison of Bin Quality Metrics\"\n",
    ")\n",
    "\n",
    "# Show the plot\n",
    "fig.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "06a14412",
   "metadata": {},
   "source": [
    "We can see that binette bins are the one displaying the most high quality bins (completeness ≥ 90% and contamination ≤ 5%).\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "323f5637",
   "metadata": {},
   "source": [
    "### Comparing Binning Tools Using Bin Score Curves\n",
    "\n",
    "A common way to compare bin sets is by sorting the bins based on their scores and plotting them against their index.\n",
    "\n",
    "Here’s how we can create such a plot:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "id": "79faaa3a",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:23.441369Z",
     "iopub.status.busy": "2025-10-14T08:42:23.441165Z",
     "iopub.status.idle": "2025-10-14T08:42:23.482193Z",
     "shell.execute_reply": "2025-10-14T08:42:23.481769Z"
    }
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>            <script src=\"https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.5/MathJax.js?config=TeX-AMS-MML_SVG\"></script><script type=\"text/javascript\">if (window.MathJax && window.MathJax.Hub && window.MathJax.Hub.Config) {window.MathJax.Hub.Config({SVG: {font: \"STIX-Web\"}});}</script>                <script type=\"text/javascript\">window.PlotlyConfig = {MathJaxConfig: 'local'};</script>\n",
       "        <script charset=\"utf-8\" src=\"https://cdn.plot.ly/plotly-2.35.2.min.js\"></script>                <div id=\"a9b43cfd-bcf4-4912-b7f1-678b6c585c49\" class=\"plotly-graph-div\" style=\"height:500px; width:600px;\"></div>            <script type=\"text/javascript\">                                    window.PLOTLYENV=window.PLOTLYENV || {};                                    if (document.getElementById(\"a9b43cfd-bcf4-4912-b7f1-678b6c585c49\")) {                    Plotly.newPlot(                        \"a9b43cfd-bcf4-4912-b7f1-678b6c585c49\",                        [{\"hovertemplate\":\"tool=binette\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"binette\",\"line\":{\"color\":\"#636efa\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"binette\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14],\"xaxis\":\"x\",\"y\":[99.8,99.48,95.55999999999999,93.19000000000001,90.43,87.08,80.74000000000001,69.97999999999999,66.44,60.91,56.57,49.86,39.86,33.44,24.89],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=maxbin2\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"maxbin2\",\"line\":{\"color\":\"#EF553B\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"maxbin2\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18],\"xaxis\":\"x\",\"y\":[94.2,84.66,56.42,43.39,41.5,37.05,36.080000000000005,33.6,26.979999999999997,25.700000000000003,24.939999999999998,23.58,23.46,21.88,18.54,12.540000000000001,9.809999999999999,4.689999999999991,-2.0900000000000034],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=concoct\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"concoct\",\"line\":{\"color\":\"#00cc96\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"concoct\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64],\"xaxis\":\"x\",\"y\":[99.26,99.0,92.18,84.37,83.17999999999999,75.02,65.10000000000001,35.690000000000005,31.45,27.349999999999994,25.029999999999998,24.82,24.720000000000002,23.4,22.85,21.36,19.63,17.909999999999997,16.72,16.6,15.05,12.89,9.879999999999999,9.37,8.149999999999999,6.98,6.6,6.42,6.4,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.36,6.05,5.66,5.63,5.47,5.09,5.03,4.31,3.86,3.74,3.69,3.66,2.73,2.62,2.53,2.48,2.4],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=semibin2\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"semibin2\",\"line\":{\"color\":\"#ab63fa\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"semibin2\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25],\"xaxis\":\"x\",\"y\":[99.78,99.39999999999999,92.58,91.62,80.74000000000001,80.19000000000001,79.31,69.56,51.2,49.449999999999996,44.52,36.29,32.459999999999994,14.0,10.73,10.51,8.98,8.79,8.25,8.0,5.970000000000001,5.76,5.2700000000000005,5.12,4.65,3.46],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=metabat2\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"metabat2\",\"line\":{\"color\":\"#FFA15A\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"metabat2\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13],\"xaxis\":\"x\",\"y\":[99.46,92.83,91.66,81.35000000000001,80.34,76.74000000000001,76.38,59.550000000000004,50.34,43.27,43.220000000000006,25.41,20.08999999999999,7.0],\"yaxis\":\"y\",\"type\":\"scatter\"}],                        {\"template\":{\"data\":{\"histogram2dcontour\":[{\"type\":\"histogram2dcontour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"choropleth\":[{\"type\":\"choropleth\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"histogram2d\":[{\"type\":\"histogram2d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmap\":[{\"type\":\"heatmap\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmapgl\":[{\"type\":\"heatmapgl\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"contourcarpet\":[{\"type\":\"contourcarpet\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"contour\":[{\"type\":\"contour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"surface\":[{\"type\":\"surface\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"mesh3d\":[{\"type\":\"mesh3d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"scatter\":[{\"fillpattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2},\"type\":\"scatter\"}],\"parcoords\":[{\"type\":\"parcoords\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolargl\":[{\"type\":\"scatterpolargl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"bar\":[{\"error_x\":{\"color\":\"#2a3f5f\"},\"error_y\":{\"color\":\"#2a3f5f\"},\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"bar\"}],\"scattergeo\":[{\"type\":\"scattergeo\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolar\":[{\"type\":\"scatterpolar\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"histogram\":[{\"marker\":{\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"histogram\"}],\"scattergl\":[{\"type\":\"scattergl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatter3d\":[{\"type\":\"scatter3d\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattermapbox\":[{\"type\":\"scattermapbox\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterternary\":[{\"type\":\"scatterternary\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattercarpet\":[{\"type\":\"scattercarpet\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"carpet\":[{\"aaxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"baxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"type\":\"carpet\"}],\"table\":[{\"cells\":{\"fill\":{\"color\":\"#EBF0F8\"},\"line\":{\"color\":\"white\"}},\"header\":{\"fill\":{\"color\":\"#C8D4E3\"},\"line\":{\"color\":\"white\"}},\"type\":\"table\"}],\"barpolar\":[{\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"barpolar\"}],\"pie\":[{\"automargin\":true,\"type\":\"pie\"}]},\"layout\":{\"autotypenumbers\":\"strict\",\"colorway\":[\"#636efa\",\"#EF553B\",\"#00cc96\",\"#ab63fa\",\"#FFA15A\",\"#19d3f3\",\"#FF6692\",\"#B6E880\",\"#FF97FF\",\"#FECB52\"],\"font\":{\"color\":\"#2a3f5f\"},\"hovermode\":\"closest\",\"hoverlabel\":{\"align\":\"left\"},\"paper_bgcolor\":\"white\",\"plot_bgcolor\":\"#E5ECF6\",\"polar\":{\"bgcolor\":\"#E5ECF6\",\"angularaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"radialaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"ternary\":{\"bgcolor\":\"#E5ECF6\",\"aaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"baxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"caxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"coloraxis\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"colorscale\":{\"sequential\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"sequentialminus\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"diverging\":[[0,\"#8e0152\"],[0.1,\"#c51b7d\"],[0.2,\"#de77ae\"],[0.3,\"#f1b6da\"],[0.4,\"#fde0ef\"],[0.5,\"#f7f7f7\"],[0.6,\"#e6f5d0\"],[0.7,\"#b8e186\"],[0.8,\"#7fbc41\"],[0.9,\"#4d9221\"],[1,\"#276419\"]]},\"xaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"yaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"scene\":{\"xaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"yaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"zaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2}},\"shapedefaults\":{\"line\":{\"color\":\"#2a3f5f\"}},\"annotationdefaults\":{\"arrowcolor\":\"#2a3f5f\",\"arrowhead\":0,\"arrowwidth\":1},\"geo\":{\"bgcolor\":\"white\",\"landcolor\":\"#E5ECF6\",\"subunitcolor\":\"white\",\"showland\":true,\"showlakes\":true,\"lakecolor\":\"white\"},\"title\":{\"x\":0.05},\"mapbox\":{\"style\":\"light\"}}},\"xaxis\":{\"anchor\":\"y\",\"domain\":[0.0,1.0],\"title\":{\"text\":\"index\"}},\"yaxis\":{\"anchor\":\"x\",\"domain\":[0.0,1.0],\"title\":{\"text\":\"completeness - 2*contamination\"}},\"legend\":{\"title\":{\"text\":\"tool\"},\"tracegroupgap\":0},\"margin\":{\"t\":60},\"width\":600,\"height\":500},                        {\"responsive\": true}                    )                };                            </script>        </div>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# Calculate the score for each bin\n",
    "df_bins['completeness - 2*contamination'] = df_bins['completeness'] - 2 * df_bins['contamination']\n",
    "\n",
    "# Plot the score against the bin index\n",
    "fig = px.line(df_bins, x=\"index\", y='completeness - 2*contamination', color=\"tool\", markers=True)\n",
    "fig.update_layout(width=600, height=500)\n",
    "fig.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "97aee4d0",
   "metadata": {},
   "source": [
    "From the plot, you might notice that Concoct has a lot of bins with lower quality scores. Let’s zoom in to get a better look:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "id": "063974f6",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:23.483831Z",
     "iopub.status.busy": "2025-10-14T08:42:23.483667Z",
     "iopub.status.idle": "2025-10-14T08:42:23.490478Z",
     "shell.execute_reply": "2025-10-14T08:42:23.490114Z"
    }
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>            <script src=\"https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.5/MathJax.js?config=TeX-AMS-MML_SVG\"></script><script type=\"text/javascript\">if (window.MathJax && window.MathJax.Hub && window.MathJax.Hub.Config) {window.MathJax.Hub.Config({SVG: {font: \"STIX-Web\"}});}</script>                <script type=\"text/javascript\">window.PlotlyConfig = {MathJaxConfig: 'local'};</script>\n",
       "        <script charset=\"utf-8\" src=\"https://cdn.plot.ly/plotly-2.35.2.min.js\"></script>                <div id=\"2d9c1084-3bb0-45f3-a7f3-b3d9cd776275\" class=\"plotly-graph-div\" style=\"height:500px; width:600px;\"></div>            <script type=\"text/javascript\">                                    window.PLOTLYENV=window.PLOTLYENV || {};                                    if (document.getElementById(\"2d9c1084-3bb0-45f3-a7f3-b3d9cd776275\")) {                    Plotly.newPlot(                        \"2d9c1084-3bb0-45f3-a7f3-b3d9cd776275\",                        [{\"hovertemplate\":\"tool=binette\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"binette\",\"line\":{\"color\":\"#636efa\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"binette\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14],\"xaxis\":\"x\",\"y\":[99.8,99.48,95.55999999999999,93.19000000000001,90.43,87.08,80.74000000000001,69.97999999999999,66.44,60.91,56.57,49.86,39.86,33.44,24.89],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=maxbin2\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"maxbin2\",\"line\":{\"color\":\"#EF553B\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"maxbin2\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18],\"xaxis\":\"x\",\"y\":[94.2,84.66,56.42,43.39,41.5,37.05,36.080000000000005,33.6,26.979999999999997,25.700000000000003,24.939999999999998,23.58,23.46,21.88,18.54,12.540000000000001,9.809999999999999,4.689999999999991,-2.0900000000000034],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=concoct\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"concoct\",\"line\":{\"color\":\"#00cc96\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"concoct\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64],\"xaxis\":\"x\",\"y\":[99.26,99.0,92.18,84.37,83.17999999999999,75.02,65.10000000000001,35.690000000000005,31.45,27.349999999999994,25.029999999999998,24.82,24.720000000000002,23.4,22.85,21.36,19.63,17.909999999999997,16.72,16.6,15.05,12.89,9.879999999999999,9.37,8.149999999999999,6.98,6.6,6.42,6.4,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.38,6.36,6.05,5.66,5.63,5.47,5.09,5.03,4.31,3.86,3.74,3.69,3.66,2.73,2.62,2.53,2.48,2.4],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=semibin2\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"semibin2\",\"line\":{\"color\":\"#ab63fa\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"semibin2\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25],\"xaxis\":\"x\",\"y\":[99.78,99.39999999999999,92.58,91.62,80.74000000000001,80.19000000000001,79.31,69.56,51.2,49.449999999999996,44.52,36.29,32.459999999999994,14.0,10.73,10.51,8.98,8.79,8.25,8.0,5.970000000000001,5.76,5.2700000000000005,5.12,4.65,3.46],\"yaxis\":\"y\",\"type\":\"scatter\"},{\"hovertemplate\":\"tool=metabat2\\u003cbr\\u003eindex=%{x}\\u003cbr\\u003ecompleteness - 2*contamination=%{y}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"metabat2\",\"line\":{\"color\":\"#FFA15A\",\"dash\":\"solid\"},\"marker\":{\"symbol\":\"circle\"},\"mode\":\"lines+markers\",\"name\":\"metabat2\",\"orientation\":\"v\",\"showlegend\":true,\"x\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13],\"xaxis\":\"x\",\"y\":[99.46,92.83,91.66,81.35000000000001,80.34,76.74000000000001,76.38,59.550000000000004,50.34,43.27,43.220000000000006,25.41,20.08999999999999,7.0],\"yaxis\":\"y\",\"type\":\"scatter\"}],                        {\"template\":{\"data\":{\"histogram2dcontour\":[{\"type\":\"histogram2dcontour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"choropleth\":[{\"type\":\"choropleth\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"histogram2d\":[{\"type\":\"histogram2d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmap\":[{\"type\":\"heatmap\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmapgl\":[{\"type\":\"heatmapgl\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"contourcarpet\":[{\"type\":\"contourcarpet\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"contour\":[{\"type\":\"contour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"surface\":[{\"type\":\"surface\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"mesh3d\":[{\"type\":\"mesh3d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"scatter\":[{\"fillpattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2},\"type\":\"scatter\"}],\"parcoords\":[{\"type\":\"parcoords\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolargl\":[{\"type\":\"scatterpolargl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"bar\":[{\"error_x\":{\"color\":\"#2a3f5f\"},\"error_y\":{\"color\":\"#2a3f5f\"},\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"bar\"}],\"scattergeo\":[{\"type\":\"scattergeo\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolar\":[{\"type\":\"scatterpolar\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"histogram\":[{\"marker\":{\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"histogram\"}],\"scattergl\":[{\"type\":\"scattergl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatter3d\":[{\"type\":\"scatter3d\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattermapbox\":[{\"type\":\"scattermapbox\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterternary\":[{\"type\":\"scatterternary\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattercarpet\":[{\"type\":\"scattercarpet\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"carpet\":[{\"aaxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"baxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"type\":\"carpet\"}],\"table\":[{\"cells\":{\"fill\":{\"color\":\"#EBF0F8\"},\"line\":{\"color\":\"white\"}},\"header\":{\"fill\":{\"color\":\"#C8D4E3\"},\"line\":{\"color\":\"white\"}},\"type\":\"table\"}],\"barpolar\":[{\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"barpolar\"}],\"pie\":[{\"automargin\":true,\"type\":\"pie\"}]},\"layout\":{\"autotypenumbers\":\"strict\",\"colorway\":[\"#636efa\",\"#EF553B\",\"#00cc96\",\"#ab63fa\",\"#FFA15A\",\"#19d3f3\",\"#FF6692\",\"#B6E880\",\"#FF97FF\",\"#FECB52\"],\"font\":{\"color\":\"#2a3f5f\"},\"hovermode\":\"closest\",\"hoverlabel\":{\"align\":\"left\"},\"paper_bgcolor\":\"white\",\"plot_bgcolor\":\"#E5ECF6\",\"polar\":{\"bgcolor\":\"#E5ECF6\",\"angularaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"radialaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"ternary\":{\"bgcolor\":\"#E5ECF6\",\"aaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"baxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"caxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"coloraxis\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"colorscale\":{\"sequential\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"sequentialminus\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"diverging\":[[0,\"#8e0152\"],[0.1,\"#c51b7d\"],[0.2,\"#de77ae\"],[0.3,\"#f1b6da\"],[0.4,\"#fde0ef\"],[0.5,\"#f7f7f7\"],[0.6,\"#e6f5d0\"],[0.7,\"#b8e186\"],[0.8,\"#7fbc41\"],[0.9,\"#4d9221\"],[1,\"#276419\"]]},\"xaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"yaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"scene\":{\"xaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"yaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"zaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2}},\"shapedefaults\":{\"line\":{\"color\":\"#2a3f5f\"}},\"annotationdefaults\":{\"arrowcolor\":\"#2a3f5f\",\"arrowhead\":0,\"arrowwidth\":1},\"geo\":{\"bgcolor\":\"white\",\"landcolor\":\"#E5ECF6\",\"subunitcolor\":\"white\",\"showland\":true,\"showlakes\":true,\"lakecolor\":\"white\"},\"title\":{\"x\":0.05},\"mapbox\":{\"style\":\"light\"}}},\"xaxis\":{\"anchor\":\"y\",\"domain\":[0.0,1.0],\"title\":{\"text\":\"index\"},\"range\":[-1,20]},\"yaxis\":{\"anchor\":\"x\",\"domain\":[0.0,1.0],\"title\":{\"text\":\"completeness - 2*contamination\"},\"range\":[0,100]},\"legend\":{\"title\":{\"text\":\"tool\"},\"tracegroupgap\":0},\"margin\":{\"t\":60},\"width\":600,\"height\":500},                        {\"responsive\": true}                    )                };                            </script>        </div>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# Adjust the plot view to zoom in\n",
    "fig.update_layout(\n",
    "    xaxis_range=[-1, 20],  # Zoom on x-axis\n",
    "    yaxis_range=[0, 100],  # Zoom on y-axis\n",
    "    width=600,\n",
    "    height=500\n",
    ")\n",
    "fig.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "136b17e4",
   "metadata": {},
   "source": [
    "Binette line consistently appears above the other binning tools. This indicates that Binette produce higher-quality bins compared to the initial bin sets."
   ]
  },
  {
   "cell_type": "markdown",
   "id": "46f1b3d0",
   "metadata": {},
   "source": [
    "### Plot Number of High-Quality Bins per Bin Set\n",
    "\n",
    "Let's plot the number of bins falling into different quality categories. We’ll focus on bins with a maximum of 10% contamination and classify them into three completeness categories:\n",
    "\n",
    "- **`> 50% and ≤ 70%`**\n",
    "- **`> 70% and ≤ 90%`**\n",
    "- **`> 90%`**\n",
    "\n",
    "First, let’s group and count the bins in each category:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "id": "943f88b4",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:23.492146Z",
     "iopub.status.busy": "2025-10-14T08:42:23.492003Z",
     "iopub.status.idle": "2025-10-14T08:42:23.506310Z",
     "shell.execute_reply": "2025-10-14T08:42:23.505928Z"
    }
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Contamination ≤ 10 and&lt;br&gt;Completeness</th>\n",
       "      <th>tool</th>\n",
       "      <th>bin_count</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>&gt; 50% and ≤ 70%</td>\n",
       "      <td>binette</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>&gt; 50% and ≤ 70%</td>\n",
       "      <td>maxbin2</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>&gt; 50% and ≤ 70%</td>\n",
       "      <td>metabat2</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>&gt; 50% and ≤ 70%</td>\n",
       "      <td>semibin2</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>&gt; 70% and ≤ 90%</td>\n",
       "      <td>binette</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>&gt; 70% and ≤ 90%</td>\n",
       "      <td>concoct</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>&gt; 70% and ≤ 90%</td>\n",
       "      <td>metabat2</td>\n",
       "      <td>5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7</th>\n",
       "      <td>&gt; 70% and ≤ 90%</td>\n",
       "      <td>semibin2</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>8</th>\n",
       "      <td>&gt; 90%</td>\n",
       "      <td>binette</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9</th>\n",
       "      <td>&gt; 90%</td>\n",
       "      <td>concoct</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>&gt; 90%</td>\n",
       "      <td>maxbin2</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>11</th>\n",
       "      <td>&gt; 90%</td>\n",
       "      <td>metabat2</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>12</th>\n",
       "      <td>&gt; 90%</td>\n",
       "      <td>semibin2</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   Contamination ≤ 10 and<br>Completeness      tool  bin_count\n",
       "0                         > 50% and ≤ 70%   binette          3\n",
       "1                         > 50% and ≤ 70%   maxbin2          2\n",
       "2                         > 50% and ≤ 70%  metabat2          1\n",
       "3                         > 50% and ≤ 70%  semibin2          2\n",
       "4                         > 70% and ≤ 90%   binette          3\n",
       "5                         > 70% and ≤ 90%   concoct          2\n",
       "6                         > 70% and ≤ 90%  metabat2          5\n",
       "7                         > 70% and ≤ 90%  semibin2          4\n",
       "8                                   > 90%   binette          6\n",
       "9                                   > 90%   concoct          4\n",
       "10                                  > 90%   maxbin2          2\n",
       "11                                  > 90%  metabat2          3\n",
       "12                                  > 90%  semibin2          4"
      ]
     },
     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Define the contamination cutoff\n",
    "contamination_cutoff = 10\n",
    "\n",
    "# Create filters for completeness categories\n",
    "low_contamination_filt = df_bins['contamination'] <= contamination_cutoff\n",
    "high_completeness_filt = df_bins['completeness'] > 90\n",
    "medium_completeness_filt = df_bins['completeness'] > 70\n",
    "low_completeness_filt = df_bins['completeness'] > 50\n",
    "\n",
    "# Define quality categories\n",
    "quality  = f'Contamination ≤ {contamination_cutoff} and<br>Completeness'\n",
    "df_bins.loc[low_contamination_filt & low_completeness_filt, quality] =  '> 50% and ≤ 70%'\n",
    "df_bins.loc[low_contamination_filt & medium_completeness_filt, quality] =  '> 70% and ≤ 90%'\n",
    "df_bins.loc[low_contamination_filt & high_completeness_filt, quality] = '> 90%'\n",
    "\n",
    "# Group and count bins by quality category and tool\n",
    "df_bins_quality_grouped = df_bins.groupby([quality, 'tool']).agg(bin_count=('index', 'count')).reset_index()\n",
    "df_bins_quality_grouped"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "6eec391a",
   "metadata": {},
   "source": [
    "Now, let’s create a bar plot to visualize the number of bins in each quality category for each bin sets:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "id": "36ce51ac",
   "metadata": {
    "execution": {
     "iopub.execute_input": "2025-10-14T08:42:23.507763Z",
     "iopub.status.busy": "2025-10-14T08:42:23.507583Z",
     "iopub.status.idle": "2025-10-14T08:42:23.620352Z",
     "shell.execute_reply": "2025-10-14T08:42:23.619829Z"
    }
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>            <script src=\"https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.5/MathJax.js?config=TeX-AMS-MML_SVG\"></script><script type=\"text/javascript\">if (window.MathJax && window.MathJax.Hub && window.MathJax.Hub.Config) {window.MathJax.Hub.Config({SVG: {font: \"STIX-Web\"}});}</script>                <script type=\"text/javascript\">window.PlotlyConfig = {MathJaxConfig: 'local'};</script>\n",
       "        <script charset=\"utf-8\" src=\"https://cdn.plot.ly/plotly-2.35.2.min.js\"></script>                <div id=\"9d58458f-df52-431b-b1de-3b03815ab83b\" class=\"plotly-graph-div\" style=\"height:500px; width:600px;\"></div>            <script type=\"text/javascript\">                                    window.PLOTLYENV=window.PLOTLYENV || {};                                    if (document.getElementById(\"9d58458f-df52-431b-b1de-3b03815ab83b\")) {                    Plotly.newPlot(                        \"9d58458f-df52-431b-b1de-3b03815ab83b\",                        [{\"alignmentgroup\":\"True\",\"hovertemplate\":\"Contamination \\u2264 10 and\\u003cbr\\u003eCompleteness=\\u003e 50% and \\u2264 70%\\u003cbr\\u003etool=%{x}\\u003cbr\\u003ebin_count=%{text}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"\\u003e 50% and \\u2264 70%\",\"marker\":{\"color\":\"rgb(225, 124, 5)\",\"opacity\":0.9,\"pattern\":{\"shape\":\"\"}},\"name\":\"\\u003e 50% and \\u2264 70%\",\"offsetgroup\":\"\\u003e 50% and \\u2264 70%\",\"orientation\":\"v\",\"showlegend\":true,\"text\":[3.0,2.0,1.0,2.0],\"textposition\":\"auto\",\"x\":[\"binette\",\"maxbin2\",\"metabat2\",\"semibin2\"],\"xaxis\":\"x\",\"y\":[3,2,1,2],\"yaxis\":\"y\",\"type\":\"bar\"},{\"alignmentgroup\":\"True\",\"hovertemplate\":\"Contamination \\u2264 10 and\\u003cbr\\u003eCompleteness=\\u003e 70% and \\u2264 90%\\u003cbr\\u003etool=%{x}\\u003cbr\\u003ebin_count=%{text}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"\\u003e 70% and \\u2264 90%\",\"marker\":{\"color\":\"rgb(56, 166, 165)\",\"opacity\":0.9,\"pattern\":{\"shape\":\"\"}},\"name\":\"\\u003e 70% and \\u2264 90%\",\"offsetgroup\":\"\\u003e 70% and \\u2264 90%\",\"orientation\":\"v\",\"showlegend\":true,\"text\":[3.0,2.0,5.0,4.0],\"textposition\":\"auto\",\"x\":[\"binette\",\"concoct\",\"metabat2\",\"semibin2\"],\"xaxis\":\"x\",\"y\":[3,2,5,4],\"yaxis\":\"y\",\"type\":\"bar\"},{\"alignmentgroup\":\"True\",\"hovertemplate\":\"Contamination \\u2264 10 and\\u003cbr\\u003eCompleteness=\\u003e 90%\\u003cbr\\u003etool=%{x}\\u003cbr\\u003ebin_count=%{text}\\u003cextra\\u003e\\u003c\\u002fextra\\u003e\",\"legendgroup\":\"\\u003e 90%\",\"marker\":{\"color\":\"rgb(115, 175, 72)\",\"opacity\":0.9,\"pattern\":{\"shape\":\"\"}},\"name\":\"\\u003e 90%\",\"offsetgroup\":\"\\u003e 90%\",\"orientation\":\"v\",\"showlegend\":true,\"text\":[6.0,4.0,2.0,3.0,4.0],\"textposition\":\"auto\",\"x\":[\"binette\",\"concoct\",\"maxbin2\",\"metabat2\",\"semibin2\"],\"xaxis\":\"x\",\"y\":[6,4,2,3,4],\"yaxis\":\"y\",\"type\":\"bar\"}],                        {\"template\":{\"data\":{\"histogram2dcontour\":[{\"type\":\"histogram2dcontour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"choropleth\":[{\"type\":\"choropleth\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"histogram2d\":[{\"type\":\"histogram2d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmap\":[{\"type\":\"heatmap\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"heatmapgl\":[{\"type\":\"heatmapgl\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"contourcarpet\":[{\"type\":\"contourcarpet\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"contour\":[{\"type\":\"contour\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"surface\":[{\"type\":\"surface\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"},\"colorscale\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]]}],\"mesh3d\":[{\"type\":\"mesh3d\",\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}],\"scatter\":[{\"fillpattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2},\"type\":\"scatter\"}],\"parcoords\":[{\"type\":\"parcoords\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolargl\":[{\"type\":\"scatterpolargl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"bar\":[{\"error_x\":{\"color\":\"#2a3f5f\"},\"error_y\":{\"color\":\"#2a3f5f\"},\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"bar\"}],\"scattergeo\":[{\"type\":\"scattergeo\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterpolar\":[{\"type\":\"scatterpolar\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"histogram\":[{\"marker\":{\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"histogram\"}],\"scattergl\":[{\"type\":\"scattergl\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatter3d\":[{\"type\":\"scatter3d\",\"line\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattermapbox\":[{\"type\":\"scattermapbox\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scatterternary\":[{\"type\":\"scatterternary\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"scattercarpet\":[{\"type\":\"scattercarpet\",\"marker\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}}}],\"carpet\":[{\"aaxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"baxis\":{\"endlinecolor\":\"#2a3f5f\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"minorgridcolor\":\"white\",\"startlinecolor\":\"#2a3f5f\"},\"type\":\"carpet\"}],\"table\":[{\"cells\":{\"fill\":{\"color\":\"#EBF0F8\"},\"line\":{\"color\":\"white\"}},\"header\":{\"fill\":{\"color\":\"#C8D4E3\"},\"line\":{\"color\":\"white\"}},\"type\":\"table\"}],\"barpolar\":[{\"marker\":{\"line\":{\"color\":\"#E5ECF6\",\"width\":0.5},\"pattern\":{\"fillmode\":\"overlay\",\"size\":10,\"solidity\":0.2}},\"type\":\"barpolar\"}],\"pie\":[{\"automargin\":true,\"type\":\"pie\"}]},\"layout\":{\"autotypenumbers\":\"strict\",\"colorway\":[\"#636efa\",\"#EF553B\",\"#00cc96\",\"#ab63fa\",\"#FFA15A\",\"#19d3f3\",\"#FF6692\",\"#B6E880\",\"#FF97FF\",\"#FECB52\"],\"font\":{\"color\":\"#2a3f5f\"},\"hovermode\":\"closest\",\"hoverlabel\":{\"align\":\"left\"},\"paper_bgcolor\":\"white\",\"plot_bgcolor\":\"#E5ECF6\",\"polar\":{\"bgcolor\":\"#E5ECF6\",\"angularaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"radialaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"ternary\":{\"bgcolor\":\"#E5ECF6\",\"aaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"baxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"},\"caxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\"}},\"coloraxis\":{\"colorbar\":{\"outlinewidth\":0,\"ticks\":\"\"}},\"colorscale\":{\"sequential\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"sequentialminus\":[[0.0,\"#0d0887\"],[0.1111111111111111,\"#46039f\"],[0.2222222222222222,\"#7201a8\"],[0.3333333333333333,\"#9c179e\"],[0.4444444444444444,\"#bd3786\"],[0.5555555555555556,\"#d8576b\"],[0.6666666666666666,\"#ed7953\"],[0.7777777777777778,\"#fb9f3a\"],[0.8888888888888888,\"#fdca26\"],[1.0,\"#f0f921\"]],\"diverging\":[[0,\"#8e0152\"],[0.1,\"#c51b7d\"],[0.2,\"#de77ae\"],[0.3,\"#f1b6da\"],[0.4,\"#fde0ef\"],[0.5,\"#f7f7f7\"],[0.6,\"#e6f5d0\"],[0.7,\"#b8e186\"],[0.8,\"#7fbc41\"],[0.9,\"#4d9221\"],[1,\"#276419\"]]},\"xaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"yaxis\":{\"gridcolor\":\"white\",\"linecolor\":\"white\",\"ticks\":\"\",\"title\":{\"standoff\":15},\"zerolinecolor\":\"white\",\"automargin\":true,\"zerolinewidth\":2},\"scene\":{\"xaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"yaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2},\"zaxis\":{\"backgroundcolor\":\"#E5ECF6\",\"gridcolor\":\"white\",\"linecolor\":\"white\",\"showbackground\":true,\"ticks\":\"\",\"zerolinecolor\":\"white\",\"gridwidth\":2}},\"shapedefaults\":{\"line\":{\"color\":\"#2a3f5f\"}},\"annotationdefaults\":{\"arrowcolor\":\"#2a3f5f\",\"arrowhead\":0,\"arrowwidth\":1},\"geo\":{\"bgcolor\":\"white\",\"landcolor\":\"#E5ECF6\",\"subunitcolor\":\"white\",\"showland\":true,\"showlakes\":true,\"lakecolor\":\"white\"},\"title\":{\"x\":0.05},\"mapbox\":{\"style\":\"light\"}}},\"xaxis\":{\"anchor\":\"y\",\"domain\":[0.0,1.0],\"title\":{\"text\":\"tool\"},\"categoryorder\":\"array\",\"categoryarray\":[\"binette\",\"semibin2\",\"concoct\",\"metabat2\",\"maxbin2\"]},\"yaxis\":{\"anchor\":\"x\",\"domain\":[0.0,1.0],\"title\":{\"text\":\"bin_count\"}},\"legend\":{\"title\":{\"text\":\"Contamination \\u2264 10 and\\u003cbr\\u003eCompleteness\"},\"tracegroupgap\":0,\"traceorder\":\"reversed\"},\"margin\":{\"t\":60},\"barmode\":\"stack\",\"width\":600,\"height\":500},                        {\"responsive\": true}                    )                };                            </script>        </div>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# Define colors for each completeness category\n",
    "color_discrete_map = {\n",
    "    \"> 90%\": px.colors.qualitative.Prism[4],\n",
    "    \"> 70% and ≤ 90%\": px.colors.qualitative.Prism[2],\n",
    "    \"> 50% and ≤ 70%\": px.colors.qualitative.Prism[6]\n",
    "}\n",
    "\n",
    "# Create the bar plot\n",
    "fig = px.bar(\n",
    "    df_bins_quality_grouped, \n",
    "    x='tool', \n",
    "    y=\"bin_count\", \n",
    "    color=quality,\n",
    "    barmode='stack', \n",
    "    color_discrete_map=color_discrete_map, \n",
    "    text=\"bin_count\",\n",
    "    category_orders={\"tool\": [\"binette\", \"semibin2\", \"concoct\", \"metabat2\", \"maxbin2\"]},\n",
    "    opacity=0.9\n",
    ")\n",
    "\n",
    "# Update layout for better appearance\n",
    "fig.update_layout(\n",
    "    width=600,\n",
    "    height=500,\n",
    "    legend=dict(traceorder=\"reversed\")\n",
    ")\n",
    "\n",
    "fig.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "f78d0f29",
   "metadata": {},
   "source": [
    "From the plot, you can see that Binette produces more high-quality bins compared to the initial bin sets! 🎉"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.12.11"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}