Multi-armed bandit problems

See below for a few resources.

  • R1: List of useful multi-armed bandit resources

  • R2: Tutorial - Reinforcement multi-armed bandit (python)

  • R3: Tutorial - Solving multi-armed bandit problems

  • R4: Building a recommender system using ML

  • R5: Nature article

  • R6: GitHub Code

  • R7: Introduction to Deep Q-Learning

Explore the applicability of multi-armed bandit problems for recommender systems.

  1. Could it be used as a strategy to decide which antimicrobial suggestions to show?

The data considered would be susceptibility test data. Once we know which of the available antimicrobials the microorganism is resistant to, we have to decide which of the remaining antimicrobials to display as recommendations. Without any patient-related information, this could be framed as “advertising” narrower-spectrum antimicrobials while exploring how often they are “clicked”.
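To make the “advertising” framing concrete, here is a minimal sketch of a Beta-Bernoulli Thompson sampling bandit in which each antimicrobial is an arm and a click is the reward. Everything in it is an assumption made for illustration: the class and function names, the arm labels, and the fixed click probabilities used in the simulation are hypothetical, and Thompson sampling is only one of several possible bandit strategies. The `eligible` argument stands in for the antimicrobials that the susceptibility test has not ruled out.

```python
import random


class ThompsonBandit:
    """Beta-Bernoulli Thompson sampling over a fixed set of arms (hypothetical sketch)."""

    def __init__(self, arms, seed=None):
        self.rng = random.Random(seed)
        # One Beta(1, 1) prior per arm, stored as [alpha, beta].
        self.params = {arm: [1, 1] for arm in arms}

    def select(self, eligible):
        """Sample a click rate for each eligible arm and return the arm with the largest sample."""
        eligible = list(eligible)
        if not eligible:
            raise ValueError("no eligible arms to choose from")
        samples = {arm: self.rng.betavariate(*self.params[arm]) for arm in eligible}
        return max(samples, key=samples.get)

    def update(self, arm, clicked):
        """Record one observation: a click increments alpha, no click increments beta."""
        self.params[arm][0 if clicked else 1] += 1


def simulate(click_rates, rounds=2000, seed=0):
    """Run the bandit against assumed, fixed click probabilities and count how often each arm is shown."""
    rng = random.Random(seed)
    bandit = ThompsonBandit(click_rates, seed=seed)
    shown = {arm: 0 for arm in click_rates}
    for _ in range(rounds):
        # In practice `eligible` would come from the susceptibility test result.
        arm = bandit.select(click_rates)
        shown[arm] += 1
        bandit.update(arm, rng.random() < click_rates[arm])
    return shown


if __name__ == "__main__":
    # Hypothetical arms and click probabilities, not real data.
    rates = {"narrow_a": 0.8, "narrow_b": 0.2, "broad_c": 0.1}
    print(simulate(rates))
```

Under these assumed click rates the bandit should end up showing the most-clicked arm far more often than the others, while still occasionally exploring the rest. Whether a click is an appropriate reward signal for antimicrobial recommendations is a separate question that this sketch does not address.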

  2. Recommender Systems

We have our CBR system, which measures similarities between patients. For example, by using an unsupervised (or self-supervised) approach with a neural network, we can compute similarity metrics over several feature groups (e.g. demographics, clinical, biomarkers, radiology, …). Based on these, we have to retrieve similar cases from the database and show them to the clinicians. Could we use the clinicians’ feedback to see which cases are being clicked and adapt the ones that are retrieved?

Shouldn’t we adapt the model?
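One way to picture the feedback loop is to leave the similarity model untouched and only re-rank what it retrieves. The sketch below blends cosine similarity with a smoothed click-through rate and adds a small epsilon-greedy exploration step. It is an illustration under stated assumptions, not a description of the existing CBR system: the class name, the `weight` and `epsilon` parameters, and the toy embeddings are all hypothetical.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


class FeedbackRetriever:
    """Re-ranks similar cases using click feedback (hypothetical sketch)."""

    def __init__(self, cases, k=2, weight=0.5, epsilon=0.1, seed=None):
        self.cases = cases        # {case_id: embedding}, assumed precomputed
        self.k = k                # number of cases shown per query
        self.weight = weight      # how much clicks count relative to similarity
        self.epsilon = epsilon    # probability of showing one exploratory case
        self.rng = random.Random(seed)
        self.shown = {cid: 0 for cid in cases}
        self.clicked = {cid: 0 for cid in cases}

    def score(self, query, cid):
        """Blend similarity with a Laplace-smoothed click-through rate."""
        sim = cosine(query, self.cases[cid])
        ctr = (self.clicked[cid] + 1) / (self.shown[cid] + 2)
        return (1 - self.weight) * sim + self.weight * ctr

    def retrieve(self, query):
        """Return the k cases to show and record that they were shown."""
        ranked = sorted(self.cases, key=lambda cid: self.score(query, cid), reverse=True)
        top, rest = ranked[:self.k], ranked[self.k:]
        # Occasionally replace the last slot with a lower-ranked case to explore.
        if top and rest and self.rng.random() < self.epsilon:
            top[-1] = self.rng.choice(rest)
        for cid in top:
            self.shown[cid] += 1
        return top

    def feedback(self, cid, clicked):
        """Record whether a shown case was clicked."""
        if clicked:
            self.clicked[cid] += 1


if __name__ == "__main__":
    # Toy embeddings standing in for the learned patient representations.
    cases = {"A": [1.0, 0.0], "B": [0.9, 0.1], "C": [0.0, 1.0]}
    retriever = FeedbackRetriever(cases, k=1, epsilon=0.0)
    first = retriever.retrieve([1.0, 0.0])
    retriever.feedback(first[0], clicked=False)
    print(first, retriever.retrieve([1.0, 0.0]))
```

In the toy run, case A is shown first because it is the closest match; once it has been shown and ignored, its smoothed click rate drops and the nearly identical case B overtakes it. The alternative raised above, adapting the model itself, would instead feed the click signal back into training the similarity network, which this sketch deliberately leaves out.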
