menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Graph-Depe...
source image

Arxiv

1d

read

56

img
dot

Image Credit: Arxiv

Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference

  • Study examines multi-armed bandits with network interference affecting rewards based on local graph structure.
  • Proposed algorithm leverages graph characteristics to reduce regret in exponentially large action spaces.
  • A graph-dependent upper bound on cumulative regret is achieved, surpassing previous research.
  • Lower bounds for bandits with diverse network interference types are established using graph properties.
  • Algorithm's optimality is demonstrated for dense and sparse graphs with near-optimal performance.
  • In cases of unknown interference graph, algorithm variant is Pareto optimal, leading in all scenarios.
  • Theoretical findings are supported by numerical experiments, illustrating superior performance over standard methods.

Read Full Article

like

3 Likes

For uninterrupted reading, download the app