Works matching DE "MULTI-armed bandit problem (Probability theory)"
Results: 70
Asymptotically optimal algorithms for budgeted multiple play bandits.
- Published in:
- Machine Learning, 2019, v. 108, n. 11, p. 1919, doi. 10.1007/s10994-019-05799-x
- Publication type:
- Article
QoS-Based Blind Spectrum Selection with Multi-armed Bandit Problem in Cognitive Radio Networks.
- Published in:
- Wireless Personal Communications, 2016, v. 89, n. 2, p. 663, doi. 10.1007/s11277-016-3301-1
- Publication type:
- Article
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems.
- Published in:
- Mathematics of Operations Research, 2019, v. 44, n. 2, p. 377, doi. 10.1287/moor.2017.0928
- Publication type:
- Article
Design-Based Estimators for Average Treatment Effects for Multi-Armed RCTs.
- Published in:
- Journal of Educational & Behavioral Statistics, 2018, v. 43, n. 5, p. 568, doi. 10.3102/1076998618786968
- Publication type:
- Article
Editorial.
- Published in:
- 2024
- Publication type:
- Editorial
Sliding-Window Thompson Sampling for Non-Stationary Settings.
- Published in:
- Journal of Artificial Intelligence Research, 2020, v. 68, p. 311, doi. 10.1613/jair.1.11407
- Publication type:
- Article
Rate-Optimal Bayesian Simple Regret in Best Arm Identification.
- Published in:
- Mathematics of Operations Research, 2024, v. 49, n. 3, p. 1629, doi. 10.1287/moor.2022.0011
- Publication type:
- Article
Multiplayer Bandits Without Observing Collision Information.
- Published in:
- Mathematics of Operations Research, 2022, v. 47, n. 2, p. 1247, doi. 10.1287/moor.2021.1168
- Publication type:
- Article
Online Causal Inference for Advertising in Real-Time Bidding Auctions.
- Published in:
- Marketing Science, 2025, v. 44, n. 1, p. 176, doi. 10.1287/mksc.2022.0406
- Publication type:
- Article
Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments.
- Published in:
- Marketing Science, 2017, v. 36, n. 4, p. 500, doi. 10.1287/mksc.2016.1023
- Publication type:
- Article
Distributed decision making policy for frequency band selection boosting RF energy harvesting rate in wireless sensor nodes.
- Published in:
- Wireless Networks (10220038), 2018, v. 24, n. 8, p. 3189, doi. 10.1007/s11276-017-1529-7
- Publication type:
- Article
Effects of Ventral Striatum Lesions on Stimulus-Based versus Action-Based Reinforcement Learning.
- Published in:
- Journal of Neuroscience, 2017, v. 37, n. 29, p. 6902, doi. 10.1523/JNEUROSCI.0631-17.2017
- Publication type:
- Article
ON THE ASYMPTOTIC OPTIMALITY OF GREEDY INDEX HEURISTICS FOR MULTI-ACTION RESTLESS BANDITS.
- Published in:
- Advances in Applied Probability, 2015, v. 47, n. 3, p. 652, doi. 10.1239/aap/1444308876
- Publication type:
- Article
Reinforcement learning in queues.
- Published in:
- Queueing Systems, 2022, v. 100, n. 3/4, p. 497, doi. 10.1007/s11134-022-09844-w
- Publication type:
- Article
Fast Demand Response Based on Model Predictive Control for a Net Zero Energy Building in Cold Climate.
- Published in:
- ASHRAE Transactions, 2024, v. 130, n. Part 2, p. 87
- Publication type:
- Article
Robust learning in expert networks: a comparative analysis.
- Published in:
- Journal of Intelligent Information Systems, 2018, v. 51, n. 2, p. 207, doi. 10.1007/s10844-018-0515-6
- Publication type:
- Article
Reinforcement Learning in Economics and Finance.
- Published in:
- Computational Economics, 2023, v. 62, n. 1, p. 425, doi. 10.1007/s10614-021-10119-4
- Publication type:
- Article
EXPLORATION–EXPLOITATION POLICIES WITH ALMOST SURE, ARBITRARILY SLOW GROWING ASYMPTOTIC REGRET.
- Published in:
- Probability in the Engineering & Informational Sciences, 2020, v. 34, n. 3, p. 406, doi. 10.1017/S0269964818000529
- Publication type:
- Article
ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS.
- Published in:
- Probability in the Engineering & Informational Sciences, 2017, v. 31, n. 2, p. 239, doi. 10.1017/S0269964816000279
- Publication type:
- Article
Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.
- Published in:
- Scientific Reports, 2022, v. 12, n. 1, p. 1, doi. 10.1038/s41598-022-12155-y
- Publication type:
- Article
An asymptotically optimal strategy for constrained multi-armed bandit problems.
- Published in:
- Mathematical Methods of Operations Research, 2020, v. 91, n. 3, p. 545, doi. 10.1007/s00186-019-00697-3
- Publication type:
- Article
Infomax Strategies for an Optimal Balance Between Exploration and Exploitation.
- Published in:
- Journal of Statistical Physics, 2016, v. 163, n. 6, p. 1454, doi. 10.1007/s10955-016-1521-0
- Publication type:
- Article
Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm.
- Published in:
- Entropy, 2021, v. 23, n. 3, p. 380, doi. 10.3390/e23030380
- Publication type:
- Article
On Gap-Based Lower Bounding Techniques for Best-Arm Identification.
- Published in:
- Entropy, 2020, v. 22, n. 7, p. 788, doi. 10.3390/e22070788
- Publication type:
- Article
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits: Supplementary Materials 2.
- Published in:
- Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
- Publication type:
- Article
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits: Supplementary Materials 1.
- Published in:
- Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
- Publication type:
- Article
An Analysis of the Value of Information When Exploring Stochastic, Discrete Multi-Armed Bandits.
- Published in:
- Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
- Publication type:
- Article
Technical Note—Online Matching with Bayesian Rewards.
- Published in:
- Operations Research, 2025, v. 73, n. 1, p. 278, doi. 10.1287/opre.2021.0499
- Publication type:
- Article
Smoothness-Adaptive Contextual Bandits.
- Published in:
- Operations Research, 2022, v. 70, n. 6, p. 3198, doi. 10.1287/opre.2021.2215
- Publication type:
- Article
Optimistic Gittins Indices.
- Published in:
- Operations Research, 2022, v. 70, n. 6, p. 3432, doi. 10.1287/opre.2021.2207
- Publication type:
- Article
Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality.
- Published in:
- Operations Research, 2022, v. 70, n. 5, p. 3015, doi. 10.1287/opre.2021.2181
- Publication type:
- Article
In This Issue.
- Published in:
- Operations Research, 2022, v. 70, n. 1, p. iii, doi. 10.1287/opre.2021.2252
- Publication type:
- Article
A Restless Bandit Model for Resource Allocation, Competition, and Reservation.
- Published in:
- Operations Research, 2022, v. 70, n. 1, p. 416, doi. 10.1287/opre.2020.2066
- Publication type:
- Article
Simple Bayesian Algorithms for Best-Arm Identification.
- Published in:
- Operations Research, 2020, v. 68, n. 6, p. 1625, doi. 10.1287/opre.2019.1911
- Publication type:
- Article
Bandits with Global Convex Constraints and Objective.
- Published in:
- Operations Research, 2019, v. 67, n. 5, p. 1486, doi. 10.1287/opre.2019.1840
- Publication type:
- Article
Robust Multiarmed Bandit Problems.
- Published in:
- Management Science, 2016, v. 62, n. 1, p. 264, doi. 10.1287/mnsc.2015.2153
- Publication type:
- Article
Decision maker based on atomic switches.
- Published in:
- AIMS Materials Science, 2015, v. 3, n. 1, p. 245, doi. 10.3934/matersci.2016.1.245
- Publication type:
- Article
False discovery rate control with e‐values.
- Published in:
- Journal of the Royal Statistical Society: Series B (Statistical Methodology), 2022, v. 84, n. 3, p. 822, doi. 10.1111/rssb.12489
- Publication type:
- Article
Optimizing Infill Drilling Decisions Using Multi-Armed Bandits: Application in a Long-Term, Multi-Element Stockpile.
- Published in:
- Mathematical Geosciences, 2018, v. 50, n. 1, p. 35, doi. 10.1007/s11004-017-9695-9
- Publication type:
- Article
An efficient learning framework for multiproduct inventory systems with customer choices.
- Published in:
- Production & Operations Management, 2022, v. 31, n. 6, p. 2492, doi. 10.1111/poms.13693
- Publication type:
- Article
Self-Assembly of Complex DNA Tessellations by Using Low-Symmetry Multi-arm DNA Tiles.
- Published in:
- Angewandte Chemie International Edition, 2016, v. 55, n. 31, p. 8860, doi. 10.1002/anie.201601944
- Publication type:
- Article
Customization of J. Bather's UCB Strategy for a Gaussian Multiarmed Bandit.
- Published in:
- Automation & Remote Control, 2022, v. 83, n. 11, p. 1857, doi. 10.1134/S00051179220110108
- Publication type:
- Article
Dynamic channel selection in wireless communications via a multi-armed bandit algorithm using laser chaos time series.
- Published in:
- Scientific Reports, 2020, v. 10, n. 1, p. 1, doi. 10.1038/s41598-020-58541-2
- Publication type:
- Article
A Cost-Sensitive Decision Tree Learning Algorithm Based on a Multi-Armed Bandit Framework.
- Published in:
- Computer Journal, 2017, v. 60, n. 7, p. 941, doi. 10.1093/comjnl/bxw015
- Publication type:
- Article
A Flexible Mechanism of Rule Selection Enables Rapid Feature-Based Reinforcement Learning.
- Published in:
- Frontiers in Neuroscience, 2016, p. 1, doi. 10.3389/fnins.2016.00125
- Publication type:
- Article
Power considerations for trials of two experimental arms versus a standard active control or placebo.
- Published in:
- 2016
- Publication type:
- Journal article
Interactive Restless Multi-armed Bandit Game and Swarm Intelligence Effect.
- Published in:
- New Generation Computing, 2016, v. 34, n. 3, p. 291, doi. 10.1007/s00354-016-0306-y
- Publication type:
- Article
PAC-Bayesian lifelong learning for multi-armed bandits.
- Published in:
- Data Mining & Knowledge Discovery, 2022, v. 36, n. 2, p. 841, doi. 10.1007/s10618-022-00825-4
- Publication type:
- Article
Contextual bandits with hidden contexts: a focused data capture from social media streams.
- Published in:
- Data Mining & Knowledge Discovery, 2019, v. 33, n. 6, p. 1853, doi. 10.1007/s10618-019-00648-w
- Publication type:
- Article
Editorial.
- Published in:
- Naval Research Logistics, 2023, v. 70, n. 5, p. 395, doi. 10.1002/nav.22138
- Publication type:
- Article