Works matching DE "MULTI-armed bandit problem (Probability theory)"
Results: 69
Sliding-Window Thompson Sampling for Non-Stationary Settings.
- Published in:
- Journal of Artificial Intelligence Research, 2020, v. 68, p. 311, doi. 10.1613/jair.1.11407
- By:
- Publication type:
- Article
False discovery rate control with e‐values.
- Published in:
- Journal of the Royal Statistical Society: Series B (Statistical Methodology), 2022, v. 84, n. 3, p. 822, doi. 10.1111/rssb.12489
- By:
- Publication type:
- Article
Optimizing Infill Drilling Decisions Using Multi-Armed Bandits: Application in a Long-Term, Multi-Element Stockpile.
- Published in:
- Mathematical Geosciences, 2018, v. 50, n. 1, p. 35, doi. 10.1007/s11004-017-9695-9
- By:
- Publication type:
- Article
Infomax Strategies for an Optimal Balance Between Exploration and Exploitation.
- Published in:
- Journal of Statistical Physics, 2016, v. 163, n. 6, p. 1454, doi. 10.1007/s10955-016-1521-0
- By:
- Publication type:
- Article
Design-Based Estimators for Average Treatment Effects for Multi-Armed RCTs.
- Published in:
- Journal of Educational & Behavioral Statistics, 2018, v. 43, n. 5, p. 568, doi. 10.3102/1076998618786968
- By:
- Publication type:
- Article
Effects of Ventral Striatum Lesions on Stimulus-Based versus Action-Based Reinforcement Learning.
- Published in:
- Journal of Neuroscience, 2017, v. 37, n. 29, p. 6902, doi. 10.1523/JNEUROSCI.0631-17.2017
- By:
- Publication type:
- Article
On the Hardness of Learning from Censored and Nonstationary Demand.
- Published in:
- INFORMS Journal on Optimization, 2024, v. 6, n. 2, p. 63, doi. 10.1287/ijoo.2022.0017
- By:
- Publication type:
- Article
Editorial to the special issue: modern streaming data analytics.
- Published in:
- Journal of Applied Statistics, 2023, v. 50, n. 14, p. 2857, doi. 10.1080/02664763.2023.2247646
- By:
- Publication type:
- Article
Dynamic channel selection in wireless communications via a multi-armed bandit algorithm using laser chaos time series.
- Published in:
- Scientific Reports, 2020, v. 10, n. 1, p. 1, doi. 10.1038/s41598-020-58541-2
- By:
- Publication type:
- Article
Reinforcement learning in queues.
- Published in:
- Queueing Systems, 2022, v. 100, n. 3/4, p. 497, doi. 10.1007/s11134-022-09844-w
- By:
- Publication type:
- Article
PAC-Bayesian lifelong learning for multi-armed bandits.
- Published in:
- Data Mining & Knowledge Discovery, 2022, v. 36, n. 2, p. 841, doi. 10.1007/s10618-022-00825-4
- By:
- Publication type:
- Article
Contextual bandits with hidden contexts: a focused data capture from social media streams.
- Published in:
- Data Mining & Knowledge Discovery, 2019, v. 33, n. 6, p. 1853, doi. 10.1007/s10618-019-00648-w
- By:
- Publication type:
- Article
Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks.
- Published in:
- Journal of Machine Learning Research, 2018, v. 18, n. 154-234, p. 1
- By:
- Publication type:
- Article
TSEC: A Framework for Online Experimentation under Experimental Constraints.
- Published in:
- Technometrics, 2022, v. 64, n. 4, p. 513, doi. 10.1080/00401706.2022.2125443
- By:
- Publication type:
- Article
Overcoming Free-Riding in Bandit Games.
- Published in:
- Review of Economic Studies, 2022, v. 89, n. 4, p. 1948, doi. 10.1093/restud/rdab078
- By:
- Publication type:
- Article
Percentile optimization in multi-armed bandit problems.
- Published in:
- Annals of Operations Research, 2024, v. 340, n. 2/3, p. 837, doi. 10.1007/s10479-024-06165-4
- By:
- Publication type:
- Article
Four proofs of Gittins' multiarmed bandit theorem.
- Published in:
- Annals of Operations Research, 2016, v. 241, n. 1/2, p. 127, doi. 10.1007/s10479-013-1523-0
- By:
- Publication type:
- Article
An asymptotically optimal strategy for constrained multi-armed bandit problems.
- Published in:
- Mathematical Methods of Operations Research, 2020, v. 91, n. 3, p. 545, doi. 10.1007/s00186-019-00697-3
- By:
- Publication type:
- Article
ON THE ASYMPTOTIC OPTIMALITY OF GREEDY INDEX HEURISTICS FOR MULTI-ACTION RESTLESS BANDITS.
- Published in:
- Advances in Applied Probability, 2015, v. 47, n. 3, p. 652, doi. 10.1239/aap/1444308876
- By:
- Publication type:
- Article
Editorial.
- Published in:
- Naval Research Logistics, 2023, v. 70, n. 5, p. 395, doi. 10.1002/nav.22138
- By:
- Publication type:
- Article
Decision maker based on atomic switches.
- Published in:
- AIMS Materials Science, 2015, v. 3, n. 1, p. 245, doi. 10.3934/matersci.2016.1.245
- By:
- Publication type:
- Article
Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.
- Published in:
- Scientific Reports, 2022, v. 12, n. 1, p. 1, doi. 10.1038/s41598-022-12155-y
- By:
- Publication type:
- Article
Conflict-free collective stochastic decision making by orbital angular momentum of photons through quantum interference.
- Published in:
- Scientific Reports, 2021, v. 11, n. 1, p. 1, doi. 10.1038/s41598-021-00493-2
- By:
- Publication type:
- Article
A Cost-Sensitive Decision Tree Learning Algorithm Based on a Multi-Armed Bandit Framework.
- Published in:
- Computer Journal, 2017, v. 60, n. 7, p. 941, doi. 10.1093/comjnl/bxw015
- By:
- Publication type:
- Article
A Flexible Mechanism of Rule Selection Enables Rapid Feature-Based Reinforcement Learning.
- Published in:
- Frontiers in Neuroscience, 2016, p. 1, doi. 10.3389/fnins.2016.00125
- By:
- Publication type:
- Article
Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm.
- Published in:
- Entropy, 2021, v. 23, n. 3, p. 380, doi. 10.3390/e23030380
- By:
- Publication type:
- Article
On Gap-Based Lower Bounding Techniques for Best-Arm Identification.
- Published in:
- Entropy, 2020, v. 22, n. 7, p. 788, doi. 10.3390/e22070788
- By:
- Publication type:
- Article
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits: Supplementary Materials 2.
- Published in:
- Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
- By:
- Publication type:
- Article
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits: Supplementary Materials 1.
- Published in:
- Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
- By:
- Publication type:
- Article
An Analysis of the Value of Information When Exploring Stochastic, Discrete Multi-Armed Bandits.
- Published in:
- Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
- By:
- Publication type:
- Article
Reinforcement Learning in Economics and Finance.
- Published in:
- Computational Economics, 2023, v. 62, n. 1, p. 425, doi. 10.1007/s10614-021-10119-4
- By:
- Publication type:
- Article
Learning to school in the presence of hydrodynamic interactions.
- Published in:
- Journal of Fluid Mechanics, 2016, v. 789, p. 726, doi. 10.1017/jfm.2015.686
- By:
- Publication type:
- Article
Multitasking, Multiarmed Bandits, and the Italian Judiciary.
- Published in:
- M&SOM: Manufacturing & Service Operations Management, 2016, v. 18, n. 4, p. 545, doi. 10.1287/msom.2016.0586
- By:
- Publication type:
- Article
Asymptotically optimal algorithms for budgeted multiple play bandits.
- Published in:
- Machine Learning, 2019, v. 108, n. 11, p. 1919, doi. 10.1007/s10994-019-05799-x
- By:
- Publication type:
- Article
Online Causal Inference for Advertising in Real-Time Bidding Auctions.
- Published in:
- Marketing Science, 2025, v. 44, n. 1, p. 176, doi. 10.1287/mksc.2022.0406
- By:
- Publication type:
- Article
Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments.
- Published in:
- Marketing Science, 2017, v. 36, n. 4, p. 500, doi. 10.1287/mksc.2016.1023
- By:
- Publication type:
- Article
Offline Planning and Online Learning Under Recovering Rewards.
- Published in:
- Management Science, 2025, v. 71, n. 1, p. 298, doi. 10.1287/mnsc.2021.04202
- By:
- Publication type:
- Article
Maximal Objectives in the Multiarmed Bandit with Applications.
- Published in:
- Management Science, 2024, v. 70, n. 12, p. 8853, doi. 10.1287/mnsc.2022.00801
- By:
- Publication type:
- Article
Weak Signal Asymptotics for Sequentially Randomized Experiments.
- Published in:
- Management Science, 2024, v. 70, n. 10, p. 7024, doi. 10.1287/mnsc.2023.4964
- By:
- Publication type:
- Article
Phase Transitions in Bandits with Switching Constraints.
- Published in:
- Management Science, 2023, v. 69, n. 12, p. 7182, doi. 10.1287/mnsc.2023.4755
- By:
- Publication type:
- Article
A Model of Search with Two Stages of Information Acquisition and Additive Learning.
- Published in:
- Management Science, 2022, v. 68, n. 2, p. 1212, doi. 10.1287/mnsc.2021.4150
- By:
- Publication type:
- Article
Robust Multiarmed Bandit Problems.
- Published in:
- Management Science, 2016, v. 62, n. 1, p. 264, doi. 10.1287/mnsc.2015.2153
- By:
- Publication type:
- Article
EXPLORATION–EXPLOITATION POLICIES WITH ALMOST SURE, ARBITRARILY SLOW GROWING ASYMPTOTIC REGRET.
- Published in:
- Probability in the Engineering & Informational Sciences, 2020, v. 34, n. 3, p. 406, doi. 10.1017/S0269964818000529
- By:
- Publication type:
- Article
ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS.
- Published in:
- Probability in the Engineering & Informational Sciences, 2017, v. 31, n. 2, p. 239, doi. 10.1017/S0269964816000279
- By:
- Publication type:
- Article
Efficient Wireless Sensor Network for Radiation Detection in Nuclear Sites.
- Published in:
- International Journal of Electronics & Telecommunications, 2021, v. 67, n. 2, p. 175, doi. 10.24425/ijet.2021.135961
- By:
- Publication type:
- Article
Millimeter Wave Beamforming Training: A Reinforcement Learning Approach.
- Published in:
- International Journal of Electronics & Telecommunications, 2021, v. 67, n. 1, p. 95, doi. 10.24425/ijet.2021.135949
- By:
- Publication type:
- Article
Editorial.
- Published in:
- 2024
- By:
- Publication type:
- Editorial
Technical Note—Online Matching with Bayesian Rewards.
- Published in:
- Operations Research, 2025, v. 73, n. 1, p. 278, doi. 10.1287/opre.2021.0499
- By:
- Publication type:
- Article
Smoothness-Adaptive Contextual Bandits.
- Published in:
- Operations Research, 2022, v. 70, n. 6, p. 3198, doi. 10.1287/opre.2021.2215
- By:
- Publication type:
- Article
Optimistic Gittins Indices.
- Published in:
- Operations Research, 2022, v. 70, n. 6, p. 3432, doi. 10.1287/opre.2021.2207
- By:
- Publication type:
- Article