Works matching DE "MULTI-armed bandit problem (Probability theory)"

Results: 70

Asymptotically optimal algorithms for budgeted multiple play bandits.
Published in:
Machine Learning, 2019, v. 108, n. 11, p. 1919, doi. 10.1007/s10994-019-05799-x
By:
- Luedtke, Alex;
- Kaufmann, Emilie;
- Chambaz, Antoine
Publication type:
Article
QoS-Based Blind Spectrum Selection with Multi-armed Bandit Problem in Cognitive Radio Networks.
Published in:
Wireless Personal Communications, 2016, v. 89, n. 2, p. 663, doi. 10.1007/s11277-016-3301-1
By:
- Chen, Yongqun;
- Zhou, Huaibei;
- Kong, Ruoshan;
- Huang, Junyuan;
- Chen, Bo
Publication type:
Article
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems.
Published in:
Mathematics of Operations Research, 2019, v. 44, n. 2, p. 377, doi. 10.1287/moor.2017.0928
By:
- Garivier, Aurélien;
- Ménard, Pierre;
- Stoltz, Gilles
Publication type:
Article
Design-Based Estimators for Average Treatment Effects for Multi-Armed RCTs.
Published in:
Journal of Educational & Behavioral Statistics, 2018, v. 43, n. 5, p. 568, doi. 10.3102/1076998618786968
By:
- Schochet, Peter Z.
Publication type:
Article
Editorial.
Published in:
2024
By:
- Pardalos, Panos;
- Kalyagin, Valery;
- Guarracino, Mario R.
Publication type:
Editorial
Sliding-Window Thompson Sampling for Non-Stationary Settings.
Published in:
Journal of Artificial Intelligence Research, 2020, v. 68, p. 311, doi. 10.1613/jair.1.11407
By:
- Trovò, Francesco;
- Paladino, Stefano;
- Restelli, Marcello;
- Gatti, Nicola
Publication type:
Article
Rate-Optimal Bayesian Simple Regret in Best Arm Identification.
Published in:
Mathematics of Operations Research, 2024, v. 49, n. 3, p. 1629, doi. 10.1287/moor.2022.0011
By:
- Komiyama, Junpei;
- Ariu, Kaito;
- Kato, Masahiro;
- Qin, Chao
Publication type:
Article
Multiplayer Bandits Without Observing Collision Information.
Published in:
Mathematics of Operations Research, 2022, v. 47, n. 2, p. 1247, doi. 10.1287/moor.2021.1168
By:
- Lugosi, Gábor;
- Mehrabian, Abbas
Publication type:
Article
Online Causal Inference for Advertising in Real-Time Bidding Auctions.
Published in:
Marketing Science, 2025, v. 44, n. 1, p. 176, doi. 10.1287/mksc.2022.0406
By:
- Waisman, Caio;
- Nair, Harikesh S.;
- Carrion, Carlos
Publication type:
Article
Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments.
Published in:
Marketing Science, 2017, v. 36, n. 4, p. 500, doi. 10.1287/mksc.2016.1023
By:
- Schwartz, Eric M.;
- Bradlow, Eric T.;
- Fader, Peter S.
Publication type:
Article
Distributed decision making policy for frequency band selection boosting RF energy harvesting rate in wireless sensor nodes.
Published in:
Wireless Networks (10220038), 2018, v. 24, n. 8, p. 3189, doi. 10.1007/s11276-017-1529-7
By:
- Darak, S. J.;
- Moy, Christophe;
- Palicot, Jacques
Publication type:
Article
Effects of Ventral Striatum Lesions on Stimulus-Based versus Action-Based Reinforcement Learning.
Published in:
Journal of Neuroscience, 2017, v. 37, n. 29, p. 6902, doi. 10.1523/JNEUROSCI.0631-17.2017
By:
- Rothenhoefer, Kathryn M.;
- Costa, Vincent D.;
- Bartolo, Ramón;
- Vicario-Feliciano, Raquel;
- Murray, Elisabeth A.;
- Averbeck, Bruno B.
Publication type:
Article
On the Asymptotic Optimality of Greedy Index Heuristics for Multi-Action Restless Bandits.
Published in:
Advances in Applied Probability, 2015, v. 47, n. 3, p. 652, doi. 10.1239/aap/1444308876
By:
- Hodge, D. J.;
- Glazebrook, K. D.
Publication type:
Article
Reinforcement learning in queues.
Published in:
Queueing Systems, 2022, v. 100, n. 3/4, p. 497, doi. 10.1007/s11134-022-09844-w
By:
- Ayesta, U.
Publication type:
Article
Fast Demand Response Based on Model Predictive Control for a Net Zero Energy Building in Cold Climate.
Published in:
ASHRAE Transactions, 2024, v. 130, n. Part 2, p. 87
By:
- Ichikawa, Hiroyuki;
- Takahashi, Ken;
- Ooka, Ryozo
Publication type:
Article
Robust learning in expert networks: a comparative analysis.
Published in:
Journal of Intelligent Information Systems, 2018, v. 51, n. 2, p. 207, doi. 10.1007/s10844-018-0515-6
By:
- KhudaBukhsh, Ashiqur R.;
- Carbonell, Jaime G.;
- Jansen, Peter J.
Publication type:
Article
Reinforcement Learning in Economics and Finance.
Published in:
Computational Economics, 2023, v. 62, n. 1, p. 425, doi. 10.1007/s10614-021-10119-4
By:
- Charpentier, Arthur;
- Élie, Romuald;
- Remlinger, Carl
Publication type:
Article
Exploration–Exploitation Policies with Almost Sure, Arbitrarily Slow Growing Asymptotic Regret.
Published in:
Probability in the Engineering & Informational Sciences, 2020, v. 34, n. 3, p. 406, doi. 10.1017/S0269964818000529
By:
- Cowan, Wesley;
- Katehakis, Michael N.
Publication type:
Article
On the Identification and Mitigation of Weaknesses in the Knowledge Gradient Policy for Multi-Armed Bandits.
Published in:
Probability in the Engineering & Informational Sciences, 2017, v. 31, n. 2, p. 239, doi. 10.1017/S0269964816000279
By:
- Edwards, James;
- Fearnhead, Paul;
- Glazebrook, Kevin
Publication type:
Article
Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.
Published in:
Scientific Reports, 2022, v. 12, n. 1, p. 1, doi. 10.1038/s41598-022-12155-y
By:
- Morijiri, Kensei;
- Mihana, Takatomo;
- Kanno, Kazutaka;
- Naruse, Makoto;
- Uchida, Atsushi
Publication type:
Article
An asymptotically optimal strategy for constrained multi-armed bandit problems.
Published in:
Mathematical Methods of Operations Research, 2020, v. 91, n. 3, p. 545, doi. 10.1007/s00186-019-00697-3
By:
- Chang, Hyeong Soo
Publication type:
Article
Infomax Strategies for an Optimal Balance Between Exploration and Exploitation.
Published in:
Journal of Statistical Physics, 2016, v. 163, n. 6, p. 1454, doi. 10.1007/s10955-016-1521-0
By:
- Reddy, Gautam;
- Celani, Antonio;
- Vergassola, Massimo
Publication type:
Article
Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm.
Published in:
Entropy, 2021, v. 23, n. 3, p. 380, doi. 10.3390/e23030380
By:
- Cavenaghi, Emanuele;
- Sottocornola, Gabriele;
- Stella, Fabio;
- Zanker, Markus;
- Cattani, Carlo
Publication type:
Article
On Gap-Based Lower Bounding Techniques for Best-Arm Identification.
Published in:
Entropy, 2020, v. 22, n. 7, p. 788, doi. 10.3390/e22070788
By:
- Truong, Lan V.;
- Scarlett, Jonathan
Publication type:
Article
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits: Supplementary Materials 2.
Published in:
Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
By:
- Sledge, Isaac John;
- Príncipe, José Carlos
Publication type:
Article
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits: Supplementary Materials 1.
Published in:
Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
By:
- Sledge, Isaac John;
- Príncipe, José Carlos
Publication type:
Article
An Analysis of the Value of Information When Exploring Stochastic, Discrete Multi-Armed Bandits.
Published in:
Entropy, 2018, v. 20, n. 3, p. 155, doi. 10.3390/e20030155
By:
- Sledge, Isaac J.;
- Príncipe, José C.
Publication type:
Article
Technical Note—Online Matching with Bayesian Rewards.
Published in:
Operations Research, 2025, v. 73, n. 1, p. 278, doi. 10.1287/opre.2021.0499
By:
- Simchi-Levi, David;
- Sun, Rui;
- Wang, Xinshang
Publication type:
Article
Smoothness-Adaptive Contextual Bandits.
Published in:
Operations Research, 2022, v. 70, n. 6, p. 3198, doi. 10.1287/opre.2021.2215
By:
- Gur, Yonatan;
- Momeni, Ahmadreza;
- Wager, Stefan
Publication type:
Article
Optimistic Gittins Indices.
Published in:
Operations Research, 2022, v. 70, n. 6, p. 3432, doi. 10.1287/opre.2021.2207
By:
- Farias, Vivek F.;
- Gutin, Eli
Publication type:
Article
Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality.
Published in:
Operations Research, 2022, v. 70, n. 5, p. 3015, doi. 10.1287/opre.2021.2181
By:
- Brown, David B.;
- Zhang, Jingwei
Publication type:
Article
In This Issue.
Published in:
Operations Research, 2022, v. 70, n. 1, p. iii, doi. 10.1287/opre.2021.2252
Publication type:
Article
A Restless Bandit Model for Resource Allocation, Competition, and Reservation.
Published in:
Operations Research, 2022, v. 70, n. 1, p. 416, doi. 10.1287/opre.2020.2066
By:
- Fu, Jing;
- Moran, Bill;
- Taylor, Peter G.
Publication type:
Article
Simple Bayesian Algorithms for Best-Arm Identification.
Published in:
Operations Research, 2020, v. 68, n. 6, p. 1625, doi. 10.1287/opre.2019.1911
By:
- Russo, Daniel
Publication type:
Article
Bandits with Global Convex Constraints and Objective.
Published in:
Operations Research, 2019, v. 67, n. 5, p. 1486, doi. 10.1287/opre.2019.1840
By:
- Agrawal, Shipra;
- Devanur, Nikhil R.
Publication type:
Article
Robust Multiarmed Bandit Problems.
Published in:
Management Science, 2016, v. 62, n. 1, p. 264, doi. 10.1287/mnsc.2015.2153
By:
- Jong Kim, Michael;
- Lim, Andrew E. B.
Publication type:
Article
Decision maker based on atomic switches.
Published in:
AIMS Materials Science, 2015, v. 3, n. 1, p. 245, doi. 10.3934/matersci.2016.1.245
By:
- Kim, Song-Ju;
- Tsuruoka, Tohru;
- Hasegawa, Tsuyoshi;
- Aono, Masashi;
- Terabe, Kazuya;
- Aono, Masakazu
Publication type:
Article
False discovery rate control with e‐values.
Published in:
Journal of the Royal Statistical Society: Series B (Statistical Methodology), 2022, v. 84, n. 3, p. 822, doi. 10.1111/rssb.12489
By:
- Wang, Ruodu;
- Ramdas, Aaditya
Publication type:
Article
Optimizing Infill Drilling Decisions Using Multi-Armed Bandits: Application in a Long-Term, Multi-Element Stockpile.
Published in:
Mathematical Geosciences, 2018, v. 50, n. 1, p. 35, doi. 10.1007/s11004-017-9695-9
By:
- Dirkx, Rein;
- Dimitrakopoulos, Roussos
Publication type:
Article
An efficient learning framework for multiproduct inventory systems with customer choices.
Published in:
Production & Operations Management, 2022, v. 31, n. 6, p. 2492, doi. 10.1111/poms.13693
By:
- Gao, Xiangyu;
- Zhang, Huanan
Publication type:
Article
Self-Assembly of Complex DNA Tessellations by Using Low-Symmetry Multi-arm DNA Tiles.
Published in:
Angewandte Chemie International Edition, 2016, v. 55, n. 31, p. 8860, doi. 10.1002/anie.201601944
By:
- Zhang, Fei;
- Jiang, Shuoxing;
- Li, Wei;
- Hunt, Ashley;
- Liu, Yan;
- Yan, Hao
Publication type:
Article
Customization of J. Bather's UCB Strategy for a Gaussian Multiarmed Bandit.
Published in:
Automation & Remote Control, 2022, v. 83, n. 11, p. 1857, doi. 10.1134/S00051179220110108
By:
- Garbar, S. V.;
- Kolnogorov, A. V.
Publication type:
Article
Dynamic channel selection in wireless communications via a multi-armed bandit algorithm using laser chaos time series.
Published in:
Scientific Reports, 2020, v. 10, n. 1, p. 1, doi. 10.1038/s41598-020-58541-2
By:
- Takeuchi, Shungo;
- Hasegawa, Mikio;
- Kanno, Kazutaka;
- Uchida, Atsushi;
- Chauvet, Nicolas;
- Naruse, Makoto
Publication type:
Article
A Cost-Sensitive Decision Tree Learning Algorithm Based on a Multi-Armed Bandit Framework.
Published in:
Computer Journal, 2017, v. 60, n. 7, p. 941, doi. 10.1093/comjnl/bxw015
By:
- Lomax, Susan;
- Vadera, Sunil
Publication type:
Article
A Flexible Mechanism of Rule Selection Enables Rapid Feature-Based Reinforcement Learning.
Published in:
Frontiers in Neuroscience, 2016, p. 1, doi. 10.3389/fnins.2016.00125
By:
- Balcarras, Matthew;
- Womelsdorf, Thilo
Publication type:
Article
Power considerations for trials of two experimental arms versus a standard active control or placebo.
Published in:
2016
By:
- Hasselblad, Vic
Publication type:
Article
Interactive Restless Multi-armed Bandit Game and Swarm Intelligence Effect.
Published in:
New Generation Computing, 2016, v. 34, n. 3, p. 291, doi. 10.1007/s00354-016-0306-y
By:
- Yoshida, Shunsuke;
- Hisakado, Masato;
- Mori, Shintaro
Publication type:
Article
PAC-Bayesian lifelong learning for multi-armed bandits.
Published in:
Data Mining & Knowledge Discovery, 2022, v. 36, n. 2, p. 841, doi. 10.1007/s10618-022-00825-4
By:
- Flynn, Hamish;
- Reeb, David;
- Kandemir, Melih;
- Peters, Jan
Publication type:
Article
Contextual bandits with hidden contexts: a focused data capture from social media streams.
Published in:
Data Mining & Knowledge Discovery, 2019, v. 33, n. 6, p. 1853, doi. 10.1007/s10618-019-00648-w
By:
- Lamprier, Sylvain;
- Gisselbrecht, Thibault;
- Gallinari, Patrick
Publication type:
Article
Editorial.
Published in:
Naval Research Logistics, 2023, v. 70, n. 5, p. 395, doi. 10.1002/nav.22138
By:
- Federgruen, Awi;
- Katehakis, Michael;
- Spieksma, Floske
Publication type:
Article