Online action learning in high dimensions: A new exploration rule for contextual et-greedy heuristics
Year of publication: |
2020
|
---|---|
Authors: | Flores, Claudio C. ; Medeiros, Marcelo C. |
Publisher: |
Rio de Janeiro : Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio), Departamento de Economia |
Subject: | Bandit | sequential treatment | high dimensions | LASSO | regret |
Series: | Texto para discussão ; 674 |
---|---|
Type of publication: | Book / Working Paper |
Type of publication (narrower categories): | Working Paper |
Language: | English |
Other identifiers: | 1734830778 [GVK] hdl:10419/249722 [Handle] RePEc:rio:texdis:674 [RePEc] |
Source: |
-
Flores, Claudio C., (2020)
-
Zbonakova, Lenka, (2016)
-
Stable graphical model estimation with Random Forests for discrete, continuous, and mixed variables
Fellinghauer, Bernd, (2013)
- More ...
-
Flores, Claudio C., (2020)
-
Forecasting with machine learning methods
Medeiros, Marcelo C., (2022)
-
Smooth Regimes, Macroeconomic Variables, andBagging for the Short-Term Interest Rate Process
Audrino, Francesco, (2009)
- More ...