智能论文笔记

RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests

Victor Chernozhukov , Whitney K. Newey , Victor Quintas-Martinez , Vasilis Syrgkanis

分类：机器学习 | (统计)机器学习

2021-10-06

感兴趣的许多因果和政策效应都是由高维或非参数回归函数的线性功能定义的。 $ \ sqrt {n} $ - 对目标对象的一致且渐近地正常估计需要偏见，以减少正则化和/或模型选择对感兴趣对象的影响。通常，通过将校正项添加到功能的插件估计器中来实现，从而导致属性，例如半参数效率，双重鲁棒性和Neyman正交性。我们基于自动学习使用神经网和随机森林的Riesz表示的自动偏差程序。我们的方法仅依赖于黑框评估Oracle访问线性功能，并且不需要其分析形式的知识。我们提出了一种多任务神经网络偏见方法，具有随机梯度下降最小化的Riesz代表和回归损失，同时共享这两个函数的表示层。我们还提出了一种随机森林方法，该方法了解Riesz函数的局部线性表示。即使我们的方法适用于任意功能，我们在实验上发现它的性能与Shi等人的最先进的神经网状算法相比。（2019）对于平均治疗效果功能的情况。我们还使用汽油需求的汽油价格变化的半合成数据来评估我们的方法，即通过连续处理估算平均边缘效应的问题。

translated by 谷歌翻译

Localized Debiased Machine Learning: Efficient Inference on Quantile Treatment Effects and Beyond

Nathan Kallus , Xiaojie Mao , Masatoshi Uehara

分类： (统计)机器学习 | 机器学习

2019-12-30

我们考虑在估计涉及依赖参数的高维滋扰的估计方程中估计一个低维参数。一个中心示例是因果推理中（局部）分位数处理效应（（L）QTE）的有效估计方程，涉及在分位数以估计的分位数评估的协方差累积分布函数。借记机学习（DML）是一种使用灵活的机器学习方法估算高维滋扰的数据分解方法，但是将其应用于参数依赖性滋扰的问题是不切实际的。对于（L）QTE，DML要求我们学习整个协变量累积分布函数。相反，我们提出了局部偏见的机器学习（LDML），该学习避免了这一繁重的步骤，并且只需要对参数进行一次初始粗糙猜测而估算烦恼。对于（L）QTE，LDML仅涉及学习两个回归功能，这是机器学习方法的标准任务。我们证明，在松弛速率条件下，我们的估计量与使用未知的真实滋扰的不可行的估计器具有相同的有利渐近行为。因此，LDML值得注意的是，当我们必须控制许多协变量和/或灵活的关系时，如（l）QTES在（（l）QTES）中，实际上可以有效地估算重要数量，例如（l）QTES。

translated by 谷歌翻译

Normalized Augmented Inverse Probability Weighting with Neural Network Predictions

Mehdi Rostami , Olli Saarela

分类： (统计)机器学习

2021-08-03

作为因果参数的平均处理效果（ATE）的估计分为两个步骤，其中在第一步中，建模治疗和结果以包含潜在的混乱，并且在第二步中，将预测插入到其中ATE估计器，例如增强逆概率加权（AIPW）估计器。由于对混乱与治疗和结果之间的非线性或未知关系的担忧，有兴趣应用非参数学方法，例如机器学习（ML）算法。一些文献建议使用两个单独的神经网络（NNS），其中网络的参数没有正则化，除了NN优化中的随机梯度下降（SGD）。我们的模拟表明，如果没有使用正则化，则AIPW估计器会受到广泛的影响。我们提出了AIPW（称为Naipw）的正常化，这在某些情况下可以有所帮助。 Naipw，可否提供与AIPW相同的属性，即双重稳健性和正交性属性。此外，如果第一步算法收敛到足够快，则在监管条件下，Naipw将是渐近正常的。我们还在NNS上施加小于中等L1正则化的偏差和方差方面比较AIPW和NAIPW的性能。

translated by 谷歌翻译

Estimation and Inference of Heterogeneous Treatment Effects using Random Forests

Stefan Wager , Susan Athey

分类：

2015-10-14

Many scientific and engineering challenges-ranging from personalized medicine to customized marketing recommendations-require an understanding of treatment effect heterogeneity. In this paper, we develop a non-parametric causal forest for estimating heterogeneous treatment effects that extends Breiman's widely used random forest algorithm. In the potential outcomes framework with unconfoundedness, we show that causal forests are pointwise consistent for the true treatment effect, and have an asymptotically Gaussian and centered sampling distribution. We also discuss a practical method for constructing asymptotic confidence intervals for the true treatment effect that are centered at the causal forest estimates. Our theoretical results rely on a generic Gaussian theory for a large family of random forest algorithms. To our knowledge, this is the first set of results that allows any type of random forest, including classification and regression forests, to be used for provably valid statistical inference. In experiments, we find causal forests to be substantially more powerful than classical methods based on nearest-neighbor matching, especially in the presence of irrelevant covariates.

translated by 谷歌翻译

DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R

Philipp Bach , Victor Chernozhukov , Malte S. Kurz , Martin Spindler

分类： (统计)机器学习 | 机器学习

2021-03-17

R包Doubleml实现了Chernozhukov等人的双重/辩护机器学习框架。（2018）。它提供了基于机器学习方法的因果模型中估计参数的功能。双机器学习框架由三个关键成分组成：Neyman正交性，高质量的机器学习估计和样品拆分。可以通过MLR3生态系统中可用的各种最新机器学习方法来执行滋扰组件的估计。 Doubleml使得可以在各种因果模型中进行推断，包括部分线性和交互式回归模型及其扩展到仪器变量估计。 Doubleml的面向对象的实现为模型规范具有很高的灵活性，并使其易于扩展。本文是对双机器学习框架和R软件包DOUBLEML的介绍。在具有模拟和真实数据集的可再现代码示例中，我们演示了Doubleml用户如何基于机器学习方法执行有效的推断。

translated by 谷歌翻译

Estimating Heterogeneous Bounds for Treatment Effects under Sample Selection and Non-response

Phillip Heiler

分类： (统计)机器学习

2022-09-09

在本文中，我们提出了一种非参数估计的方法，并推断了一般样本选择模型中因果效应参数的异质界限，初始治疗可能会影响干预后结果是否观察到。可观察到的协变量可能会混淆治疗选择，而观察结果和不可观察的结果可能会混淆。该方法提供条件效应界限作为策略相关的预处理变量的功能。它允许对身份不明的条件效应曲线进行有效的统计推断。我们使用灵活的半参数脱偏机学习方法，该方法可以适应柔性功能形式和治疗，选择和结果过程之间的高维混杂变量。还提供了易于验证的高级条件，以进行估计和错误指定的鲁棒推理保证。

translated by 谷歌翻译

DeepMed: Semiparametric Causal Mediation Analysis with Debiased Deep Learning

Siqi Xu , Lin Liu , Zhonghua Liu

分类： (统计)机器学习 | 机器学习

2022-10-10

Causal mediation analysis can unpack the black box of causality and is therefore a powerful tool for disentangling causal pathways in biomedical and social sciences, and also for evaluating machine learning fairness. To reduce bias for estimating Natural Direct and Indirect Effects in mediation analysis, we propose a new method called DeepMed that uses deep neural networks (DNNs) to cross-fit the infinite-dimensional nuisance functions in the efficient influence functions. We obtain novel theoretical results that our DeepMed method (1) can achieve semiparametric efficiency bound without imposing sparsity constraints on the DNN architecture and (2) can adapt to certain low dimensional structures of the nuisance functions, significantly advancing the existing literature on DNN-based semiparametric causal inference. Extensive synthetic experiments are conducted to support our findings and also expose the gap between theory and practice. As a proof of concept, we apply DeepMed to analyze two real datasets on machine learning fairness and reach conclusions consistent with previous findings.

translated by 谷歌翻译

On the role of surrogates in the efficient estimation of treatment effects with limited outcome data

Nathan Kallus , Xiaojie Mao

分类： (统计)机器学习 | 机器学习

2020-03-27

In many investigations, the primary outcome of interest is difficult or expensive to collect. Examples include long-term health effects of medical interventions, measurements requiring expensive testing or follow-up, and outcomes only measurable on small panels as in marketing. This reduces effective sample sizes for estimating the average treatment effect (ATE). However, there is often an abundance of observations on surrogate outcomes not of primary interest, such as short-term health effects or online-ad click-through. We study the role of such surrogate observations in the efficient estimation of treatment effects. To quantify their value, we derive the semiparametric efficiency bounds on ATE estimation with and without the presence of surrogates and several intermediary settings. The difference between these characterizes the efficiency gains from optimally leveraging surrogates. We study two regimes: when the number of surrogate observations is comparable to primary-outcome observations and when the former dominates the latter. We take an agnostic missing-data approach circumventing strong surrogate conditions previously assumed. To leverage surrogates' efficiency gains, we develop efficient ATE estimation and inference based on flexible machine-learning estimates of nuisance functions appearing in the influence functions we derive. We empirically demonstrate the gains by studying the long-term earnings effect of job training.

translated by 谷歌翻译

Finite-Sample Guarantees for High-Dimensional DML

Victor Quintas-Martinez

分类：机器学习 | (统计)机器学习

2022-06-15

DECIASED机器学习（DML）提供了一种有吸引力的方法来估计观察环境中的治疗效果，在这种情况下，因果参数的识别需要有条件的独立性或不符的假设，因为它可以灵活地控制大量的协变量。本文提供了新的有限样本保证，可保证对高维DML的关节推断，从而界定了估计量的有限样本分布与其渐近高斯近似相距多远。这些保证对应用研究人员很有用，因为它们可以提供距离标称级别的联合置信带覆盖范围的距离。在许多情况下，高维因果参数可能引起人们的关注，例如许多治疗概况的吃量，或者在许多结果上进行治疗的食品。我们还涵盖了无限维度参数，例如对潜在结果的整个边际分布的影响。本文中的有限样本保证补充了DML估计量的一致性和渐近正态性的现有结果，DML估计量是渐近的，或仅处理一维情况。

translated by 谷歌翻译

Distribution-free Prediction Sets Adaptive to Unknown Covariate Shift

Hongxiang Qiu , Edgar Dobriban , Eric Tchetgen Tchetgen

分类： (统计)机器学习

2022-03-11

预测一组结果 - 而不是独特的结果 - 是统计学习中不确定性定量的有前途的解决方案。尽管有关于构建具有统计保证的预测集的丰富文献，但适应未知的协变量转变（实践中普遍存在的问题）还是一个严重的未解决的挑战。在本文中，我们表明具有有限样本覆盖范围保证的预测集是非信息性的，并提出了一种新型的无灵活分配方法PredSet-1Step，以有效地构建了在未知协方差转移下具有渐近覆盖范围保证的预测集。我们正式表明我们的方法是\ textIt {渐近上可能是近似正确}，对大型样本的置信度有很好的覆盖误差。我们说明，在南非队列研究中，它在许多实验和有关HIV风险预测的数据集中实现了名义覆盖范围。我们的理论取决于基于一般渐近线性估计器的WALD置信区间覆盖范围的融合率的新结合。

translated by 谷歌翻译

Falsification before Extrapolation in Causal Effect Estimation

Zeshan Hussain , Michael Oberst , Ming-Chieh Shih , David Sontag

分类：机器学习

2022-09-27

在制定政策指南时，随机对照试验（RCT）代表了黄金标准。但是，RCT通常是狭窄的，并且缺乏更广泛的感兴趣人群的数据。这些人群中的因果效应通常是使用观察数据集估算的，这可能会遭受未观察到的混杂和选择偏见。考虑到一组观察估计（例如，来自多项研究），我们提出了一个试图拒绝偏见的观察性估计值的元偏值。我们使用验证效应，可以从RCT和观察数据中推断出的因果效应。在拒绝未通过此测试的估计器之后，我们对RCT中未观察到的亚组的外推性效应产生了保守的置信区间。假设至少一个观察估计量在验证和外推效果方面是渐近正常且一致的，我们为我们算法输出的间隔的覆盖率概率提供了保证。为了促进在跨数据集的因果效应运输的设置中，我们给出的条件下，即使使用灵活的机器学习方法用于估计滋扰参数，群体平均治疗效应的双重稳定估计值也是渐近的正常。我们说明了方法在半合成和现实世界数据集上的特性，并表明它与标准的荟萃分析技术相比。

translated by 谷歌翻译

Machine Learning for Variance Reduction in Online Experiments

Yongyi Guo , Dominic Coey , Mikael Konutgan , Wenting Li , Chris Schoener , Matt Goldman

分类： (统计)机器学习 | 机器学习

2021-06-14

我们考虑随机对照试验的差异问题，通过使用与结果相关的协变量但与治疗无关。我们提出了一种机器学习回归调整的处理效果估算器，我们称之为Mlrate。 Mlrate使用机器学习预测结果来降低估计方差。它采用交叉配件来避免过度偏置，在一般条件下，我们证明了一致性和渐近正常性。 Mlrate对机器学习的预测较差的鲁棒步骤：如果预测与结果不相关，则估计器执行渐近的差异，而不是标准差异估计器，而如果预测与结果高度相关，则效率提升大。在A / A测试中，对于在Facebook实验中通常监测的一组48个结果指标，估计器的差异比简单差分估计器差异超过70％，比仅调整的共同单变量过程约19％用于结果的预测值。

translated by 谷歌翻译

Estimating heterogeneous treatment effects with right-censored data via causal survival forests

Yifan Cui , Michael R. Kosorok , Erik Sverdrup , Stefan Wager , Ruoqing Zhu

分类：机器学习 | (统计)机器学习

2020-01-27

基于森林的方法最近在非参数治疗效应估计中获得了普及。在这一工作方面，我们引入了因果生存森林，可用于在可能右估计结果的生存和观察环境中估计异质治疗效果。我们的方法依赖于正交估计方程来在不满意的情况下对审查和选择效果进行鲁棒性调整。在我们的实验中，我们发现相对于许多基线的表现良好的方法。

translated by 谷歌翻译

Off-Policy Evaluation with Policy-Dependent Optimization Response

Wenshuo Guo , Michael I. Jordan , Angela Zhou

分类：机器学习

2022-02-25

The intersection of causal inference and machine learning for decision-making is rapidly expanding, but the default decision criterion remains an \textit{average} of individual causal outcomes across a population. In practice, various operational restrictions ensure that a decision-maker's utility is not realized as an \textit{average} but rather as an \textit{output} of a downstream decision-making problem (such as matching, assignment, network flow, minimizing predictive risk). In this work, we develop a new framework for off-policy evaluation with \textit{policy-dependent} linear optimization responses: causal outcomes introduce stochasticity in objective function coefficients. Under this framework, a decision-maker's utility depends on the policy-dependent optimization, which introduces a fundamental challenge of \textit{optimization} bias even for the case of policy evaluation. We construct unbiased estimators for the policy-dependent estimand by a perturbation method, and discuss asymptotic variance properties for a set of adjusted plug-in estimators. Lastly, attaining unbiased policy evaluation allows for policy optimization: we provide a general algorithm for optimizing causal interventions. We corroborate our theoretical results with numerical simulations.

translated by 谷歌翻译

High-dimensional Inference for Dynamic Treatment Effects

Jelena Bradic , Weijie Ji , Yuqian Zhang

分类：机器学习 | (统计)机器学习

2021-10-10

本文提出了在多阶段实验的背景下的异质治疗效应的置信区间结构，以$ N $样品和高维，$ D $，混淆。我们的重点是$ d \ gg n $的情况，但获得的结果也适用于低维病例。我们展示了正则化估计的偏差，在高维变焦空间中不可避免，具有简单的双重稳固分数。通过这种方式，不需要额外的偏差，并且我们获得root $ N $推理结果，同时允许治疗和协变量的多级相互依赖性。记忆财产也没有假设;治疗可能取决于所有先前的治疗作业以及以前的所有多阶段混淆。我们的结果依赖于潜在依赖的某些稀疏假设。我们发现具有动态处理的强大推理所需的新产品率条件。

translated by 谷歌翻译

Orthogonal Series Estimation for the Ratio of Conditional Expectation Functions

Kazuhiko Shinoda , Takahiro Hoshino

分类： (统计)机器学习

2022-12-26

In various fields of data science, researchers are often interested in estimating the ratio of conditional expectation functions (CEFR). Specifically in causal inference problems, it is sometimes natural to consider ratio-based treatment effects, such as odds ratios and hazard ratios, and even difference-based treatment effects are identified as CEFR in some empirically relevant settings. This chapter develops the general framework for estimation and inference on CEFR, which allows the use of flexible machine learning for infinite-dimensional nuisance parameters. In the first stage of the framework, the orthogonal signals are constructed using debiased machine learning techniques to mitigate the negative impacts of the regularization bias in the nuisance estimates on the target estimates. The signals are then combined with a novel series estimator tailored for CEFR. We derive the pointwise and uniform asymptotic results for estimation and inference on CEFR, including the validity of the Gaussian bootstrap, and provide low-level sufficient conditions to apply the proposed framework to some specific examples. We demonstrate the finite-sample performance of the series estimator constructed under the proposed framework by numerical simulations. Finally, we apply the proposed method to estimate the causal effect of the 401(k) program on household assets.

translated by 谷歌翻译

Doubly-Valid/Doubly-Sharp Sensitivity Analysis for Causal Inference with Unmeasured Confounding

Jacob Dorn , Kevin Guo , Nathan Kallus

分类：机器学习 | (统计)机器学习

2021-12-21

在TAN（2006）边缘敏感模型下，在不观察到的混淆存在下构建平均处理效应的界限问题。结合涉及对冲倾向分数的现有表征具有对问题的新的分布稳健特征，我们提出了我们称之为“双重有效/双重尖锐”（DVD）估计的这些界限的新颖估算器。双重清晰度对应于DVD估计始终估计灵敏度模型所暗示的最有可能（即，夏普）的界限，即使当所有滋扰参数都适当一致时，即使在两个滋扰参数中的一个被击败并实现半污染参数之一。双倍有效性是部分识别的全新财产：DVD估计仍然提供有效，但即使在大多数滋扰参数都被遗漏时，仍然没有锐利。实际上，即使在DVDS点估计无法渐近正常的情况下，标准沃尔德置信区间也可能保持有效。在二进制结果的情况下，DVD估计是特别方便的并且在结果回归和倾向评分方面具有闭合形式的表达。我们展示了模拟研究中的DVD估计，以及对右心导管插入的案例研究。

translated by 谷歌翻译

A Free Lunch with Influence Functions? Improving Neural Network Estimates with Concepts from Semiparametric Statistics

Matthew J. Vowels , Sina Akbari , Necati Cihan Camgoz , Richard Bowden

分类：机器学习 | (统计)机器学习

2022-02-18

通常使用参数模型进行经验领域的参数估计，并且此类模型很容易促进统计推断。不幸的是，它们不太可能足够灵活，无法充分建模现实现象，并可能产生偏见的估计。相反，非参数方法是灵活的，但不容易促进统计推断，并且仍然可能表现出残留的偏见。我们探索了影响功能（IFS）的潜力（a）改善初始估计器而无需更多数据（b）增加模型的鲁棒性和（c）促进统计推断。我们首先对IFS进行广泛的介绍，并提出了一种神经网络方法“ Multinet”，该方法使用单个体系结构寻求合奏的多样性。我们还介绍了我们称为“ Multistep”的IF更新步骤的变体，并对不同方法提供了全面的评估。发现这些改进是依赖数据集的，这表明所使用的方法与数据生成过程的性质之间存在相互作用。我们的实验强调了从业人员需要通过不同的估计器组合进行多次分析来检查其发现的一致性。我们还表明，可以改善“自由”的现有神经网络，而无需更多数据，而无需重新训练。

translated by 谷歌翻译

Evaluating Treatment Prioritization Rules via Rank-Weighted Average Treatment Effects

Steve Yadlowsky , Scott Fleming , Nigam Shah , Emma Brunskill , Stefan Wager

分类： (统计)机器学习

2021-11-15

有许多可用于选择优先考虑治疗的可用方法，包括基于治疗效果估计，风险评分和手工制作规则的遵循申请。我们将秩加权平均治疗效应（RATY）指标作为一种简单常见的指标系列，用于比较水平竞争范围的治疗优先级规则。对于如何获得优先级规则，率是不可知的，并且仅根据他们在识别受益于治疗中受益的单位的方式进行评估。我们定义了一系列速率估算器，并证明了一个中央限位定理，可以在各种随机和观测研究环境中实现渐近精确的推断。我们为使用自主置信区间的使用提供了理由，以及用于测试关于治疗效果中的异质性的假设的框架，与优先级规则相关。我们对速率的定义嵌套了许多现有度量，包括QINI系数，以及我们的分析直接产生了这些指标的推论方法。我们展示了我们从个性化医学和营销的示例中的方法。在医疗环境中，使用来自Sprint和Accor-BP随机对照试验的数据，我们发现没有明显的证据证明异质治疗效果。另一方面，在大量的营销审判中，我们在一些数字广告活动的治疗效果中发现了具有的强大证据，并证明了如何使用率如何比较优先考虑估计风险的目标规则与估计治疗效益优先考虑的目标规则。

translated by 谷歌翻译

Estimating Individual Treatment Effects using Non-Parametric Regression Models: a Review

Alberto Caron , Gianluca Baio , Ioanna Manolopoulou

分类：机器学习 | (统计)机器学习

2020-09-14

大型观察数据越来越多地提供健康，经济和社会科学等学科，研究人员对因果问题而不是预测感兴趣。在本文中，从旨在调查参与学校膳食计划对健康指标的实证研究，研究了使用非参数回归的方法估算异质治疗效果的问题。首先，我们介绍了与观察或非完全随机数据进行因果推断相关的设置和相关的问题，以及如何在统计学习工具的帮助下解决这些问题。然后，我们审查并制定现有最先进的框架的统一分类，允许通过非参数回归模型来估算单个治疗效果。在介绍模型选择问题的简要概述后，我们说明了一些关于三种不同模拟研究的方法的性能。我们通过展示一些关于学校膳食计划数据的实证分析的一些方法的使用来结束。

translated by 谷歌翻译