智能论文笔记

Probabilistic quantile factor analysis

Dimitris Korobilis , Maximilian Schröder

分类： (统计)机器学习

2022-12-20

This paper extends quantile factor analysis to a probabilistic variant that incorporates regularization and computationally efficient variational approximations. By means of synthetic and real data experiments it is established that the proposed estimator can achieve, in many cases, better accuracy than a recently proposed loss-based estimator. We contribute to the literature on measuring uncertainty by extracting new indexes of low, medium and high economic policy uncertainty, using the probabilistic quantile factor methodology. Medium and high indexes have clear contractionary effects, while the low index is benign for the economy, showing that not all manifestations of uncertainty are the same.

translated by 谷歌翻译

Enhanced Bayesian Neural Networks for Macroeconomics and Finance

Niko Hauzenberger , Florian Huber , Karin Klieber , Massimiliano Marcellino

分类： (统计)机器学习

2022-11-09

We develop Bayesian neural networks (BNNs) that permit to model generic nonlinearities and time variation for (possibly large sets of) macroeconomic and financial variables. From a methodological point of view, we allow for a general specification of networks that can be applied to either dense or sparse datasets, and combines various activation functions, a possibly very large number of neurons, and stochastic volatility (SV) for the error term. From a computational point of view, we develop fast and efficient estimation algorithms for the general BNNs we introduce. From an empirical point of view, we show both with simulated data and with a set of common macro and financial applications that our BNNs can be of practical use, particularly so for observations in the tails of the cross-sectional or time series distributions of the target variables.

translated by 谷歌翻译

Forecast combinations: an over 50-year review

Xiaoqian Wang , Rob J Hyndman , Feng Li , Yanfei Kang

分类： (统计)机器学习

2022-05-09

预测组合在预测社区中蓬勃发展，近年来，已经成为预测研究和活动主流的一部分。现在，由单个（目标）系列产生的多个预测组合通过整合来自不同来源收集的信息，从而提高准确性，从而减轻了识别单个“最佳”预测的风险。组合方案已从没有估计的简单组合方法演变为涉及时间变化的权重，非线性组合，组件之间的相关性和交叉学习的复杂方法。它们包括结合点预测和结合概率预测。本文提供了有关预测组合的广泛文献的最新评论，并参考可用的开源软件实施。我们讨论了各种方法的潜在和局限性，并突出了这些思想如何随着时间的推移而发展。还调查了有关预测组合实用性的一些重要问题。最后，我们以当前的研究差距和未来研究的潜在见解得出结论。

translated by 谷歌翻译

Variational Inference: A Review for Statisticians

David M. Blei , Alp Kucukelbir , Jon D. McAuliffe

分类：

2016-01-04

One of the core problems of modern statistics is to approximate difficult-to-compute probability densities. This problem is especially important in Bayesian statistics, which frames all inference about unknown quantities as a calculation involving the posterior density. In this paper, we review variational inference (VI), a method from machine learning that approximates probability densities through optimization. VI has been used in many applications and tends to be faster than classical methods, such as Markov chain Monte Carlo sampling. The idea behind VI is to first posit a family of densities and then to find the member of that family which is close to the target. Closeness is measured by Kullback-Leibler divergence. We review the ideas behind mean-field variational inference, discuss the special case of VI applied to exponential family models, present a full example with a Bayesian mixture of Gaussians, and derive a variant that uses stochastic optimization to scale up to massive data. We discuss modern research in VI and highlight important open problems. VI is powerful, but it is not yet well understood. Our hope in writing this paper is to catalyze statistical research on this class of algorithms.

translated by 谷歌翻译

Probabilistic Feature Selection in Joint Quantile Time Series Analysis

Ning Ning

分类： (统计)机器学习 | 机器学习

2020-10-04

分位数特征选择与相关的多变量时间序列数据一直是一种方法论挑战，是一个公开的问题。在本文中，我们提出了一般的概率方法，用于在分位数特征选择时间序列（QFSTS）模型的名称下进行关节定量时间序列分析中的特征选择。 QFSTS模型是一般的结构时间序列模型，其中每个组件对具有直接解释的时间序列建模产生了添加剂贡献。其灵活性是化合物，用户可以在用户可以为每个次系列添加/扣除组件，并且每个时间序列都可以具有其自身特定的不同大小的价值组件。特征选择是在分位数回归组件中进行的，其中每个时间序列都有自己的同时外部预测器池，允许“垂圈”。通过多变量非对称LAPLACE分布，“峰值板”先前设置，Metropolis-Hastings算法和贝叶斯模型平均技术，开发了创造性的概率方法在扩展到分量时间序列研究区域的特征选择。始终如一地在贝叶斯范式中。与大多数机器学习算法不同，QFSTS模型需要小型数据集训练，快速收敛，并且可在普通的个人计算机上进行可执行。对模拟数据和经验数据的广泛检查确认QFSTS模型具有卓越的性能特征选择，参数估计和预测。

translated by 谷歌翻译

A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression

Youngseok Kim , Wei Wang , Peter Carbonetto , Matthew Stephens

分类： (统计)机器学习

2022-08-23

我们引入了一种新的经验贝叶斯方法，用于大规模多线性回归。我们的方法结合了两个关键思想：（i）使用灵活的“自适应收缩”先验，该先验近似于正常分布的有限混合物，近似于正常分布的非参数家族；（ii）使用变分近似来有效估计先前的超参数并计算近似后期。将这两个想法结合起来，将快速，灵活的方法与计算速度相当，可与快速惩罚的回归方法（例如Lasso）相当，并在各种场景中具有出色的预测准确性。此外，我们表明，我们方法中的后验平均值可以解释为解决惩罚性回归问题，并通过直接解决优化问题（而不是通过交叉验证来调整）从数据中学到的惩罚函数的精确形式。。我们的方法是在r https://github.com/stephenslab/mr.ash.ash.alpha的r软件包中实现的

translated by 谷歌翻译

A Variational Inference Approach to Inverse Problems with Gamma Hyperpriors

Shiv Agrawal , Hwanwoo Kim , Alexander Strang , Daniel Sanz-Alonso

分类： (统计)机器学习

2021-11-26

具有伽马超高提升的分层模型提供了一个灵活，稀疏的促销框架，用于桥接$ l ^ 1 $和$ l ^ 2 $ scalalizations在贝叶斯的配方中致正问题。尽管对这些模型具有贝叶斯动机，但现有的方法仅限于\ Textit {最大后验}估计。尚未实现执行不确定性量化的可能性。本文介绍了伽马超高图的分层逆问题的变分迭代交替方案。所提出的变分推理方法产生精确的重建，提供有意义的不确定性量化，易于实施。此外，它自然地引入了用于选择超参数的模型选择。我们说明了我们在几个计算的示例中的方法的性能，包括从时间序列数据的动态系统的解卷积问题和稀疏识别。

translated by 谷歌翻译

Variational Inference for Additive Main and Multiplicative Interaction Effects Models

AntÔnia A. L. Dos Santos , Rafael A. Moral , Danilo A. Sarti , Andrew C. Parnell

分类： (统计)机器学习 | 机器学习

2022-06-29

在植物繁殖中，环境（GXE）相互作用的基因型存在对耕作决策和引入新作物品种的影响很大。线性和双线性项的组合已被证明在建模这种类型的数据方面非常有用。识别GXE的一种广泛使用的方法是加性主要效应和乘法交互作用（AMMI）模型。但是，由于数据经常可能是高维的，马尔可夫链蒙特卡洛（MCMC）方法在计算上可能是不可行的。在本文中，我们考虑了这种模型的变异推理方法。我们得出用于估计参数的变异近似值，并使用模拟和真实数据将近似值与MCMC进行比较。我们提出的新推论框架平均要快两倍，同时保持与MCMC相同的预测性能。

translated by 谷歌翻译

Variational Bayes for high-dimensional proportional hazards models with applications to gene expression variable selection

Michael Komodromos , Eric Aboagye , Marina Evangelou , Sarah Filippi , Kolyan Ray

分类： (统计)机器学习

2021-12-19

我们提出了一种变分贝叶斯比例危险模型，用于预测和可变选择的关于高维存活数据。我们的方法基于平均场变分近似，克服了MCMC的高计算成本，而保留有用的特征，提供优异的点估计，并通过后夹层概念提供可变选择的自然机制。我们提出的方法的性能通过广泛的仿真进行评估，并与其他最先进的贝叶斯变量选择方法进行比较，展示了可比或更好的性能。最后，我们展示了如何在两个转录组数据集上使用所提出的方法进行审查的生存结果，其中我们识别具有预先存在的生物解释的基因。

translated by 谷歌翻译

Community Detection in Weighted Multilayer Networks with Ambient Noise

Mark He , Dylan Lu , Jason Xu , Rose Mary Xavier

分类： (统计)机器学习

2021-02-24

我们介绍了一个新型的多层加权网络模型，该模型除了本地信号外，还考虑了全局噪声。该模型类似于多层随机块模型（SBM），但关键区别在于，跨层之间的块之间的相互作用在整个系统中是常见的，我们称之为环境噪声。单个块还以这些固定的环境参数为特征，以表示不属于其他任何地方的成员。这种方法允许将块同时聚类和类型化到信号或噪声中，以便更好地理解其在整个系统中的作用，而现有块模型未考虑。我们采用了分层变异推断的新颖应用来共同检测和区分块类型。我们称此模型为多层加权网络称为随机块（具有）环境噪声模型（SBANM），并开发了相关的社区检测算法。我们将此方法应用于费城神经发育队列中的受试者，以发现与精神病有关的具有共同心理病理学的受试者社区。

translated by 谷歌翻译

Forecast Evaluation in Large Cross-Sections of Realized Volatility

Christis Katsouris

分类： (统计)机器学习 | 机器学习

2021-12-09

在本文中，我们考虑了使用相同的预测精度测试程序在横截面依赖下实现了实现波动率测量的预测评估。在预测实现挥发性时，我们根据增强横截面评估模型的预测精度。在相等预测精度的零假设下，所采用的基准模型是标准的HAR模型，而在非相同的预测精度的替代方案下，预测模型是通过套索缩收估计的增强的HAR模型。我们通过结合测量误差校正以及横截面跳转分量测量来研究预报对模型规范的敏感性。使用数值实现评估模型的样本外预测评估。

translated by 谷歌翻译

Variational Gibbs inference for statistical model estimation from incomplete data

Vaidotas Simkus , Benjamin Rhodes , Michael U. Gutmann

分类：机器学习 | (统计)机器学习

2021-11-25

统计模型是机器学习的核心，具有广泛适用性，跨各种下游任务。模型通常由通过最大似然估计从数据估计的自由参数控制。但是，当面对现实世界数据集时，许多模型运行到一个关键问题：它们是在完全观察到的数据方面配制的，而在实践中，数据集会困扰缺失数据。来自不完整数据的统计模型估计理论在概念上类似于潜在变量模型的估计，其中存在强大的工具，例如变分推理（VI）。然而，与标准潜在变量模型相比，具有不完整数据的参数估计通常需要估计缺失变量的指数 - 许多条件分布，因此使标准的VI方法是棘手的。通过引入变分Gibbs推理（VGI），是一种新的通用方法来解决这个差距，以估计来自不完整数据的统计模型参数。我们在一组合成和实际估算任务上验证VGI，从不完整的数据中估算重要的机器学习模型，VAE和标准化流程。拟议的方法，同时通用，实现比现有的特定模型特定估计方法竞争或更好的性能。

translated by 谷歌翻译

Quasi Black-Box Variational Inference with Natural Gradients for Bayesian Learning

Martin Magris , Mostafa Shabani , Alexandros Iosifidis

分类： (统计)机器学习 | 机器学习

2022-05-23

We develop an optimization algorithm suitable for Bayesian learning in complex models. Our approach relies on natural gradient updates within a general black-box framework for efficient training with limited model-specific derivations. It applies within the class of exponential-family variational posterior distributions, for which we extensively discuss the Gaussian case for which the updates have a rather simple form. Our Quasi Black-box Variational Inference (QBVI) framework is readily applicable to a wide class of Bayesian inference problems and is of simple implementation as the updates of the variational posterior do not involve gradients with respect to the model parameters, nor the prescription of the Fisher information matrix. We develop QBVI under different hypotheses for the posterior covariance matrix, discuss details about its robust and feasible implementation, and provide a number of real-world applications to demonstrate its effectiveness.

translated by 谷歌翻译

Modeling Item Response Theory with Stochastic Variational Inference

Mike Wu , Richard L. Davis , Benjamin W. Domingue , Chris Piech , Noah Goodman

分类：机器学习 | (统计)机器学习

2021-08-26

项目反应理论（IRT）是一个无处不在的模型，可以根据他们对问题的回答理解人类行为和态度。大型现代数据集为捕捉人类行为的更多细微差别提供了机会，从而有可能改善心理测量模型，从而改善科学理解和公共政策。但是，尽管较大的数据集允许采用更灵活的方法，但许多用于拟合IRT模型的当代算法也可能具有禁止现实世界应用的巨大计算需求。为了解决这种瓶颈，我们引入了IRT的变异贝叶斯推理算法，并表明它在不牺牲准确性的情况下快速可扩展。将此方法应用于认知科学和教育的五个大规模项目响应数据集中，比替代推理算法更高的对数可能性和更高的准确性。然后，使用这种新的推论方法，我们将IRT概括为具有表现力的贝叶斯响应模型，利用深度学习的最新进展来捕获具有神经网络的非线性项目特征曲线（ICC）。使用TIMSS的特定级数学测试，我们显示我们的非线性IRT模型可以捕获有趣的不对称ICC。该算法实现是开源的，易于使用。

translated by 谷歌翻译

Stacking for Non-mixing Bayesian Computations: The Curse and Blessing of Multimodal Posteriors

Yuling Yao , Aki Vehtari , Andrew Gelman

分类： (统计)机器学习

2020-06-22

在使用多模式贝叶斯后部分布时，马尔可夫链蒙特卡罗（MCMC）算法难以在模式之间移动，并且默认变分或基于模式的近似推动将低估后不确定性。并且，即使找到最重要的模式，难以评估后部的相对重量。在这里，我们提出了一种使用MCMC，变分或基于模式的模式的并行运行的方法，以便尽可能多地击中多种模式或分离的区域，然后使用贝叶斯堆叠来组合这些用于构建分布的加权平均值的可扩展方法。通过堆叠从多模式后分布的堆叠，最小化交叉验证预测误差的结果，并且代表了比变分推断更好的不确定度，但它不一定是相当于渐近的，以完全贝叶斯推断。我们呈现理论一致性，其中堆叠推断逼近来自未衰退的模型和非混合采样器的真实数据生成过程，预测性能优于完全贝叶斯推断，因此可以被视为祝福而不是模型拼写下的诅咒。我们展示了几个模型家庭的实际实施：潜在的Dirichlet分配，高斯过程回归，分层回归，马蹄素变量选择和神经网络。

translated by 谷歌翻译

Machine Learning based Framework for Robust Price-Sensitivity Estimation with Application to Airline Pricing

Ravi Kumar , Shahin Boluki , Karl Isler , Jonas Rauch , Darius Walczak

分类： (统计)机器学习 | 机器学习

2022-05-04

We consider the problem of dynamic pricing of a product in the presence of feature-dependent price sensitivity. Developing practical algorithms that can estimate price elasticities robustly, especially when information about no purchases (losses) is not available, to drive such automated pricing systems is a challenge faced by many industries. Based on the Poisson semi-parametric approach, we construct a flexible yet interpretable demand model where the price related part is parametric while the remaining (nuisance) part of the model is non-parametric and can be modeled via sophisticated machine learning (ML) techniques. The estimation of price-sensitivity parameters of this model via direct one-stage regression techniques may lead to biased estimates due to regularization. To address this concern, we propose a two-stage estimation methodology which makes the estimation of the price-sensitivity parameters robust to biases in the estimators of the nuisance parameters of the model. In the first-stage we construct estimators of observed purchases and prices given the feature vector using sophisticated ML estimators such as deep neural networks. Utilizing the estimators from the first-stage, in the second-stage we leverage a Bayesian dynamic generalized linear model to estimate the price-sensitivity parameters. We test the performance of the proposed estimation schemes on simulated and real sales transaction data from the Airline industry. Our numerical studies demonstrate that our proposed two-stage approach reduces the estimation error in price-sensitivity parameters from 25\% to 4\% in realistic simulation settings. The two-stage estimation techniques proposed in this work allows practitioners to leverage modern ML techniques to robustly estimate price-sensitivities while still maintaining interpretability and allowing ease of validation of its various constituent parts.

translated by 谷歌翻译

Conjugate priors for count and rounded data regression

Daniel R. Kowal

分类： (统计)机器学习

2021-10-23

离散数据丰富，并且通常作为计数或圆形数据而出现。甚至对于线性回归模型，缀合格前沿和闭合形式的后部通常是不可用的，这需要近似诸如MCMC的后部推理。对于广泛的计数和圆形数据回归模型，我们介绍了能够闭合后部推理的共轭前沿。密钥后和预测功能可通过直接蒙特卡罗模拟来计算。至关重要的是，预测分布是离散的，以匹配数据的支持，并且可以在多个协变量中进行共同评估或模拟。这些工具广泛用途是线性回归，非线性模型，通过基础扩展，以及模型和变量选择。多种仿真研究表明计算，预测性建模和相对于现有替代方案的选择性的显着优势。

translated by 谷歌翻译

A similarity-based Bayesian mixture-of-experts model

Tianfang Zhang , Rasmus Bokrantz , Jimmy Olsson

分类： (统计)机器学习 | 机器学习

2020-12-03

我们提出了一种新的非参数混合物模型，用于多变量回归问题，灵感来自概率K-Nearthimest邻居算法。使用有条件指定的模型，对样本外输入的预测基于与每个观察到的数据点的相似性，从而产生高斯混合物表示的预测分布。在混合物组件的参数以及距离度量标准的参数上，使用平均场变化贝叶斯算法进行后推断，并具有基于随机梯度的优化过程。在与数据大小相比，输入 - 输出关系很复杂，预测分布可能偏向或多模式的情况下，输入相对较高的尺寸，该方法尤其有利。对五个数据集进行的计算研究，其中两个是合成生成的，这说明了我们的高维输入的专家混合物方法的明显优势，在验证指标和视觉检查方面都优于竞争者模型。

translated by 谷歌翻译

Maximum Likelihood from Incomplete Data Via the EM Algorithm

分类：

JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact

translated by 谷歌翻译

Sparse Horseshoe Estimation via Expectation-Maximisation

Shu Yu Tew , Daniel F. Schmidt , Enes Makalic

分类： (统计)机器学习 | 机器学习

2022-11-07

The horseshoe prior is known to possess many desirable properties for Bayesian estimation of sparse parameter vectors, yet its density function lacks an analytic form. As such, it is challenging to find a closed-form solution for the posterior mode. Conventional horseshoe estimators use the posterior mean to estimate the parameters, but these estimates are not sparse. We propose a novel expectation-maximisation (EM) procedure for computing the MAP estimates of the parameters in the case of the standard linear model. A particular strength of our approach is that the M-step depends only on the form of the prior and it is independent of the form of the likelihood. We introduce several simple modifications of this EM procedure that allow for straightforward extension to generalised linear models. In experiments performed on simulated and real data, our approach performs comparable, or superior to, state-of-the-art sparse estimation methods in terms of statistical performance and computational cost.

translated by 谷歌翻译