智能论文笔记

Invariant Priors for Bayesian Quadrature

Masha Naslidnyk , Javier Gonzalez , Maren Mahsereci

分类： (统计)机器学习 | 机器学习

2021-12-02

贝叶斯正交（BQ）是一种基于型号的数值集成方法，可以通过编码和利用手头的集成任务的已知结构来提高样本效率。在本文中，我们探讨了在输入域中的一组基本变换下编码积分的不变性，特别是一些酉变换，例如旋转，轴翻转或点对称。与若干合成和一个现实世界应用相比，我们展示了卓越的性能的初步结果。

translated by 谷歌翻译

Fast and robust Bayesian Inference using Gaussian Processes with GPry

Jonas El Gammal , Nils Schöneberg , Jesús Torrado , Christian Fidler

分类： (统计)机器学习

2022-11-03

We present the GPry algorithm for fast Bayesian inference of general (non-Gaussian) posteriors with a moderate number of parameters. GPry does not need any pre-training, special hardware such as GPUs, and is intended as a drop-in replacement for traditional Monte Carlo methods for Bayesian inference. Our algorithm is based on generating a Gaussian Process surrogate model of the log-posterior, aided by a Support Vector Machine classifier that excludes extreme or non-finite values. An active learning scheme allows us to reduce the number of required posterior evaluations by two orders of magnitude compared to traditional Monte Carlo inference. Our algorithm allows for parallel evaluations of the posterior at optimal locations, further reducing wall-clock times. We significantly improve performance using properties of the posterior in our active learning scheme and for the definition of the GP prior. In particular we account for the expected dynamical range of the posterior in different dimensionalities. We test our model against a number of synthetic and cosmological examples. GPry outperforms traditional Monte Carlo methods when the evaluation time of the likelihood (or the calculation of theoretical observables) is of the order of seconds; for evaluation times of over a minute it can perform inference in days that would take months using traditional methods. GPry is distributed as an open source Python package (pip install gpry) and can also be found at https://github.com/jonaselgammal/GPry.

translated by 谷歌翻译

Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

Masaki Adachi , Satoshi Hayakawa , Martin Jørgensen , Harald Oberhauser , Michael A. Osborne

分类：机器学习 | (统计)机器学习

2022-06-09

贝叶斯后期和模型证据的计算通常需要数值整合。贝叶斯正交（BQ）是一种基于替代模型的数值整合方法，能够具有出色的样品效率，但其缺乏并行化阻碍了其实际应用。在这项工作中，我们提出了一种并行的（批次）BQ方法，该方法采用了核正素的技术，该技术具有证明是指数的收敛速率。另外，与嵌套采样一样，我们的方法允许同时推断后期和模型证据。重新选择了来自BQ替代模型的样品，通过内核重组算法获得一组稀疏的样品，需要可忽略的额外时间来增加批处理大小。从经验上讲，我们发现我们的方法显着优于在包括锂离子电池分析在内的各种现实世界数据集中，最先进的BQ技术和嵌套采样的采样效率。

translated by 谷歌翻译

Adjoint-aided inference of Gaussian process driven differential equations

Paterne Gahungu , Christopher W Lanyon , Mauricio A Alvarez , Engineer Bainomugisha , Michael Smith , Richard D. Wilkinson

分类： (统计)机器学习 | 机器学习

2022-02-09

线性系统发生在整个工程和科学中，最著名的是差分方程。在许多情况下，系统的强迫函数尚不清楚，兴趣在于使用对系统的嘈杂观察来推断强迫以及其他未知参数。在微分方程中，强迫函数是自变量（通常是时间和空间）的未知函数，可以建模为高斯过程（GP）。在本文中，我们展示了如何使用GP内核的截断基础扩展，如何使用线性系统的伴随有效地推断成GP的功能。我们展示了如何实现截短的GP的确切共轭贝叶斯推断，在许多情况下，计算的计算大大低于使用MCMC方法所需的计算。我们证明了普通和部分微分方程系统的方法，并表明基础扩展方法与数量适中的基础向量相近。最后，我们展示了如何使用贝叶斯优化来推断非线性模型参数（例如内核长度尺度）的点估计值。

translated by 谷歌翻译

NOMU: Neural Optimization-based Model Uncertainty

Jakob Heiss , Jakob Weissteiner , Hanna Wutte , Sven Seuken , Josef Teichmann

分类：机器学习 | 人工智能 | (统计)机器学习

2021-02-26

我们研究了回归中神经网络（NNS）的模型不确定性的方法。为了隔离模型不确定性的效果，我们专注于稀缺训练数据的无噪声环境。我们介绍了关于任何方法都应满足的模型不确定性的五个重要的逃亡者。但是，我们发现，建立的基准通常无法可靠地捕获其中一些逃避者，即使是贝叶斯理论要求的基准。为了解决这个问题，我们介绍了一种新方法来捕获NNS的模型不确定性，我们称之为基于神经优化的模型不确定性（NOMU）。 NOMU的主要思想是设计一个由两个连接的子NN组成的网络体系结构，一个用于模型预测，一个用于模型不确定性，并使用精心设计的损耗函数进行训练。重要的是，我们的设计执行NOMU满足我们的五个Desiderata。由于其模块化体系结构，NOMU可以为任何给定（先前训练）NN提供模型不确定性，如果访问其培训数据。我们在各种回归任务和无嘈杂的贝叶斯优化（BO）中评估NOMU，并具有昂贵的评估。在回归中，NOMU至少和最先进的方法。在BO中，Nomu甚至胜过所有考虑的基准。

translated by 谷歌翻译

Uncertainty Estimation for Computed Tomography with a Linearised Deep Image Prior

Javier Antorán , Riccardo Barbano , Johannes Leuschner , José Miguel Hernández-Lobato , Bangti Jin

分类：机器学习 | (统计)机器学习

2022-02-28

Existing deep-learning based tomographic image reconstruction methods do not provide accurate estimates of reconstruction uncertainty, hindering their real-world deployment. This paper develops a method, termed as the linearised deep image prior (DIP), to estimate the uncertainty associated with reconstructions produced by the DIP with total variation regularisation (TV). Specifically, we endow the DIP with conjugate Gaussian-linear model type error-bars computed from a local linearisation of the neural network around its optimised parameters. To preserve conjugacy, we approximate the TV regulariser with a Gaussian surrogate. This approach provides pixel-wise uncertainty estimates and a marginal likelihood objective for hyperparameter optimisation. We demonstrate the method on synthetic data and real-measured high-resolution 2D $\mu$CT data, and show that it provides superior calibration of uncertainty estimates relative to previous probabilistic formulations of the DIP. Our code is available at https://github.com/educating-dip/bayes_dip.

translated by 谷歌翻译

$π$VAE: a stochastic process prior for Bayesian deep learning with MCMC

Swapnil Mishra , Seth Flaxman , Tresnia Berah , Harrison Zhu , Mikko Pakkanen , Samir Bhatt

分类：机器学习 | (统计)机器学习

2020-02-17

随机过程提供了数学上优雅的方式模型复杂数据。从理论上讲，它们为可以编码广泛有趣的假设的功能类提供了灵活的先验。但是，实际上，难以通过优化或边缘化来有效推断，这一问题进一步加剧了大数据和高维输入空间。我们提出了一种新颖的变性自动编码器（VAE），称为先前的编码变量自动编码器（$ \ pi $ vae）。 $ \ pi $ vae是有限的交换且Kolmogorov一致的，因此是一个连续的随机过程。我们使用$ \ pi $ vae学习功能类的低维嵌入。我们表明，我们的框架可以准确地学习表达功能类，例如高斯流程，也可以学习函数的属性以启用统计推断（例如log高斯过程的积分）。对于流行的任务，例如空间插值，$ \ pi $ vae在准确性和计算效率方面都达到了最先进的性能。也许最有用的是，我们证明了所学的低维独立分布的潜在空间表示提供了一种优雅，可扩展的方法，可以在概率编程语言（例如Stan）中对随机过程进行贝叶斯推断。

translated by 谷歌翻译

Shallow and Deep Nonparametric Convolutions for Gaussian Processes

Thomas M. McDonald , Magnus Ross , Michael T. Smith , Mauricio A. Álvarez

分类： (统计)机器学习 | 机器学习

2022-06-17

高斯流程（GPS）实际应用的主要挑战是选择适当的协方差函数。 GPS的移动平均值或过程卷积的构建可以提供一些额外的灵活性，但仍需要选择合适的平滑核，这是非平凡的。以前的方法通过在平滑内核上使用GP先验，并通过扩展协方差来构建协方差函数，以绕过预先指定它的需求。但是，这样的模型在几种方面受到限制：它们仅限于单维输入，例如时间;它们仅允许对单个输出进行建模，并且由于推理并不简单，因此不会扩展到大型数据集。在本文中，我们引入了GPS的非参数过程卷积公式，该公式通过使用基于Matheron规则的功能采样方法来减轻这些弱点，以使用诱导变量的间域间采样进行快速采样。此外，我们提出了这些非参数卷积的组成，可作为经典深度GP模型的替代方案，并允许从数据中推断中间层的协方差函数。我们测试了单个输出GP，多个输出GPS和DEEP GPS在基准测试上的模型性能，并发现在许多情况下，我们的方法可以提供比标准GP模型的改进。

translated by 谷歌翻译

Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization

Jose Pablo Folch , Robert M Lee , Behrang Shafei , David Walz , Calvin Tsay , Mark van der Wilk , Ruth Misener

分类：机器学习 | (统计)机器学习

2022-11-11

Bayesian Optimization is a useful tool for experiment design. Unfortunately, the classical, sequential setting of Bayesian Optimization does not translate well into laboratory experiments, for instance battery design, where measurements may come from different sources and their evaluations may require significant waiting times. Multi-fidelity Bayesian Optimization addresses the setting with measurements from different sources. Asynchronous batch Bayesian Optimization provides a framework to select new experiments before the results of the prior experiments are revealed. This paper proposes an algorithm combining multi-fidelity and asynchronous batch methods. We empirically study the algorithm behavior, and show it can outperform single-fidelity batch methods and multi-fidelity sequential methods. As an application, we consider designing electrode materials for optimal performance in pouch cells using experiments with coin cells to approximate battery performance.

translated by 谷歌翻译

Active Learning for Saddle Point Calculation

Shuting Gu , Hongqiao Wang , Xiang Zhou

分类： (统计)机器学习 | 机器学习

2021-08-10

The saddle point (SP) calculation is a grand challenge for computationally intensive energy function in computational chemistry area, where the saddle point may represent the transition state (TS). The traditional methods need to evaluate the gradients of the energy function at a very large number of locations. To reduce the number of expensive computations of the true gradients, we propose an active learning framework consisting of a statistical surrogate model, Gaussian process regression (GPR) for the energy function, and a single-walker dynamics method, gentle accent dynamics (GAD), for the saddle-type transition states. SP is detected by the GAD applied to the GPR surrogate for the gradient vector and the Hessian matrix. Our key ingredient for efficiency improvements is an active learning method which sequentially designs the most informative locations and takes evaluations of the original model at these locations to train GPR. We formulate this active learning task as the optimal experimental design problem and propose a very efficient sample-based sub-optimal criterion to construct the optimal locations. We show that the new method significantly decreases the required number of energy or force evaluations of the original model.

translated by 谷歌翻译

Bayesian model calibration for block copolymer self-assembly: Likelihood-free inference and expected information gain computation via measure transport

Ricardo Baptista , Lianghao Cao , Joshua Chen , Omar Ghattas , Fengyi Li , Youssef M. Marzouk , J. Tinsley Oden

分类： (统计)机器学习

2022-06-22

我们考虑了使用显微镜或X射线散射技术产生的图像数据自组装的模型的贝叶斯校准。为了说明BCP平衡结构中的随机远程疾病，我们引入了辅助变量以表示这种不确定性。然而，这些变量导致了高维图像数据的综合可能性，通常可以评估。我们使用基于测量运输的可能性方法以及图像数据的摘要统计数据来解决这一具有挑战性的贝叶斯推理问题。我们还表明，可以计算出有关模型参数的数据中的预期信息收益（EIG），而无需额外的成本。最后，我们介绍了基于二嵌段共聚物薄膜自组装和自上而下显微镜表征的ohta-kawasaki模型的数值案例研究。为了进行校准，我们介绍了一些基于域的能量和傅立叶的摘要统计数据，并使用EIG量化了它们的信息性。我们证明了拟议方法研究数据损坏和实验设计对校准结果的影响的力量。

translated by 谷歌翻译

Uncertainty Quantification and Experimental Design for Large-Scale Linear Inverse Problems under Gaussian Process Priors

Cédric Travelletti , David Ginsbourger , Niklas Linde

分类： (统计)机器学习 | 机器学习

2021-09-08

我们考虑使用高斯工艺（GP）先验在贝叶斯框架中解决反问题。众所周知，GPS的计算复杂性在数据点数中立方缩放。我们在这里表明，在涉及整体操作员的反问题的背景下，人们面临的其他困难阻碍了大网格上的倒置。此外，在这种情况下，协方差矩阵可能会变得太大而无法存储。通过利用有关高斯措施的顺序分解的结果，我们能够引入后协方差矩阵的隐式表示，该矩阵仅通过存储低级中间矩阵来降低记忆足迹，同时允许在不用的情况下访问单个元素建立完整的后协方差矩阵。此外，它允许快速顺序包含新的观测值。在考虑顺序实验设计任务时，这些功能至关重要。我们通过计算重量逆问题的偏移集合恢复的顺序数据收集计划来证明我们的方法，该计划的目标是提供意大利Stromboli火山内高密度区域的精细分辨率估计。顺序数据收集计划是通过将加权集成方差降低（WIVR）标准扩展到反问题来计算的。我们的结果表明，该标准能够显着减少偏移量的不确定性，达到接近最小的残余不确定性水平。总体而言，我们的技术允许将概率模型的优势带到自然科学中引起的大规模逆问题上。

translated by 谷歌翻译

Nonparametric Multi-shape Modeling with Uncertainty Quantification

Hengrui Luo , Justin D. Strait

分类： (统计)机器学习 | 机器学习

2022-06-18

封闭曲线的建模和不确定性量化是形状分析领域的重要问题，并且可以对随后的统计任务产生重大影响。这些任务中的许多涉及封闭曲线的集合，这些曲线通常在多个层面上表现出结构相似性。以有效融合这种曲线间依赖性的方式对多个封闭曲线进行建模仍然是一个具有挑战性的问题。在这项工作中，我们提出并研究了一个多数输出（又称多输出），多维高斯流程建模框架。我们说明了提出的方法学进步，并在几个曲线和形状相关的任务上证明了有意义的不确定性量化的实用性。这种基于模型的方法不仅解决了用内核构造对封闭曲线（及其形状）的推断问题，而且还为通常对功能对象的多层依赖性的非参数建模打开了门。

translated by 谷歌翻译

Provably Reliable Large-Scale Sampling from Gaussian Processes

Anthony Stephenson , Robert Allison , Edward Pyzer-Knapp

分类： (统计)机器学习 | 机器学习

2022-11-15

When comparing approximate Gaussian process (GP) models, it can be helpful to be able to generate data from any GP. If we are interested in how approximate methods perform at scale, we may wish to generate very large synthetic datasets to evaluate them. Na\"{i}vely doing so would cost $\mathcal{O}(n^3)$ flops and $\mathcal{O}(n^2)$ memory to generate a size $n$ sample. We demonstrate how to scale such data generation to large $n$ whilst still providing guarantees that, with high probability, the sample is indistinguishable from a sample from the desired GP.

translated by 谷歌翻译

Batch Bayesian optimisation via density-ratio estimation with guarantees

Rafael Oliveira , Louis Tiao , Fabio Ramos

分类：机器学习 | 人工智能 | (统计)机器学习

2022-09-22

贝叶斯优化（BO）算法在涉及昂贵的黑盒功能的应用中表现出了显着的成功。传统上，BO被设置为一个顺序决策过程，该过程通过采集函数和先前的功能（例如高斯过程）来估计查询点的实用性。然而，最近，通过密度比率估计（BORE）对BO进行重新制定允许将采集函数重新诠释为概率二进制分类器，从而消除了对函数的显式先验和提高可伸缩性的需求。在本文中，我们介绍了对孔的遗憾和算法扩展的理论分析，并提高了不确定性估计。我们还表明，通过将问题重新提交为近似贝叶斯推断，可以自然地扩展到批处理优化设置。所得算法配备了理论性能保证，并在一系列实验中对其他批处理基本线进行了评估。

translated by 谷歌翻译

Non-Gaussian Process Regression

Yaman Kındap , Simon Godsill

分类： (统计)机器学习 | 机器学习

2022-09-07

标准GPS为行为良好的流程提供了灵活的建模工具。然而，预计与高斯的偏差有望在现实世界数据集中出现，结构异常值和冲击通常会观察到。在这些情况下，GP可能无法充分建模不确定性，并且可能会过度推动。在这里，我们将GP框架扩展到一类新的时间变化的GP，从而可以直接建模重尾非高斯行为，同时通过非均匀GPS表示的无限混合物保留了可拖动的条件GP结构。有条件的GP结构是通过在潜在转化的输入空间上调节观测值来获得的，并使用L \'{e} Vy过程对潜在转化的随机演变进行建模，该过程允许贝叶斯在后端预测密度和潜在转化中的贝叶斯推断功能。我们为该模型提供了马尔可夫链蒙特卡洛推理程序，并证明了与标准GP相比的潜在好处。

translated by 谷歌翻译

Bayesian Search for Robust Optima

Nicholas D. Sanders , Richard M. Everson , Jonathan E. Fieldsend , Alma A. M. Rahat

分类：机器学习 | (统计)机器学习

2019-04-25

许多昂贵的黑匣子优化问题对其输入敏感。在这些问题中，定位一个良好的设计区域更有意义，而不是一个可能的脆弱的最佳设计。昂贵的黑盒功能可以有效地优化贝叶斯优化，在那里高斯过程是在昂贵的功能之前的流行选择。我们提出了一种利用贝叶斯优化的强大优化方法，找到一种设计空间区域，其中昂贵的功能的性能对输入相对不敏感，同时保持质量好。这是通过从正在建模昂贵的功能的高斯进程的实现来实现这一点，并评估每个实现的改进。这些改进的期望可以用进化算法廉价地优化，以确定评估昂贵功能的下一个位置。我们描述了一个有效的过程来定位最佳预期改进。我们凭经验展示了评估候选不确定区域的昂贵功能的昂贵功能，该模型最不确定，或随机地产生最佳收敛与利用方案相比。我们在两个，五个和十个维度中说明了我们的六个测试功能的方法，并证明它能够优于来自文献的两种最先进的方法。我们还展示了我们的方法在4和8维中展示了两个真实问题，这涉及训练机器人臂，将物体推到目标上。

translated by 谷歌翻译

Physics-Informed Gaussian Process Regression Generalizes Linear PDE Solvers

Marvin Pförtner , Ingo Steinwart , Philipp Hennig , Jonathan Wenger

分类：机器学习 | (统计)机器学习

2022-12-23

Linear partial differential equations (PDEs) are an important, widely applied class of mechanistic models, describing physical processes such as heat transfer, electromagnetism, and wave propagation. In practice, specialized numerical methods based on discretization are used to solve PDEs. They generally use an estimate of the unknown model parameters and, if available, physical measurements for initialization. Such solvers are often embedded into larger scientific models or analyses with a downstream application such that error quantification plays a key role. However, by entirely ignoring parameter and measurement uncertainty, classical PDE solvers may fail to produce consistent estimates of their inherent approximation error. In this work, we approach this problem in a principled fashion by interpreting solving linear PDEs as physics-informed Gaussian process (GP) regression. Our framework is based on a key generalization of a widely-applied theorem for conditioning GPs on a finite number of direct observations to observations made via an arbitrary bounded linear operator. Crucially, this probabilistic viewpoint allows to (1) quantify the inherent discretization error; (2) propagate uncertainty about the model parameters to the solution; and (3) condition on noisy measurements. Demonstrating the strength of this formulation, we prove that it strictly generalizes methods of weighted residuals, a central class of PDE solvers including collocation, finite volume, pseudospectral, and (generalized) Galerkin methods such as finite element and spectral methods. This class can thus be directly equipped with a structured error estimate and the capability to incorporate uncertain model parameters and observations. In summary, our results enable the seamless integration of mechanistic models as modular building blocks into probabilistic models.

translated by 谷歌翻译

Batch Bayesian Optimization via Particle Gradient Flows

Enrico Crovini , Simon L. Cotter , Konstantinos Zygalakis , Andrew B. Duncan

分类： (统计)机器学习 | 机器学习

2022-09-10

贝叶斯优化（BO）方法试图找到目标功能的全球最佳功能，这些功能仅作为黑盒或昂贵的评估。这样的方法为目标函数构建了替代模型，从而量化了通过贝叶斯推论的替代物中的不确定性。客观评估是通过在每个步骤中最大化采集函数来依次确定的。但是，由于采集函数的非转换性，尤其是在批处理贝叶斯优化的情况下，该辅助优化问题可能是高度不平凡的，因此可以解决。在这项工作中，我们将批处理重新定义为在概率措施空间上的优化问题。我们基于多点预期改进来构建一个新的采集函数，该功能是概率度量空间的凸面。解决此“内部”优化问题的实用方案自然会作为该目标函数的梯度流。我们证明了这种新方法对不同基准函数的功效，并与最先进的批次BO方法进行了比较。

translated by 谷歌翻译

Additive Gaussian Processes Revisited

Xiaoyu Lu , Alexis Boukouvalas , James Hensman

分类： (统计)机器学习 | 机器学习

2022-06-20

高斯流程（GP）模型是一类灵活的非参数模型，具有丰富的代表力。通过使用具有添加剂结构的高斯工艺，可以在保持解释性的同时对复杂的响应进行建模。先前的工作表明，加性高斯工艺模型需要高维相互作用项。我们提出了正交添加剂（OAK），该核（OAK）对添加功能施加正交性约束，从而实现了功能关系的可识别，低维表示。我们将OAK内核连接到功能方差分析分解，并显示出稀疏计算方法的收敛速率。与黑盒模型相比，我们只有少量的添加剂低维术语，在保持可解释性的同时，橡木模型的预测性能相似或更好。

translated by 谷歌翻译