智能论文笔记

Stochastic Modeling of Tag Installation Error for Robust On-Manifold Tag-Based Visual-Inertial Localization

Navid Kayhani , Brenda McCabe , Angela P. Schoellig

分类：机器人

2022-09-13

包括无人驾驶汽车（UAV）在内的自动移动机器人因其在建筑中的应用而受到了极大的关注。这些平台具有极大的潜力，可以自动化和增强许多任务所需数据的质量和频率，例如施工时间表更新，检查和监视。强大的本地化是可靠部署自动机器人平台的关键推动力。自动化的机器人解决方案主要依靠全球定位系统（GPS）进行户外定位。但是，GPS信号在室内被拒绝，并且经常使用预建的环境图来室内定位。这需要通过对环境中的移动机器人进行远程操作来产生高质量的地图。这种方法不仅耗时且乏味，而且在室内建筑环境中也是不可靠的。布局随着施工的进度而变化，需要频繁的映射会话来支持自主任务。此外，依赖视觉特征的基于视觉解决方案的有效性在现场低质地和重复区域都受到高度影响。为了应对这些挑战，我们以前提出了使用Apriltags的低成本，轻巧的基于标签的视觉惯性定位方法。在这种方法中，标签是具有已知尺寸和位置的纸张可打印地标，代表环境的准图。由于标签放置/更换是一个手动过程，因此它会遭受人体错误。在这项工作中，我们研究了人体错误在手动标签安装过程中的影响，并提出了一种随机方法，以使用谎言组理论来解决这种不确定性。使用蒙特卡洛模拟，我们通过实验表明，在我们的Manifold配方中纳入的拟议随机模型可提高基于标签的定位对在现场手动标签安装中不可避免的瑕疵的鲁棒性和准确性。

translated by 谷歌翻译

POCD: Probabilistic Object-Level Change Detection and Volumetric Mapping in Semi-Static Scenes

Jingxing Qian , Veronica Chatrath , Jun Yang , James Servos , Angela P. Schoellig , Steven L. Waslander

分类：机器人

2022-05-02

保持最新的地图以反映现场的最新变化非常重要，尤其是在涉及在延长环境中操作的机器人重复遍历的情况。未发现的变化可能会导致地图质量恶化，导致本地化差，操作效率低下和机器人丢失。体积方法，例如截断的签名距离功能（TSDF），由于其实时生产致密而详细的地图，尽管在随着时间的推移随着时间的流逝而变化的地图更新仍然是一个挑战，但由于它们的实时生产而迅速获得了吸引力。我们提出了一个框架，该框架引入了一种新颖的概率对象状态表示，以跟踪对象在半静态场景中的姿势变化。该表示为每个对象共同对平稳性评分和TSDF变更度量进行建模。同时加入几何信息和语义信息的贝叶斯更新规则被得出以实现一致的在线地图维护。为了与最先进的方法一起广泛评估我们的方法，我们在仓库环境中发布了一个新颖的现实数据集。我们还评估了公共Toycar数据集。我们的方法优于半静态环境重建质量的最先进方法。

translated by 谷歌翻译

UTIL: An Ultra-wideband Time-difference-of-arrival Indoor Localization Dataset

Wenda Zhao , Abhishek Goudar , Xinyuan Qiao , Angela P. Schoellig

分类：机器人

2022-03-28

超宽带（UWB）基于到达的时间差异（TDOA）的定位最近已成为一种有希望的，低成本和可扩展的室内定位解决方案，这特别适合多机器人应用。但是，似乎缺乏公共数据集来基准在混乱的室内环境中新兴的UWB TDOA定位技术。为了填补这一空白，我们提供了一个全面的数据集由UWB TDOA识别实验和基于DeCawave的DWM1000 UWB模块的飞行实验组成。在识别实验中，我们在各种视线（LOS）和非线（NLOS）条件下收集了低级信号信息，包括信噪比（SNR）和功率差值。对于飞行实验，我们使用四个不同的锚点进行了累积的$ \ sim $ 150分钟的现实飞行，平均速度为0.45 m/s。在飞行过程中收集了包括UWB TDOA，惯性测量单元（IMU），光流，飞行时间（TOF）激光器和毫米精度的地面真相数据在内的原始传感器数据。数据集和开发套件可在https://utiasdsl.github.io/util-uwb-dataset/上获得。

translated by 谷歌翻译

Bridging the Model-Reality Gap with Lipschitz Network Adaptation

Siqi Zhou , Karime Pereida , Wenda Zhao , Angela P. Schoellig

分类：机器人 | 人工智能

2021-12-07

随着机器人在现实世界中冒险，他们受到无意义的动态和干扰。在相对静态和已知的操作环境中已成功地证明了基于传统的基于模型的控制方法。但是，当机器人的准确模型不可用时，基于模型的设计可能导致次优甚至不安全的行为。在这项工作中，我们提出了一种桥接模型 - 现实差距的方法，并且即使存在动态不确定性，也能够应用基于模型的方法。特别地，我们介绍基于学习的模型参考适应方法，其使机器人系统具有可能不确定的动态，表现为预定义的参考模型。反过来，参考模型可用于基于模型的控制器设计。与典型的模型参考调整控制方法相比，我们利用神经网络的代表性力量来捕获高度非线性动力学的不确定性，并通过在称为Lipschitz网络的特殊类型神经网络的建筑设计中编码认证嘴唇条件来捕获高度非线性动力学的不确定性和保证稳定性。即使我们的关于真正的机器人系统的先验知识有限，我们的方法也适用于一般的非线性控制仿射系统。我们展示了我们在飞行倒置摆的方法中的方法，其中一个搁板的四轮电机被挑战，以平衡倒挂摆在悬停或跟踪圆形轨迹时。

translated by 谷歌翻译

safe-control-gym: a Unified Benchmark Suite for Safe Learning-based Control and Reinforcement Learning in Robotics

Zhaocong Yuan , Adam W. Hall , Siqi Zhou , Lukas Brunke , Melissa Greeff , Jacopo Panerati , Angela P. Schoellig

分类：机器人 | 机器学习

2021-09-13

近年来，强化学习和基于学习的控制以及对他们的安全性的研究，这对于在现实世界机器人中的部署至关重要 - 都获得了重大的吸引力。但是，为了充分评估新结果的进度和适用性，我们需要工具来公平地比较控制和强化学习界提出的方法。在这里，我们提出了一个新的开源基准套件，称为“安全控制”套件，支持基于模型和基于数据的控制技术。我们为三个动态系统（Cart-Pole，1D和2D四极管）提供实现，以及两个控制任务 - 稳定和轨迹跟踪。我们建议扩展OpenAi的Gym API - 强化学习研究的事实上的标准 - （i）能够指定（和查询）符号动态和（ii）约束，以及（iii）（重复）（重复）在控制输入，状态测量和惯性特性。为了证明我们的建议并试图使研究社区更加紧密地结合在一起，我们展示了如何使用安全控制的gym定量比较传统控制领域的多种方法的控制绩效，数据效率和安全性控制和加强学习。

translated by 谷歌翻译

Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

Lukas Brunke , Melissa Greeff , Adam W. Hall , Zhaocong Yuan , Siqi Zhou , Jacopo Panerati , Angela P. Schoellig

分类：机器人 | 机器学习

2021-08-13

过去半年来，从控制和强化学习社区的真实机器人部署的安全学习方法的贡献数量急剧上升。本文提供了一种简洁的但整体审查，对利用机器学习实现的最新进展，以实现在不确定因素下的安全决策，重点是统一控制理论和加固学习研究中使用的语言和框架。我们的评论包括：基于学习的控制方法，通过学习不确定的动态，加强学习方法，鼓励安全或坚固性的加固学习方法，以及可以正式证明学习控制政策安全的方法。随着基于数据和学习的机器人控制方法继续获得牵引力，研究人员必须了解何时以及如何最好地利用它们在安全势在必行的现实情景中，例如在靠近人类的情况下操作时。我们突出了一些开放的挑战，即将在未来几年推动机器人学习领域，并强调需要逼真的物理基准的基准，以便于控制和加固学习方法之间的公平比较。

translated by 谷歌翻译

Safe Model-based Reinforcement Learning with Stability Guarantees

Felix Berkenkamp , Matteo Turchetta , Angela P. Schoellig , Andreas Krause

分类：

2017-05-23

Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorithm that explicitly considers safety, defined in terms of stability guarantees. Specifically, we extend control-theoretic results on Lyapunov stability verification and show how to use statistical models of the dynamics to obtain high-performance control policies with provable stability certificates. Moreover, under additional regularity assumptions in terms of a Gaussian process prior, we prove that one can effectively and safely collect data in order to learn about the dynamics and thus both improve control performance and expand the safe region of the state space. In our experiments, we show how the resulting algorithm can safely optimize a neural network policy on a simulated inverted pendulum, without the pendulum ever falling down.

translated by 谷歌翻译

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

David Ifeoluwa Adelani , Jesujoba Oluwadara Alabi , Angela Fan , Julia Kreutzer , Xiaoyu Shen , Machel Reid , Dana Ruiter , Dietrich Klakow , Peter Nabende , Ernie Chang

分类：自然语言处理

2022-05-04

语言模型预训练的最新进展利用大规模数据集创建多语言模型。但是，这些数据集中大多遗漏了低资源语言。这主要是因为网络上没有很好地表示口语，因此被排除在用于创建数据集的大规模爬网中。此外，这些模型的下游用户仅限于最初选择用于预训练的语言的选择。这项工作调查了如何最佳利用现有的预培训模型来为16种非洲语言创建低资源翻译系统。我们关注两个问题：1）如何将预训练的模型用于初始预培训中未包含的语言？ 2）生成的翻译模型如何有效地转移到新域？为了回答这些问题，我们创建了一个新的非洲新闻语料库，涵盖16种语言，其中8种语言不属于任何现有评估数据集的一部分。我们证明，将两种语言转移到其他语言和其他领域的最有效策略是，以少量的高质量翻译数据微调大型预训练模型。

translated by 谷歌翻译

Protein-Ligand Complex Generator & Drug Screening via Tiered Tensor Transform

Jonathan P. Mailoa , Zhaofeng Ye , Jiezhong Qiu , Chang-Yu Hsieh , Shengyu Zhang

分类：神经与进化计算

2023-01-03

Accurate determination of a small molecule candidate (ligand) binding pose in its target protein pocket is important for computer-aided drug discovery. Typical rigid-body docking methods ignore the pocket flexibility of protein, while the more accurate pose generation using molecular dynamics is hindered by slow protein dynamics. We develop a tiered tensor transform (3T) algorithm to rapidly generate diverse protein-ligand complex conformations for both pose and affinity estimation in drug screening, requiring neither machine learning training nor lengthy dynamics computation, while maintaining both coarse-grain-like coordinated protein dynamics and atomistic-level details of the complex pocket. The 3T conformation structures we generate are closer to experimental co-crystal structures than those generated by docking software, and more importantly achieve significantly higher accuracy in active ligand classification than traditional ensemble docking using hundreds of experimental protein conformations. 3T structure transformation is decoupled from the system physics, making future usage in other computational scientific domains possible.

translated by 谷歌翻译

Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang , David M. Blei , John P. Cunningham

分类： (统计)机器学习 | 机器学习

2023-01-02

Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or approximate inference. Rather, it can occur in classical probabilistic models even with exact inference, which we also demonstrate. Based on these results, we propose a class of latent-identifiable variational autoencoders, deep generative models which enforce identifiability without sacrificing flexibility. This model class resolves the problem of latent variable non-identifiability by leveraging bijective Brenier maps and parameterizing them with input convex neural networks, without special variational inference objectives or optimization tricks. Across synthetic and real datasets, latent-identifiable variational autoencoders outperform existing methods in mitigating posterior collapse and providing meaningful representations of the data.

translated by 谷歌翻译