智能论文笔记

One Network, Many Robot: Generative Graphical Inverse Kinematics

Oliver Limoyo , Filip Marić , Matthew Giamou , Petra Alexson , Ivan Petrović , Jonathan Kelly

分类：机器人

2022-09-19

快速，可靠地找到准确的逆运动学（IK）解决方案仍然是机器人操纵的挑战性问题。现有的数值求解器广泛适用，但依赖于本地搜索技术来管理高度非关键目标函数。最近，基于学习的方法已显示出有望作为生成快速准确的IK结果的一种手段。可以轻松地将学习的求解器与端到端系统中的其他学习算法集成在一起。但是，基于学习的方法具有致命的脚跟：每个感兴趣的机器人都需要一个专门的模型，必须从头开始训练。为了解决这一关键缺点，我们研究了一种新颖的距离几何机器人表示，并与图形结构相结合，使我们能够利用图形神经网络（GNNS）的灵活性。我们使用这种方法来训练第一个学到的生成图形逆运动学（GGIK）求解器，它至关重要的是，“机器人 - 敏捷” - 单个模型能够为各种不同的机器人提供IK解决方案。此外，GGIK的生成性质使求解器可以同时生产大量不同的解决方案，并与最小的额外计算时间同行，使其适用于诸如基于采样的运动计划之类的应用。最后，GGIK可以通过提供可靠的初始化来补充本地IK求解器。这些优势以及使用与任务相关的先验并通过新数据不断改进的能力表明，GGIK有可能成为灵活的，基于学习的机器人操作系统的关键组成部分。

translated by 谷歌翻译

Motion Planning in Dynamic Environments Using Context-Aware Human Trajectory Prediction

Mark Nicholas Finean , Luka Petrović , Wolfgang Merkt , Ivan Marković , Ioannis Havoutis

分类：机器人

2022-01-13

多年来，运动规划，映射和人类轨迹预测的单独领域显着提出。然而，在提供能够使移动操纵器能够执行全身运动并考虑移动障碍物的预测运动时，文献在提供实际框架方面仍然稀疏。基于以前的优化的运动计划方法，使用距离字段遭受更新环境表示所需的高计算成本。我们证明，与从头划痕计算距离场相比，GPU加速预测的复合距离场显着降低计算时间。我们将该技术与完整的运动规划和感知框架集成，其占据动态环境中的人类的预测运动，从而实现了包含预测动作的反应性和先发制人的运动规划。为实现这一目标，我们提出并实施了一种新颖的人类轨迹预测方法，该方法结合了基于轨迹优化的运动规划的意图识别。我们在现实世界丰田人类支持机器人（HSR）上验证了我们的由Onboard Camera的现场RGB-D传感器数据验证了我们的结果框架。除了在公开的数据集提供分析外，我们还释放了牛津室内人类运动（牛津-IHM）数据集，并在人类轨迹预测中展示了最先进的性能。牛津-IHM数据集是一个人类轨迹预测数据集，人们在室内环境中的兴趣区域之间行走。静态和机器人安装的RGB-D相机都观察了用运动捕获系统跟踪的人员。

translated by 谷歌翻译

Convex Iteration for Distance-Geometric Inverse Kinematics

Matthew Giamou , Filip Marić , David M. Rosen , Valentin Peretroukhin , Nicholas Roy , Ivan Petrović , Jonathan Kelly

分类：机器人

2021-09-08

反向运动学（IK）是找到满足一个或多个末端效应器的位置或姿势的限制的机器人联合配置的问题。对于具有冗余自由度的机器人，通常存在无限，不透露的解决方案。当通过工作空间中的障碍施加碰撞限制时，IK问题进一步复杂。通常，不存在产生可行配置的闭合表达，促使使用数值解决方案方法。然而，这些方法依赖于局部优化非凸起问题，通常需要准确的初始化或许多重新初始化来收敛到有效的解决方案。在这项工作中，我们首先将复杂的工作空间约束制定逆运动学，作为凸的可行性问题，其低级可行点提供精确的IK解决方案。然后，我们呈现\ texttt {cidgik}（距离 - 几何反向运动学的凸迭代），这是一种解决这种可行性问题的算法，其具有旨在鼓励低秩最小化的半导体级程序的序列。我们的问题制定优雅地统一机器人的配置空间和工作空间约束：内在机器人几何形状和避免避免都表示为简单的线性矩阵方程和不等式。我们对各种流行的操纵器模型的实验结果比传统的非线性优化的方法更快，更准确的会聚，特别是在具有许多障碍的环境中。

translated by 谷歌翻译

Riemannian Optimization for Distance-Geometric Inverse Kinematics

Filip Marić , Matthew Giamou , Adam W. Hall , Soroush Khoubyarian , Ivan Petrović , Jonathan Kelly

分类：机器人

2021-08-31

解决逆运动学问题是针对清晰机器人的运动计划，控制和校准的基本挑战。这些机器人的运动学模型通常通过关节角度进行参数化，从而在机器人构型和最终效果姿势之间产生复杂的映射。或者，可以使用机器人附加点之间的不变距离来表示运动学模型和任务约束。在本文中，我们将基于距离的逆运动学的等效性和大量铰接式机器人和任务约束的距离几何问题进行形式化。与以前的方法不同，我们使用距离几何形状和低级别矩阵完成之间的连接来通过局部优化完成部分欧几里得距离矩阵来找到逆运动学解决方案。此外，我们用固定级革兰氏矩阵的Riemannian歧管来参数欧几里得距离矩阵的空间，从而使我们能够利用各种成熟的Riemannian优化方法。最后，我们表明，绑定的平滑性可用于生成知情的初始化，而无需大量的计算开销，从而改善收敛性。我们证明，我们的逆运动求解器比传统技术获得更高的成功率，并且在涉及许多工作区约束的问题上大大优于它们。

translated by 谷歌翻译

A Continuous-Time Approach for 3D Radar-to-Camera Extrinsic Calibration

Emmett Wise , Juraj Peršić , Christopher Grebe , Ivan Petrović , Jonathan Kelly

分类：机器人

2021-03-12

恶劣天气的可靠运行对于部署安全自治车辆（AVS）至关重要。通过熔化来自标准AV传感器套件（即，Lidars，Cameras）的数据，可以实现鲁棒性和可靠性，其中天气强壮的传感器，例如毫米波雷达。批判性地，精确的传感器数据融合需要了解传感器对之间的刚体变换，这可以通过外部校准的过程来确定。已经为2D（平面）雷达传感器设计了许多外部校准算法 - 然而，最近开发的低成本3D毫米波雷达被设定为在许多应用中取代其2D对应物。在本文中，我们提出了一种连续时间3D雷达 - 相机外在校准算法，其利用雷达速度测量，并且与大多数现有技术不同，不需要专门的雷达逆向反射器存在于环境中。我们推出了我们配方的可观察性性质，并通过合成和现实世界实验证明了我们的算法的功效。

translated by 谷歌翻译

A Riemannian Metric for Geometry-Aware Singularity Avoidance by Articulated Robots

Filip Marić , Luka Petrović , Marko Guberina , Jonathan Kelly , Ivan Petrović

分类：机器人

2021-03-09

诸如操纵器之类的铰接机器人必须在不确定和动态的环境中运行，例如，相互作用（例如与人类同事）是必要的。在这种情况下，必须快速适应操作空间限制的意外变化的能力至关重要。在操纵器的配置空间中的某些点（称为奇异点），机器人失去了一个或多个自由度（DOF），并且无法在特定的操作空间方向上移动。无法在操作空间中朝任意方向移动会损害适应性和安全性。我们引入了一个几何感知奇异性索引，该索引在对称正定定义矩阵上使用Riemannian度量定义，以提供与奇异构型的接近度的度量。我们证明我们的索引避免了其他共同指数固有的某些故障模式和困难。此外，我们表明该索引可以轻松区分，使其与用于操作空间控制的局部优化方法兼容。我们的实验结果表明，对于遵循任务的到达和路径，基于我们的索引优化优于一种常见的可操作性最大化技术，并确保奇异性运动动作。

translated by 谷歌翻译

Inverse Kinematics as Low-Rank Euclidean Distance Matrix Completion

Filip Marić , Matthew Giamou , Ivan Petrović , Jonathan Kelly

分类：机器人

2020-11-10

大多数逆运动学（IK）算法在由关节角定义的配置空间中搜索解决方案。然而，许多机器人的运动学也可以用刚性附着点之间的距离来描述，这些点齐构成了欧几里得距离矩阵。运动学的这种替代几何描述揭示了IK与低级矩阵完成问题之间的优雅等效性。我们使用这种连接来针对具有对称关节角度约束的各种铰接式机器人对IK实施新的基于Riemannian优化的解决方案。

translated by 谷歌翻译

Approaching Peak Ground Truth

Florian Kofler , Johannes Wahle , Ivan Ezhov , Sophia Wagner , Rami Al-Maskari , Emilia Gryska , Mihail Todorov , Christina Bukas , Felix Meissen , Tingying Peng

分类：机器学习 | 人工智能 | 计算机视觉

2022-12-31

Machine learning models are typically evaluated by computing similarity with reference annotations and trained by maximizing similarity with such. Especially in the bio-medical domain, annotations are subjective and suffer from low inter- and intra-rater reliability. Since annotations only reflect the annotation entity's interpretation of the real world, this can lead to sub-optimal predictions even though the model achieves high similarity scores. Here, the theoretical concept of Peak Ground Truth (PGT) is introduced. PGT marks the point beyond which an increase in similarity with the reference annotation stops translating to better Real World Model Performance (RWMP). Additionally, a quantitative technique to approximate PGT by computing inter- and intra-rater reliability is proposed. Finally, three categories of PGT-aware strategies to evaluate and improve model performance are reviewed.

translated by 谷歌翻译

MyI-Net: Fully Automatic Detection and Quantification of Myocardial Infarction from Cardiovascular MRI Images

Shuihua Wang , Ahmed M. S. E. K Abdelaty , Kelly Parke , J Ranjit Arnold , Gerry P McCann , Ivan Y Tyukin

分类：计算机视觉 | 机器学习

2022-12-28

A "heart attack" or myocardial infarction (MI), occurs when an artery supplying blood to the heart is abruptly occluded. The "gold standard" method for imaging MI is Cardiovascular Magnetic Resonance Imaging (MRI), with intravenously administered gadolinium-based contrast (late gadolinium enhancement). However, no "gold standard" fully automated method for the quantification of MI exists. In this work, we propose an end-to-end fully automatic system (MyI-Net) for the detection and quantification of MI in MRI images. This has the potential to reduce the uncertainty due to the technical variability across labs and inherent problems of the data and labels. Our system consists of four processing stages designed to maintain the flow of information across scales. First, features from raw MRI images are generated using feature extractors built on ResNet and MoblieNet architectures. This is followed by the Atrous Spatial Pyramid Pooling (ASPP) to produce spatial information at different scales to preserve more image context. High-level features from ASPP and initial low-level features are concatenated at the third stage and then passed to the fourth stage where spatial information is recovered via up-sampling to produce final image segmentation output into: i) background, ii) heart muscle, iii) blood and iv) scar areas. New models were compared with state-of-art models and manual quantification. Our models showed favorable performance in global segmentation and scar tissue detection relative to state-of-the-art work, including a four-fold better performance in matching scar pixels to contours produced by clinicians.

translated by 谷歌翻译

Statistical Mechanics of Generalization In Graph Convolution Networks

Cheng Shi , Liming Pan , Hong Hu , Ivan Dokmanić

分类：机器学习 | (统计)机器学习

2022-12-26

Graph neural networks (GNN) have become the default machine learning model for relational datasets, including protein interaction networks, biological neural networks, and scientific collaboration graphs. We use tools from statistical physics and random matrix theory to precisely characterize generalization in simple graph convolution networks on the contextual stochastic block model. The derived curves are phenomenologically rich: they explain the distinction between learning on homophilic and heterophilic graphs and they predict double descent whose existence in GNNs has been questioned by recent work. Our results are the first to accurately explain the behavior not only of a stylized graph learning model but also of complex GNNs on messy real-world datasets. To wit, we use our analytic insights about homophily and heterophily to improve performance of state-of-the-art graph neural networks on several heterophilic benchmarks by a simple addition of negative self-loop filters.

translated by 谷歌翻译