智能论文笔记

Identification of Binary Neutron Star Mergers in Gravitational-Wave Data Using YOLO One-Shot Object Detection

João Aveiro , Felipe F. Freitas , Márcio Ferreira , Antonio Onofre , Constança Providência , Gonçalo Gonçalves , José A. Font

分类：计算机视觉 | 机器学习

2022-07-01

我们证明了Yolov5模型（一种基于通用卷积的单杆对象检测模型）的应用，在从当前生成干涉仪检测器的重力数据中检测到二进制中子星（BNS）聚合事件的任务。我们还基于用于模型训练，验证和测试步骤的大概波形模型对合成数据生成和准备任务的详尽说明。使用这种方法，我们实现平均平均精度（$ \ text {map} _ {[0.50]} $）的单个类验证数据集的值为0.945，测试数据集的平均值为0.945，高达0.978。此外，训练有素的模型成功地识别了LIGO H1检测器数据中的GW170817事件。 LIGO L1检测器数据也可以通过附加的预处理步骤进行识别，而无需在Inspiral的最后阶段消除大故障。 GW190425事件的检测不太成功，这证明了信噪比的性能退化。我们的研究表明，Yolov5模型是第一阶段检测警报管道的有趣方法，并且在整合到更复杂的管道中时，用于实时推断物理源参数。

translated by 谷歌翻译

Anomaly detection in laser-guided vehicles' batteries: a case study

Gianfranco Lombardo , Stefano Cagnoni , Stefano Cavalli , Juan José Contreras Gonzáles , Francesco Monica , Monica Mordonini , Michele Tomaiuolo

分类：机器学习

2022-12-27

Detecting anomalous data within time series is a very relevant task in pattern recognition and machine learning, with many possible applications that range from disease prevention in medicine, e.g., detecting early alterations of the health status before it can clearly be defined as "illness" up to monitoring industrial plants. Regarding this latter application, detecting anomalies in an industrial plant's status firstly prevents serious damages that would require a long interruption of the production process. Secondly, it permits optimal scheduling of maintenance interventions by limiting them to urgent situations. At the same time, they typically follow a fixed prudential schedule according to which components are substituted well before the end of their expected lifetime. This paper describes a case study regarding the monitoring of the status of Laser-guided Vehicles (LGVs) batteries, on which we worked as our contribution to project SUPER (Supercomputing Unified Platform, Emilia Romagna) aimed at establishing and demonstrating a regional High-Performance Computing platform that is going to represent the main Italian supercomputing environment for both computing power and data volume.

translated by 谷歌翻译

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Ioannis Tsiamas , José A. R. Fonollosa , Marta R. Costa-jussà

分类：自然语言处理

2022-12-19

Data scarcity is one of the main issues with the end-to-end approach for Speech Translation, as compared to the cascaded one. Although most data resources for Speech Translation are originally document-level, they offer a sentence-level view, which can be directly used during training. But this sentence-level view is single and static, potentially limiting the utility of the data. Our proposed data augmentation method SegAugment challenges this idea and aims to increase data availability by providing multiple alternative sentence-level views of a dataset. Our method heavily relies on an Audio Segmentation system to re-segment the speech of each document, after which we obtain the target text with alignment methods. The Audio Segmentation system can be parameterized with different length constraints, thus giving us access to multiple and diverse sentence-level views for each document. Experiments in MuST-C show consistent gains across 8 language pairs, with an average increase of 2.2 BLEU points, and up to 4.7 BLEU for lower-resource scenarios in mTEDx. Additionally, we find that SegAugment is also applicable to purely sentence-level data, as in CoVoST, and that it enables Speech Translation models to completely close the gap between the gold and automatic segmentation at inference time.

translated by 谷歌翻译

Going for GOAL: A Resource for Grounded Football Commentaries

Alessandro Suglia , José Lopes , Emanuele Bastianelli , Andrea Vanzo , Shubham Agarwal , Malvina Nikandrou , Lu Yu , Ioannis Konstas , Verena Rieser

分类：计算机视觉 | 自然语言处理

2022-11-08

Recent video+language datasets cover domains where the interaction is highly structured, such as instructional videos, or where the interaction is scripted, such as TV shows. Both of these properties can lead to spurious cues to be exploited by models rather than learning to ground language. In this paper, we present GrOunded footbAlL commentaries (GOAL), a novel dataset of football (or `soccer') highlights videos with transcribed live commentaries in English. As the course of a game is unpredictable, so are commentaries, which makes them a unique resource to investigate dynamic language grounding. We also provide state-of-the-art baselines for the following tasks: frame reordering, moment retrieval, live commentary retrieval and play-by-play live commentary generation. Results show that SOTA models perform reasonably well in most tasks. We discuss the implications of these results and suggest new tasks for which GOAL can be used. Our codebase is available at: https://gitlab.com/grounded-sport-convai/goal-baselines.

translated by 谷歌翻译

scikit-fda: A Python Package for Functional Data Analysis

Carlos Ramos-Carreño , José Luis Torrecilla , Miguel Carbajo-Berrocal , Pablo Marcos , Alberto Suárez

分类：机器学习 | (统计)机器学习

2022-11-04

The library scikit-fda is a Python package for Functional Data Analysis (FDA). It provides a comprehensive set of tools for representation, preprocessing, and exploratory analysis of functional data. The library is built upon and integrated in Python's scientific ecosystem. In particular, it conforms to the scikit-learn application programming interface so as to take advantage of the functionality for machine learning provided by this package: pipelines, model selection, and hyperparameter tuning, among others. The scikit-fda package has been released as free and open-source software under a 3-Clause BSD license and is open to contributions from the FDA community. The library's extensive documentation includes step-by-step tutorials and detailed examples of use.

translated by 谷歌翻译

Rethinking Motion Deblurring Training: A Segmentation-Based Method for Simulating Non-Uniform Motion Blurred Images

Guillermo Carbajal , Patricia Vitoria , Pablo Musé , José Lezama

分类：计算机视觉

2022-09-26

成功培训端到端的深网进行真实运动去缩合，需要尖锐/模糊的图像对数据集，这些数据集现实且多样化，足以实现概括以实现真实的图像。获得此类数据集仍然是一项具有挑战性的任务。在本文中，我们首先回顾了现有的Deblurring基准数据集的局限性，从泛化到野外模糊图像的角度。其次，我们提出了一种有效的程序方法，以基于一个简单而有效的图像形成模型来生成清晰/模糊的图像对。这允许生成几乎无限的现实和多样化的培训对。我们通过在模拟对上训练现有的DeBlurring架构，并在四个真实模糊图像的标准数据集中对其进行评估，从而证明了所提出的数据集的有效性。我们观察到使用建议方法训练时动态场景的真实运动毛线照片的最终任务的出色概括性能。

translated by 谷歌翻译

Stochastic strategies for patrolling a terrain with a synchronized multi-robot system

Luis E. Caraballo , José M. Díaz-Báñez , Ruy Fabila-Monroy , Carlos Hidalgo-Toscan

分类：机器人

2022-09-14

可以部署一组合作的空中机器人，以有效地巡逻地形，每个机器人都会在指定区域飞行，并定期与邻居共享信息，以保护或监督它。为了确保鲁棒性，以前对这些同步系统的作品提出了将机器人发送到相邻区域的情况，以防它检测到故障。为了处理不可预测性并提高确定性巡逻计划的效率，本文提出了随机策略，以涵盖在代理之间分配的领域。首先，在本文中针对两个指标进行了对随机过程的理论研究：\ emph {闲置时间}，这是两个连续观察到地形的任何点和\ emph {隔离时间}之间的预期时间，预期的时间}，预期的时间机器人没有与任何其他机器人通信的时间。之后，将随机策略与添加另一个指标的确定性策略进行了比较：\ emph {广播时间}，从机器人发出消息的那一刻，直到团队的所有其他机器人收到消息。模拟表明，理论结果与模拟和随机策略的表现非常吻合，其行为与文献中提出的确定性协议获得的行为相比。

translated by 谷歌翻译

ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology

Julio A. Placed , Juan J. Gómez Rodríguez , Juan D. Tardós , José A. Castellanos

分类：机器人

2022-09-08

长期以来，部署能够探索未知环境的自动驾驶机器人一直是与机器人社区有很大相关性的话题。在这项工作中，我们通过展示一个开源的活动视觉猛烈框架来朝着这个方向迈出一步基础姿势图提供的结构。通过仔细估计后验加权姿势图，在线实现了D-最佳决策，目的是在发生探索时改善本地化和映射不确定性。

translated by 谷歌翻译

Towards Situation Awareness and Attention Guidance in a Multiplayer Environment using Augmented Reality and Carcassonne

David Kadish , Arezoo Sarkheyli-Hägele , Jose Font , Diederick C. Niehorster , Thomas Pederson

分类：人工智能

2022-08-18

增强现实（AR）游戏是一个丰富的环境，用于研究和测试提供微妙的用户指导和培训的计算系统。在特定的计算机系统中，旨在增强用户状况意识的计算机系统受益于AR耳机中可用的传感器和计算功率。在这篇正在进行的论文中，我们提出了一个新的环境，以研究情况意识和注意力指导（SAAG）：棋盘游戏Carcassonne的增强现实版本。我们还介绍了生产SAAG管道的最初工作，包括创建游戏状态编码，游戏玩法AI的开发和培训以及情况建模和凝视跟踪系统的设计。

translated by 谷歌翻译

Enhancing Dynamic Mode Decomposition Workflow with In-Situ Visualization and Data Compression

Gabriel F. Barros , Malú Grave , José J. Camata , Alvaro L. G. A. Coutinho

分类：机器学习

2022-08-16

科学机器学习的进步改善了现代计算科学和工程应用。数据驱动的方法（例如动态模式分解（DMD））可以从动态系统生成的时空数据中提取相干结构，并推断上述系统的不同方案。时空数据作为快照，每次瞬间包含空间信息。在现代工程应用中，高维快照的产生可能是时间和/或资源要求。在本研究中，我们考虑了在大型数值模拟中增强DMD工作流程的两种策略：（i）快照压缩以减轻磁盘压力；（ii）使用原位可视化图像在运行时重建动力学（或部分）。我们通过两个3D流体动力学模拟评估我们的方法，并考虑DMD重建解决方案。结果表明，快照压缩大大减少了所需的磁盘空间。我们已经观察到，损耗的压缩将存储降低了几乎$ 50 \％$，而信号重建和其他关注数量的相对错误则较低。我们还使用原位可视化工具将分析扩展到了直接生成的数据，在运行时生成状态向量的图像文件。在大型模拟中，快照的产生可能足够慢，可以使用批处理算法进行推理。流DMD利用增量SVD算法，并随着每个新快照的到来更新模式。我们使用流式DMD来重建原位生成的图像的动力学。我们证明此过程是有效的，并且重建的动力学是准确的。

translated by 谷歌翻译