智能论文笔记

Hierarchy exploitation to detect missing annotations on hierarchical multi-label classification

Miguel Romero , Felipe Kenji Nakano , Jorge Finke , Camilo Rocha , Celine Vens

分类：机器学习

2022-07-13

在过去的十年中，基因组数据的可用性已成倍增长，这主要是由于新测序技术的发展。基于从增加的基因组数据中提取的基因（和基因产物）之间的相互作用，许多研究集中在基因与功能之间的关联鉴定上。尽管这些研究表现出了很大的希望，但用功能注释基因的问题仍然是一个悬而未决的挑战。在这项工作中，我们提出了一种检测层次多标签分类数据集中缺失注释的方法。我们提出了一种通过将汇总概率计算到每个实例的从叶子到根的类路径来利用类层次结构的方法。提出的方法是在预测缺失基因函数注释的背景下提出的，其中这些汇总概率被进一步用于选择一组通过体内实验验证的注释。各种大米的Oriza Sativa Japonica的实验，将类别的层次结合到该方法中通常会改善预测性能，而我们所提出的方法与文献相比，我们提出的方法可以提高结果。

translated by 谷歌翻译

Machine Learning and Thermography Applied to the Detection and Classification of Cracks in Building

Angela Busheska , Nara Almeida , Nicholas Sabella , Eudes de A. Rocha

分类：计算机视觉 | 机器学习

2022-12-30

Due to the environmental impacts caused by the construction industry, repurposing existing buildings and making them more energy-efficient has become a high-priority issue. However, a legitimate concern of land developers is associated with the buildings' state of conservation. For that reason, infrared thermography has been used as a powerful tool to characterize these buildings' state of conservation by detecting pathologies, such as cracks and humidity. Thermal cameras detect the radiation emitted by any material and translate it into temperature-color-coded images. Abnormal temperature changes may indicate the presence of pathologies, however, reading thermal images might not be quite simple. This research project aims to combine infrared thermography and machine learning (ML) to help stakeholders determine the viability of reusing existing buildings by identifying their pathologies and defects more efficiently and accurately. In this particular phase of this research project, we've used an image classification machine learning model of Convolutional Neural Networks (DCNN) to differentiate three levels of cracks in one particular building. The model's accuracy was compared between the MSX and thermal images acquired from two distinct thermal cameras and fused images (formed through multisource information) to test the influence of the input data and network on the detection results.

translated by 谷歌翻译

Overview of The MediaEval 2022 Predicting Video Memorability Task

Lorin Sweeney , Mihai Gabriel Constantin , Claire-Hélène Demarty , Camilo Fosco , Alba G. Seco de Herrera , Sebastian Halder , Graham Healy , Bogdan Ionescu , Ana Matran-Fernandez , Alan F. Smeaton

分类：计算机视觉 | 人工智能

2022-12-13

This paper describes the 5th edition of the Predicting Video Memorability Task as part of MediaEval2022. This year we have reorganised and simplified the task in order to lubricate a greater depth of inquiry. Similar to last year, two datasets are provided in order to facilitate generalisation, however, this year we have replaced the TRECVid2019 Video-to-Text dataset with the VideoMem dataset in order to remedy underlying data quality issues, and to prioritise short-term memorability prediction by elevating the Memento10k dataset as the primary dataset. Additionally, a fully fledged electroencephalography (EEG)-based prediction sub-task is introduced. In this paper, we outline the core facets of the task and its constituent sub-tasks; describing the datasets, evaluation metrics, and requirements for participant submissions.

translated by 谷歌翻译

Experiences from the MediaEval Predicting Media Memorability Task

Alba García Deco de Herrera , Mihai Gabriel Constantin , Chaire-Hélène Demarty , Camilo Fosco , Sebastian Halder , Graham Healy , Bogdan Ionescu , Ana Matran-Fernandez , Alan F. Smeaton , Mushfika Sultana

分类：计算机视觉 | 人工智能

2022-12-07

The Predicting Media Memorability task in the MediaEval evaluation campaign has been running annually since 2018 and several different tasks and data sets have been used in this time. This has allowed us to compare the performance of many memorability prediction techniques on the same data and in a reproducible way and to refine and improve on those techniques. The resources created to compute media memorability are now being used by researchers well beyond the actual evaluation campaign. In this paper we present a summary of the task, including the collective lessons we have learned for the research community.

translated by 谷歌翻译

Estimating Appearance Models for Image Segmentation via Tensor Factorization

Jeova Farias Sales Rocha Neto

分类：计算机视觉 | (统计)机器学习

2022-08-16

图像分割是计算机视觉中的核心任务之一，并且求解它通常取决于通过其组成区域的颜色分布对图像外观数据进行建模。尽管许多分割算法使用交替或隐式方法处理外观模型依赖性，但我们在这里提出了一种新方法，可以直接从图像估算它们，而无需先前有关基础分割的信息。我们的方法使用来自图像的局部高阶颜色统计信息作为潜在变量模型的基于张量分解的估计器的输入。这种方法能够估算多隔离图像中的模型，并在没有事先用户交互的情况下自动输出区域比例，从而克服了从事先尝试到此问题的缺点。我们还在许多具有挑战性的合成和真实成像方案中证明了我们提出的方法的性能，并表明它导致了有效的分割算法。

translated by 谷歌翻译

Active Learning for Computationally Efficient Distribution of Binary Evolution Simulations

Kyle Akira Rocha , Jeff J. Andrews , Christopher P. L. Berry , Zoheyr Doctor , Pablo Marchant , Vicky Kalogera , Scott Coughlin , Simone S. Bavera , Aaron Dotter , Tassos Fragos

分类：机器学习

2022-03-30

二进制恒星经历各种相互作用和进化阶段，对于预测和解释观察到的特性至关重要。具有完整恒星结构和进化模拟的二元种群合成在计算上需要大量的质量转移序列。最近开发的二元种群综合代码Posydon结合了梅萨二元星模拟的网格，然后将其插值以模拟大型大型二进制文件。计算高密度直线网格的传统方法对于高维网格，不可扩展，这是一系列金属性，旋转和偏心率的范围。我们提出了一种新的活跃学习算法PSY-CRI，该算法使用数据收集过程中的机器学习来适应和迭代选择目标模拟以运行，从而导致自定义，高性能的训练集。我们在玩具问题上测试PSY-CRIS，发现所得的训练集比常规或随机采样网格所需的模拟更少以进行准确的分类和回归。我们进一步将psy-cris应用于构建Mesa模拟动态网格的目标问题，我们证明，即使没有微调，仅$ \ sim 1/4 $的模拟集也足以足以达到相同的分类精度。当针对目标应用程序优化算法参数时，我们预计将进一步增益。我们发现，仅对分类进行优化可能会导致回归中的绩效损失，反之亦然。降低产生网格的计算成本将使Posydon的未来版本涵盖更多的输入参数，同时保留插值精度。

translated by 谷歌翻译

Complexity and Aesthetics in Generative and Evolutionary Art

Jon McCormack , Camilo Cruz Gambardella

分类：神经与进化计算

2022-01-05

在本文中，我们检查了复杂性的概念，因为它适用于生成和进化艺术和设计。复杂性具有许多不同，纪律的特定定义，例如物理系统（熵）的复杂性，信息复杂性的算法测量和“复杂系统”。我们将一系列不同的复杂度措施应用于三个不同的进化艺术数据集，并查看艺术家的复杂性和个人审美判断之间的相关性（在两个数据集的情况下）或生成3D形式的物理测量复杂性。我们的结果表明，每个集合和测量的相关程度都不同，表明没有整体“更好”的措施。但是，具体措施确实在各个数据集中表现良好，表明仔细选择可以增加使用此类措施的值。然后，我们通过对复杂性和美学的看法进行大规模调查来评估观众复杂度措施的价值。我们通过讨论生成和进化艺术中的直接措施的价值来得出结论，提高神经影像学和心理学的最新发现，这提出了人类审美判断的许多外在因素，超出了所判断的物体的可测量特性。

translated by 谷歌翻译

The Phonetic Footprint of Parkinson's Disease

Philipp Klumpp , Tomás Arias-Vergara , Juan Camilo Vásquez-Correa , Paula Andrea Pérez-Toro , Juan Rafael Orozco-Arroyave , Anton Batliner , Elmar Nöth

分类：人工智能 | 机器学习

2021-12-21

作为最普遍的神经退行性疾病之一，帕金森病（PD）对患者的精细运动技能产生了重大影响。在语音生产过程中不同铰接器的复杂相互作用和所需肌肉张力的实现变得越来越困难，从而导致发狂的言论。在受影响的个体中通常可以观察到元音不稳定性，浆液发音和慢演说的特征模式，并在先前的研究中分析以确定PD的存在和进展。在这项工作中，我们使用了专门培训的语音识别器，以研究PD如何影响患者的语音占地面积。我们重新发现了许多在以前的贡献中描述的模式，尽管我们的系统从未见过此前从未见过任何病理演讲。此外，我们可以表明来自神经网络的中间激活可以用作编码与个人疾病状态有关的信息的特征向量。我们还能够直接将演讲者的专家额定智能性与语音预测的平均置信相提并论。我们的结果支持假设，即培训能够分析PD语音的系统不一定需要病理数据。

translated by 谷歌翻译

Forensic Analysis of Synthetically Generated Scientific Images

Sara Mandelli , Davide Cozzolino , Joao P. Cardenuto , Daniel Moreira , Paolo Bestagini , Walter Scheirer , Anderson Rocha , Luisa Verdoliva , Stefano Tubaro , Edward J. Delp

分类：计算机视觉 | 人工智能

2021-12-16

综合产生的内容的广泛扩散是一种需要紧急对策的严重威胁。合成含量的产生不限于多媒体数据，如视频，照片或音频序列，但涵盖了可以包括生物图像的显着大面积，例如西幕和微观图像。在本文中，我们专注于检测综合生成的西幕图像。生物医学文献在很大程度上探讨了西部污染图像，已经表明了如何通过目视检查或标准取证检测器轻松地伪造这些图像。为了克服缺乏公开可用的数据集，我们创建了一个包含超过14k原始的西幕图像和18K合成的Western-Blot图像的新数据集，由三种不同的最先进的生成方法产生。然后，我们调查不同的策略来检测合成的Western印迹，探索二进制分类方法以及单级探测器。在这两种情况下，我们从不利用培训阶段的合成纤维图像。所达到的结果表明，即使在这些科学图像的合成版本未优化利用检测器，综合生成的西幕图像也可以具有良好的精度。

translated by 谷歌翻译

One System to Rule them All: a Universal Intent Recognition System for Customer Service Chatbots

Juan Camilo Vasquez-Correa , Juan Carlos Guerrero-Sierra , Jose Luis Pemberty-Tamayo , Juan Esteban Jaramillo , Andres Felipe Tejada-Castro

分类：自然语言处理

2021-12-15

客户服务Chatbots是对话系统，旨在为客户提供有关不同公司提供的产品/服务的信息。特别地，意图识别是自然语言低估Chatbot系统的能力的核心组件之一。在聊天训练识别的不同意图中，他们有一组是通用的任何客户服务Chatbot。普遍意图可以包括称呼，将对话交给人类代理人，告别。识别这些普遍意图的系统将非常有助于优化特定客户服务聊天训练过程。我们提出了一个普遍意图识别系统的发展，该系统受过培训，以识别28个不同的聊天跳闸中常见的11个意图组。拟议的系统考虑了最先进的单词嵌入模型，例如Word2VEC和BERT，基于卷积和经常性神经网络的深层分类器。所提出的模型能够区分这些普遍意图，均衡精度高达80.4 \％。此外，所提出的系统同样准确地识别短期和长文本请求中表达的意图。同时，错误分类错误通常发生在具有非常相似的语义领域，例如告别和正面评论之间。建议的系统将非常有帮助优化客户服务Chatbot的培训过程，因为我们的系统已经可用并检测到一些意图。与此同时，拟议的方法将是一个合适的基础模型，通过应用转移学习策略培训更具体的聊天措施。

translated by 谷歌翻译