智能论文笔记

Improving language models by retrieving from trillions of tokens

Sebastian Borgeaud , Arthur Mensch , Jordan Hoffmann , Trevor Cai , Eliza Rutherford , Katie Millican , George van den Driessche , Jean-Baptiste Lespiau , Bogdan Damoc , Aidan Clark

分类：自然语言处理 | 机器学习

2021-12-08

我们通过与与前面令牌的局部相似度，通过调节从大语料库检索的文档块来增强自动回归语言模型。尽管使用25美元\时分，我们的检索增强型变压器（RetroCro）的检索增强型变压器（RetroCr）对GPT-3和侏罗纪-1获得了可比性的性能。微调后，复古表演转换为下游知识密集型任务，如问题应答。复古结合了冷冻BERT猎犬，一种可微分的编码器和块状的横向机制，以预测基于数量级的令牌，而不是训练期间通常消耗的数量。我们通常从头开始训练复古，还可以快速改造预先接受的变压器，通过检索，仍然达到良好的性能。我们的工作通过以前所未有的规模开辟了通过显式内存改进语言模型的新途径。

translated by 谷歌翻译

Transformation-based Adversarial Video Prediction on Large-Scale Data

Pauline Luc , Aidan Clark , Sander Dieleman , Diego de Las Casas , Yotam Doron , Albin Cassirer , Karen Simonyan

分类：计算机视觉 | 机器学习

2020-03-09

近期对抗性生成建模的突破导致了能够生产高质量的视频样本的模型，即使在真实世界视频的大型和复杂的数据集上也是如此。在这项工作中，我们专注于视频预测的任务，其中给出了从视频中提取的一系列帧，目标是生成合理的未来序列。我们首先通过对鉴别器分解进行系统的实证研究并提出产生更快的收敛性和更高性能的系统来提高本领域的最新技术。然后，我们分析发电机中的复发单元，并提出了一种新的复发单元，其根据预测的运动样本来改变其过去的隐藏状态，并改进它以处理DIS闭塞，场景变化和其他复杂行为。我们表明，这种经常性单位始终如一地优于以前的设计。我们的最终模型导致最先进的性能中的飞跃，从大型动力学-600数据集中获得25.7的测试集Frechet视频距离为25.7，下降到69.2。

translated by 谷歌翻译

Augmented Business Process Management Systems: A Research Manifesto

Marlon Dumas , Fabiana Fournier , Lior Limonad , Andrea Marrella , Marco Montali , Jana-Rebecca Rehse , Rafael Accorsi , Diego Calvanese , Giuseppe De Giacomo , Dirk Fahland

分类：人工智能

2022-01-30

增强业务流程管理系统（ABPMS）是一类新兴的过程感知信息系统，可利用值得信赖的AI技术。ABPMS增强了业务流程的执行，目的是使这些过程更加适应性，主动，可解释和上下文敏感。该宣言为ABPMS提供了愿景，并讨论了需要克服实现这一愿景的研究挑战。为此，我们定义了ABPM的概念，概述了ABPMS中流程的生命周期，我们讨论了ABPMS的核心特征，并提出了一系列挑战以实现具有这些特征的系统。

translated by 谷歌翻译

Gait Recognition Based on Deep Learning: A Survey

Claudio Filipi Gonçalves dos Santos , Diego de Souza Oliveira , Leandro A. Passos , Rafael Gonçalves Pires , Daniel Felipe Silva Santos , Lucas Pascotti Valem , Thierry P. Moreira , Marcos Cleison S. Santana , Mateus Roder , João Paulo Papa

分类：计算机视觉 | 机器学习

2022-01-10

通常，基于生物谱系的控制系统可能不依赖于各个预期行为或合作适当运行。相反，这种系统应该了解未经授权的访问尝试的恶意程序。文献中提供的一些作品建议通过步态识别方法来解决问题。这些方法旨在通过内在的可察觉功能来识别人类，尽管穿着衣服或配件。虽然该问题表示相对长时间的挑战，但是为处理问题的大多数技术存在与特征提取和低分类率相关的几个缺点，以及其他问题。然而，最近的深度学习方法是一种强大的一组工具，可以处理几乎任何图像和计算机视觉相关问题，为步态识别提供最重要的结果。因此，这项工作提供了通过步态认可的关于生物识别检测的最近作品的调查汇编，重点是深入学习方法，强调他们的益处，暴露出弱点。此外，它还呈现用于解决相关约束的数据集，方法和体系结构的分类和表征描述。

translated by 谷歌翻译

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

Kaustubh D. Dhole , Varun Gangal , Sebastian Gehrmann , Aadesh Gupta , Zhenhao Li , Saad Mahamood , Abinaya Mahendiran , Simon Mille , Ashish Srivastava , Samson Tan

分类：自然语言处理 | 人工智能 | 机器学习

2021-12-06

数据增强是自然语言处理（NLP）模型的鲁棒性评估的重要组成部分，以及增强他们培训的数据的多样性。在本文中，我们呈现NL-Cogmenter，这是一种新的参与式Python的自然语言增强框架，它支持创建两个转换（对数据的修改）和过滤器（根据特定功能的数据拆分）。我们描述了框架和初始的117个变换和23个过滤器，用于各种自然语言任务。我们通过使用其几个转换来分析流行自然语言模型的鲁棒性来证明NL-Upmenter的功效。基础架构，Datacards和稳健性分析结果在NL-Augmenter存储库上公开可用（\ url {https://github.com/gem-benchmark/nl-augmenter}）。

translated by 谷歌翻译

A Discrete Variational Derivation of Accelerated Methods in Optimization

Cédric M. Campos , Alejandro Mahillo , David Martín de Diego

分类：机器学习

2021-06-04

机器学习中的许多新的发展都与基于梯度的优化方法相连。最近，已经使用变分透视研究了这些方法。这已经开辟了使用几何集成引入变分和辛方法的可能性。特别是，在本文中，我们引入了变分集成商，使我们能够导出不同的优化方法。使用汉密尔顿和拉格朗日 - 德尔尔堡的原则，我们在一对一的对应中获得了两个各自的优化方法的一个家庭，即概括Polyak的厚球和众所周知的Nesterov加速梯度方法，其中第二个是模仿行为的第二个对应首先减少经典动量方法的振荡。然而，由于考虑的系统是明确时间依赖的，因此自主系统的杂交的保存仅在这里发生在纤维上。几个实验举例说明结果。

translated by 谷歌翻译

AHMoSe: A Knowledge-Based Visual Support System for Selecting Regression Machine Learning Models

Diego Rojo , Nyi Nyi Htun , Denis Parra , Robin De Croon , Katrien Verbert

分类：机器学习

2021-01-28

决策支持系统在农业领域越来越受欢迎。随着自动化机器学习的发展，农业专家现在能够使用切削刃机器学习（ML）模型来培训，评估和做出预测，而无需大得多。虽然这种自动化方法导致了许多情况下的成功结果，但在某些情况下（例如，当有很多标记的数据集可用时）选择具有类似性能度量的不同模型中是一项艰巨的任务。此外，这些系统通常不允许用户纳入其域知识，这些域知识可以促进模型选择的任务，并深入了解最终决策的预测系统。为了解决这些问题，在本文中，我们展示了一种视觉支持系统，允许域专家更好地理解，诊断和比较不同的回归模型，主要是通过丰富具有域知识的模型不可知的解释。为了验证AHMOSE，我们描述了葡萄栽培领域的用例场景，葡萄质量预测，系统使用户能够诊断和选择更好的预测模型。我们还讨论了关于ML和葡萄栽培专家的工具设计的反馈。

translated by 谷歌翻译

The Undesirable Dependence on Frequency of Gender Bias Metrics Based on Word Embeddings

Francisco Valentini , Germán Rosati , Diego Fernandez Slezak , Edgar Altszyler

分类：自然语言处理 | 人工智能

2023-01-02

Numerous works use word embedding-based metrics to quantify societal biases and stereotypes in texts. Recent studies have found that word embeddings can capture semantic similarity but may be affected by word frequency. In this work we study the effect of frequency when measuring female vs. male gender bias with word embedding-based bias quantification methods. We find that Skip-gram with negative sampling and GloVe tend to detect male bias in high frequency words, while GloVe tends to return female bias in low frequency words. We show these behaviors still exist when words are randomly shuffled. This proves that the frequency-based effect observed in unshuffled corpora stems from properties of the metric rather than from word associations. The effect is spurious and problematic since bias metrics should depend exclusively on word co-occurrences and not individual word frequencies. Finally, we compare these results with the ones obtained with an alternative metric based on Pointwise Mutual Information. We find that this metric does not show a clear dependence on frequency, even though it is slightly skewed towards male bias across all frequencies.

translated by 谷歌翻译

DensePose From WiFi

Jiaqi Geng , Dong Huang , Fernando De la Torre

分类：计算机视觉

2022-12-31

Advances in computer vision and machine learning techniques have led to significant development in 2D and 3D human pose estimation from RGB cameras, LiDAR, and radars. However, human pose estimation from images is adversely affected by occlusion and lighting, which are common in many scenarios of interest. Radar and LiDAR technologies, on the other hand, need specialized hardware that is expensive and power-intensive. Furthermore, placing these sensors in non-public areas raises significant privacy concerns. To address these limitations, recent research has explored the use of WiFi antennas (1D sensors) for body segmentation and key-point body detection. This paper further expands on the use of the WiFi signal in combination with deep learning architectures, commonly used in computer vision, to estimate dense human pose correspondence. We developed a deep neural network that maps the phase and amplitude of WiFi signals to UV coordinates within 24 human regions. The results of the study reveal that our model can estimate the dense pose of multiple subjects, with comparable performance to image-based approaches, by utilizing WiFi signals as the only input. This paves the way for low-cost, broadly accessible, and privacy-preserving algorithms for human sensing.

translated by 谷歌翻译

Machine Learning and Thermography Applied to the Detection and Classification of Cracks in Building

Angela Busheska , Nara Almeida , Nicholas Sabella , Eudes de A. Rocha

分类：计算机视觉 | 机器学习

2022-12-30

Due to the environmental impacts caused by the construction industry, repurposing existing buildings and making them more energy-efficient has become a high-priority issue. However, a legitimate concern of land developers is associated with the buildings' state of conservation. For that reason, infrared thermography has been used as a powerful tool to characterize these buildings' state of conservation by detecting pathologies, such as cracks and humidity. Thermal cameras detect the radiation emitted by any material and translate it into temperature-color-coded images. Abnormal temperature changes may indicate the presence of pathologies, however, reading thermal images might not be quite simple. This research project aims to combine infrared thermography and machine learning (ML) to help stakeholders determine the viability of reusing existing buildings by identifying their pathologies and defects more efficiently and accurately. In this particular phase of this research project, we've used an image classification machine learning model of Convolutional Neural Networks (DCNN) to differentiate three levels of cracks in one particular building. The model's accuracy was compared between the MSX and thermal images acquired from two distinct thermal cameras and fused images (formed through multisource information) to test the influence of the input data and network on the detection results.

translated by 谷歌翻译