Deep convolutional neural networks (DCNNs), originally inspired by principles of biological vision, have evolved into the best current computational models of object recognition, showing strong structural and functional parallels with the ventral visual pathway in comparisons with both neuroimaging and neural time-series data. As recent advances in deep learning appear to decrease this similarity, computational neuroscience is challenged to reverse-engineer biological plausibility back into useful models. While previous studies have shown that biologically inspired architectures can increase the human-likeness of models, in this study we investigate a purely data-driven approach. We use human eye-tracking data to directly modify training examples and thereby guide the models' visual attention during object recognition in natural images either toward or away from the focus of human fixations. We compare and validate the different manipulation types (i.e., standard, human-like, and non-human-like attention) through Grad-CAM saliency maps against eye-tracking data from human participants. Our results show that the proposed guided-focus manipulations work as intended in the negative direction: non-human-like models focus on significantly different image parts than humans do. The observed effects were highly category-specific, were enhanced by the presence of animate objects and faces, emerged only after feedforward processing was completed, and suggest a strong influence of face detection. An increase in human-likeness, however, was not found with this approach. Possible applications of overt visual attention in DCNNs and further implications for theories of face detection are discussed.
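To make the comparison step concrete, here is a minimal sketch of Grad-CAM saliency extraction and its correlation with a human fixation density map. The hook-based implementation and the Pearson correlation metric are illustrative assumptions, not the authors' exact pipeline; `model`, `conv_layer`, and `fixation_density` are hypothetical placeholders.

```python
# Minimal Grad-CAM sketch plus a saliency-vs-fixation comparison (illustrative only).
import torch
import torch.nn.functional as F
import numpy as np

def grad_cam(model, image, target_class, conv_layer):
    """Return a normalized Grad-CAM map for `target_class`.

    `conv_layer` is assumed to be the last convolutional layer of `model`;
    `image` is a (1, C, H, W) tensor.
    """
    activations, gradients = [], []
    h1 = conv_layer.register_forward_hook(lambda m, i, o: activations.append(o))
    h2 = conv_layer.register_full_backward_hook(lambda m, gi, go: gradients.append(go[0]))
    try:
        logits = model(image)
        model.zero_grad()
        logits[0, target_class].backward()
    finally:
        h1.remove(); h2.remove()
    acts, grads = activations[0], gradients[0]        # both (1, K, h, w)
    weights = grads.mean(dim=(2, 3), keepdim=True)    # global-average-pooled gradients
    cam = F.relu((weights * acts).sum(dim=1))         # weighted channel sum, (1, h, w)
    cam = F.interpolate(cam.unsqueeze(1), size=image.shape[-2:], mode="bilinear")
    cam = cam.squeeze().detach().numpy()
    return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)

def saliency_fixation_correlation(cam, fixation_density):
    """Pearson correlation between a model saliency map and a fixation density map."""
    return np.corrcoef(cam.ravel(), fixation_density.ravel())[0, 1]
```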
Image classification with small datasets has been an active research area in the recent past. However, as research in this scope is still in its infancy, two key ingredients are missing for ensuring reliable and truthful progress: a systematic and extensive overview of the state of the art, and a common benchmark to allow for objective comparisons between published methods. This article addresses both issues. First, we systematically organize and connect past studies to consolidate a community that is currently fragmented and scattered. Second, we propose a common benchmark that allows for an objective comparison of approaches. It consists of five datasets spanning various domains (e.g., natural images, medical imagery, satellite data) and data types (RGB, grayscale, multispectral). We use this benchmark to re-evaluate the standard cross-entropy baseline and ten existing methods published between 2017 and 2021 at renowned venues. Surprisingly, we find that thorough hyper-parameter tuning on held-out validation data results in a highly competitive baseline and highlights a stunted growth of performance over the years. Indeed, only a single specialized method dating back to 2019 clearly wins our benchmark and outperforms the baseline classifier.
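The competitive baseline comes down to a simple protocol: tune the cross-entropy classifier's hyper-parameters purely on held-out validation accuracy. The sketch below illustrates this under stated assumptions; `train_model` and `evaluate` are hypothetical placeholders for any training and evaluation routine, and the grid values are illustrative rather than the article's exact search space.

```python
# Schematic hyper-parameter tuning for a cross-entropy baseline on held-out data.
import itertools

def tune_baseline(train_set, val_set, train_model, evaluate):
    grid = {
        "lr": [1e-1, 1e-2, 1e-3],
        "weight_decay": [1e-3, 1e-4, 1e-5],
    }
    best_acc, best_cfg = -1.0, None
    for lr, wd in itertools.product(grid["lr"], grid["weight_decay"]):
        model = train_model(train_set, lr=lr, weight_decay=wd)  # plain cross-entropy loss
        acc = evaluate(model, val_set)                          # held-out validation accuracy
        if acc > best_acc:
            best_acc, best_cfg = acc, {"lr": lr, "weight_decay": wd}
    return best_cfg, best_acc
```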
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as the bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical image analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants (32%) stated that they did not have enough time for method development. 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based; of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants, and only 50% performed ensembling, based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
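Two of the practices the survey quantifies, k-fold cross-validation on the training set and ensembling of identically configured models, combine naturally, as in this minimal sketch. The scikit-learn-style `make_model` factory is a hypothetical placeholder, not anything from the survey itself.

```python
# Train one model per fold and soft-vote their test predictions (illustrative sketch).
import numpy as np
from sklearn.model_selection import KFold

def kfold_ensemble_predict(X, y, X_test, make_model, k=5):
    """K-fold training on (X, y); returns averaged class probabilities on X_test."""
    fold_preds = []
    for train_idx, _ in KFold(n_splits=k, shuffle=True, random_state=0).split(X):
        model = make_model()                       # fresh, identical architecture per fold
        model.fit(X[train_idx], y[train_idx])
        fold_preds.append(model.predict_proba(X_test))
    return np.mean(fold_preds, axis=0)             # soft-voting ensemble of the fold models
```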
Brain-inspired computing proposes a set of algorithmic principles that hold promise for advancing artificial intelligence. They endow systems with self-learning capabilities, efficient energy usage, and high storage capacity. A core concept that lies at the heart of brain computation is sequence learning and prediction. This form of computation is essential for almost all of our daily tasks, such as movement generation, perception, and language. Understanding how the brain performs such a computation is not only important for advancing neuroscience but also paves the way to new brain-inspired technological applications. A previously developed spiking neural network implementation of sequence prediction and recall learns complex, high-order sequences in an unsupervised manner via local, biologically inspired plasticity rules. An emerging type of hardware that holds promise for efficiently running this type of algorithm is neuromorphic hardware. It emulates the way the brain processes information and maps neurons and synapses directly onto a physical substrate. Memristive devices have been identified as potential synaptic elements in neuromorphic hardware. In particular, redox-induced resistive random access memory (ReRAM) devices stand out in many respects. They permit scalability, are energy-efficient and fast, and can implement biological plasticity rules. In this work, we study the feasibility of using ReRAM devices as a replacement for the biological synapses in the sequence learning model. We implement and simulate the model, including the ReRAM plasticity, using the neural simulator NEST. We investigate the effect of different device properties on the performance characteristics of the sequence learning model and demonstrate resilience with respect to different on-off ratios, conductance resolutions, device variability, and synaptic failure.
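The four device properties investigated (on-off ratio, conductance resolution, variability, and synaptic failure) can be captured in a toy conductance-update rule like the one below. This is a plain-Python sketch under assumed parameter values, not the authors' NEST implementation; all names and defaults are illustrative.

```python
# Toy ReRAM-like synapse: bounded, quantized conductance with noise and failure.
import numpy as np

rng = np.random.default_rng(0)

def reram_update(g, dg, g_min=1e-6, on_off_ratio=100.0,
                 n_levels=64, variability=0.05, p_fail=0.1):
    """Apply a plasticity-driven conductance change `dg` to conductance `g`."""
    g_max = g_min * on_off_ratio                # on-off ratio bounds the window
    if rng.random() < p_fail:                   # synaptic failure: update is skipped
        return g
    dg = dg * (1.0 + variability * rng.standard_normal())  # device-to-device variability
    g = np.clip(g + dg, g_min, g_max)           # keep conductance inside the window
    step = (g_max - g_min) / (n_levels - 1)     # finite conductance resolution
    return g_min + np.round((g - g_min) / step) * step
```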
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
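The released checkpoints can be loaded through the Hugging Face `transformers` library, as in the sketch below. The smaller `bigscience/bloom-560m` variant is used here so the example runs on modest hardware; substitute `bigscience/bloom` for the full 176B-parameter model.

```python
# Load a released BLOOM checkpoint and generate a short continuation.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-560m")
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

inputs = tokenizer("BLOOM is an open-access language model that", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```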
Adversarial patch-based attacks aim to fool a neural network with intentionally generated noise that is concentrated in a particular region of the input image. In this work, we perform an in-depth analysis of different patch generation parameters, including initialization, patch size, and especially the positioning of the patch within the image during training. We focus on the object-vanishing attack and run experiments with YOLOv3 as the attacked model in a white-box setting, using images from the COCO dataset. Our experiments show that inserting the patch inside a window of increasing size during training leads to a significant increase in attack strength compared to a fixed position. The best results were obtained when the patch position was randomized during training, with the position additionally varying within each batch.
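The three placement strategies compared above (fixed position, a window of increasing size, and fully random placement) can be sketched as follows. All names are illustrative, not the authors' code; the window-growth schedule is an assumed linear ramp.

```python
# Patch placement strategies during adversarial patch training (illustrative sketch).
import numpy as np

rng = np.random.default_rng(0)

def place_patch(image, patch, strategy="random", step=0, max_steps=10_000):
    """Paste `patch` (h, w, c) into a copy of `image` (H, W, c)."""
    H, W = image.shape[:2]
    h, w = patch.shape[:2]
    if strategy == "fixed":
        y, x = (H - h) // 2, (W - w) // 2          # always the image center
    elif strategy == "growing_window":
        frac = min(1.0, step / max_steps)          # window expands over training
        dy = int(frac * (H - h) / 2)
        dx = int(frac * (W - w) / 2)
        y = (H - h) // 2 + rng.integers(-dy, dy + 1)
        x = (W - w) // 2 + rng.integers(-dx, dx + 1)
    else:                                          # "random": anywhere in the image
        y = rng.integers(0, H - h + 1)
        x = rng.integers(0, W - w + 1)
    out = image.copy()
    out[y:y + h, x:x + w] = patch
    return out
```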
The massive use of artificial neural networks (ANNs), increasingly popular in many areas of scientific computing, is rapidly driving up the energy consumption of modern high-performance computing systems. Novel neuromorphic paradigms offer an appealing alternative by implementing ANNs directly in hardware. However, little is known about the actual benefits of running ANNs on neuromorphic hardware for use cases in scientific computing. Here we present a methodology for measuring the time and energy required to compute inference tasks with ANNs on conventional hardware. In addition, we design an architecture for these tasks and estimate the same metrics based on a state-of-the-art analog in-memory computing (AIMC) platform, one of the key paradigms in neuromorphic computing. Both approaches are compared for a use case in quantum many-body physics in two-dimensional condensed matter systems and for anomaly detection at a 40 MHz rate at the Large Hadron Collider in particle physics. We find that, compared to conventional hardware, AIMC can achieve up to one order of magnitude shorter computation times and up to three orders of magnitude smaller energy costs. This suggests the potential of neuromorphic hardware for faster and more sustainable scientific computing.
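A back-of-the-envelope version of the conventional-hardware side of this methodology is shown below: time the inference loop and estimate energy as average power times time. The power figure and the `run_inference` callable are placeholder assumptions; a real measurement would use hardware power counters, and the AIMC side would be estimated from the platform's device parameters rather than measured this way.

```python
# Rough wall-clock time and energy estimate for inference on conventional hardware.
import time

def measure_inference(run_inference, n_runs=100, avg_power_watts=250.0):
    """Return (seconds per inference, estimated joules per inference)."""
    start = time.perf_counter()
    for _ in range(n_runs):
        run_inference()
    t_per_inference = (time.perf_counter() - start) / n_runs
    return t_per_inference, t_per_inference * avg_power_watts

# The comparison then reduces to ratios against the AIMC estimates, e.g.:
# speedup = t_conventional / t_aimc; energy_ratio = e_conventional / e_aimc
```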
In this paper, we explore the capabilities of spiking neural networks for solving multi-task classification problems using the approach of single-tasking of multiple tasks. We design and implement a multi-task spiking neural network (MT-SNN) that can learn two or more classification tasks while performing one task at a time. The task to be performed is selected by modulating the firing threshold of the leaky integrate-and-fire neurons used in this work. The network is implemented using Intel's Lava platform for the Loihi 2 neuromorphic chip. Tests are performed on dynamic multi-task classification of NMNIST data. The results show that the MT-SNN effectively learns multiple tasks by modifying its dynamics, namely the firing threshold of the spiking neurons.
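The task-selection mechanism, switching which task a neuron serves by changing its firing threshold, can be illustrated with a single leaky integrate-and-fire neuron. This is a plain-Python sketch under assumed time constants and thresholds, not the Lava/Loihi 2 implementation.

```python
# One LIF neuron whose firing threshold is set per task (illustrative sketch).
import numpy as np

def lif_run(input_current, task_thresholds, task_id,
            tau=20.0, dt=1.0, v_reset=0.0):
    """Simulate a LIF neuron; the active task determines its firing threshold."""
    v_th = task_thresholds[task_id]     # task selection via threshold modulation
    v, spikes = v_reset, []
    for i_t in input_current:
        v += dt / tau * (-v + i_t)      # leaky integration of the input
        if v >= v_th:
            spikes.append(1)
            v = v_reset                 # reset membrane potential after a spike
        else:
            spikes.append(0)
    return np.array(spikes)

# The same input yields different spike trains depending on the active task, e.g.:
# lif_run(np.ones(100), task_thresholds=[0.5, 0.9], task_id=0)
```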
Computer-aided methods have shown added value for diagnosing and predicting brain disorders and can thus support decision making in clinical care and treatment planning. This chapter provides insight into the types of methods, how they work, their input data (such as cognitive tests, imaging, and genetic data), and the types of output they provide. For diagnosis, i.e., estimating the current "condition" of a patient, we focus on specific use cases such as early detection and diagnosis of dementia, differential diagnosis of brain tumors, and decision making in stroke. For prediction, i.e., estimating the future "condition" of a patient, we zoom in on use cases such as predicting the disease course in multiple sclerosis and predicting patient outcomes after treatment in brain cancer. Furthermore, based on these use cases, we assess the current state-of-the-art methodology and highlight current efforts to benchmark these methods and the importance of open science therein. Finally, we assess the current clinical impact of computer-aided methods and discuss the steps needed to increase it.
Machine learning models deployed for medical imaging tasks must be equipped with out-of-distribution detection capabilities in order to avoid erroneous predictions. It is unclear whether out-of-distribution detection models that rely on deep neural networks are suitable for detecting domain shifts in medical imaging. Gaussian processes can, through their mathematical construction, reliably separate in-distribution data points from out-of-distribution data points. Hence, we propose a parameter-efficient Bayesian layer for hierarchical convolutional Gaussian processes that incorporates Gaussian processes operating in Wasserstein-2 space to reliably propagate uncertainty. This directly replaces convolving Gaussian processes with a distance-preserving affine operator on distributions. Our experiments on brain tissue segmentation show that the resulting architecture approaches the performance of a deterministic segmentation algorithm (U-Net), which previous hierarchical Gaussian processes have not achieved. Moreover, by applying the same segmentation model to out-of-distribution data (i.e., images with pathologies such as brain tumors), we show that our uncertainty estimates lead to out-of-distribution detection that outperforms previous Bayesian networks as well as reconstruction-based approaches that learn normative distributions. To facilitate future work, our code is publicly available.
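The evaluation logic, turning a model's predictive uncertainty into an image-level out-of-distribution score, is generic and can be sketched as below. This illustrates the scoring step only, not the hierarchical convolutional Gaussian process itself; the entropy score and the validation-tuned threshold are common assumptions.

```python
# Entropy-based OOD scoring for a segmentation posterior (illustrative sketch).
import numpy as np

def predictive_entropy(probs, eps=1e-12):
    """probs: (..., n_classes) posterior class probabilities; returns (...) entropy."""
    return -np.sum(probs * np.log(probs + eps), axis=-1)

def ood_score(probs):
    """Mean per-pixel entropy as an image-level uncertainty score."""
    return predictive_entropy(probs).mean()

def is_ood(probs, threshold):
    """Flag an image as out-of-distribution if its score exceeds the threshold.

    The threshold would be chosen on in-distribution validation data.
    """
    return ood_score(probs) > threshold
```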