智能论文笔记

Cognitive Architecture for Decision-Making Based on Brain Principles Programming

Anton Kolonin , Andrey Kurpatov , Artem Molchanov , Gennadiy Averyanov

分类：人工智能

2022-04-17

我们描述了一种认知体系结构，旨在根据大脑活动的五个确定原理解决广泛的问题，并在三个子系统中实施：逻辑 - 稳态推理，概率形式的形式概念和功能系统理论。构建体系结构涉及实施任务驱动的方法，该方法允许将应用应用程序的目标功能定义为按照与该任务相对应的操作环境制定的任务，该任务在应用的本体论中表示。我们为许多实用应用以及基于它的主题领域本体论提供了一个基本的本体，描述了拟议的体系结构，并提供了在本架构中执行这些应用程序的可能示例。

translated by 谷歌翻译

Decentralized Control of Quadrotor Swarms with End-to-end Deep Reinforcement Learning

Sumeet Batra , Zhehui Huang , Aleksei Petrenko , Tushar Kumar , Artem Molchanov , Gaurav S. Sukhatme

分类：机器人

2021-09-16

我们展示了通过大规模多代理端到端增强学习的大射击可转移到真正的四轮压力机的无人驾驶群体控制器的可能性。我们培训由神经网络参数化的政策，该政策能够以完全分散的方式控制群体中的各个无人机。我们的政策，在具有现实的四轮流物理学的模拟环境中训练，展示了先进的植绒行为，在紧张的地层中执行侵略性的操作，同时避免彼此的碰撞，破裂和重新建立地层，以避免与移动障碍的碰撞，并有效地协调追求障碍，并有效地协调追求逃避任务。在模拟中，我们分析了培训制度的不同模型架构和参数影响神经群的最终表现。我们展示了在模拟中学习的模型的成功部署到高度资源受限的物理四体体执行站保持和目标交换行为。在Propers网站上提供代码和视频演示，在https://sites.google.com/view/swarm-rl上获得。

translated by 谷歌翻译

In Quest of Ground Truth: Learning Confident Models and Estimating Uncertainty in the Presence of Annotator Noise

Asma Ahmed Hashmi , Artem Agafonov , Aigerim Zhumabayeva , Mohammad Yaqub , Martin Takáč

分类：计算机视觉 | 机器学习

2023-01-02

The performance of the Deep Learning (DL) models depends on the quality of labels. In some areas, the involvement of human annotators may lead to noise in the data. When these corrupted labels are blindly regarded as the ground truth (GT), DL models suffer from performance deficiency. This paper presents a method that aims to learn a confident model in the presence of noisy labels. This is done in conjunction with estimating the uncertainty of multiple annotators. We robustly estimate the predictions given only the noisy labels by adding entropy or information-based regularizer to the classifier network. We conduct our experiments on a noisy version of MNIST, CIFAR-10, and FMNIST datasets. Our empirical results demonstrate the robustness of our method as it outperforms or performs comparably to other state-of-the-art (SOTA) methods. In addition, we evaluated the proposed method on the curated dataset, where the noise type and level of various annotators depend on the input image style. We show that our approach performs well and is adept at learning annotators' confusion. Moreover, we demonstrate how our model is more confident in predicting GT than other baselines. Finally, we assess our approach for segmentation problem and showcase its effectiveness with experiments.

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

RANA: Relightable Articulated Neural Avatars

Umar Iqbal , Akin Caliskan , Koki Nagano , Sameh Khamis , Pavlo Molchanov , Jan Kautz

分类：计算机视觉

2022-12-06

We propose RANA, a relightable and articulated neural avatar for the photorealistic synthesis of humans under arbitrary viewpoints, body poses, and lighting. We only require a short video clip of the person to create the avatar and assume no knowledge about the lighting environment. We present a novel framework to model humans while disentangling their geometry, texture, and also lighting environment from monocular RGB videos. To simplify this otherwise ill-posed task we first estimate the coarse geometry and texture of the person via SMPL+D model fitting and then learn an articulated neural representation for photorealistic image generation. RANA first generates the normal and albedo maps of the person in any given target body pose and then uses spherical harmonics lighting to generate the shaded image in the target lighting environment. We also propose to pretrain RANA using synthetic images and demonstrate that it leads to better disentanglement between geometry and texture while also improving robustness to novel body poses. Finally, we also present a new photorealistic synthetic dataset, Relighting Humans, to quantitatively evaluate the performance of the proposed approach.

translated by 谷歌翻译

Bayesian Network Models of Causal Interventions in Healthcare Decision Making: Literature Review and Software Evaluation

Artem Velikzhanin , Benjie Wang , Marta Kwiatkowska

分类：人工智能 | 机器学习

2022-11-28

This report summarises the outcomes of a systematic literature search to identify Bayesian network models used to support decision making in healthcare. After describing the search methodology, the selected research papers are briefly reviewed, with the view to identify publicly available models and datasets that are well suited to analysis using the causal interventional analysis software tool developed in Wang B, Lyle C, Kwiatkowska M (2021). Finally, an experimental evaluation of applying the software on a selection of models is carried out and preliminary results are reported.

translated by 谷歌翻译

Medical Image Captioning via Generative Pretrained Transformers

Alexander Selivanov , Oleg Y. Rogov , Daniil Chesakov , Artem Shelmanov , Irina Fedulova , Dmitry V. Dylov

分类：计算机视觉 | 人工智能

2022-09-28

自动临床标题生成问题被称为建议模型，将额叶X射线扫描与放射学记录中的结构化患者信息结合在一起。我们将两种语言模型结合在一起，即表演 - 泰尔和GPT-3，以生成全面和描述性的放射学记录。这些模型的建议组合产生了文本摘要，其中包含有关发现的病理，其位置以及将每个病理定位在原始X射线扫描中的每个病理的2D热图。提出的模型在两个医学数据集（Open-I，Mimic-CXR和通用MS-Coco）上进行了测试。用自然语言评估指标测量的结果证明了它们对胸部X射线图像字幕的有效适用性。

translated by 谷歌翻译

Sauron U-Net: Simple automated redundancy elimination in medical image segmentation via filter pruning

Juan Miguel Valverde , Artem Shatillo , Jussi Tohka

分类：计算机视觉 | 人工智能

2022-09-27

我们提出了Sauron，这是一种过滤器修剪方法，它通过使用自动调整的层特异性阈值丢弃相应的过滤器来消除冗余特征图。此外，Sauron最大程度地减少了一个正规化术语，正如我们所显示的各种指标所显示的那样，促进了特征地图簇的形成。与大多数过滤器修剪方法相反，Sauron是单相，类似于典型的神经网络优化，需要更少的超参数和设计决策。此外，与其他基于群集的方法不同，我们的方法不需要预选簇的数量，而簇的数量是非平凡的，以确定和随着层的变化。我们在三个医学图像分割任务上评估了Sauron和三种最先进的过滤器修剪方法。在这个领域，过滤器修剪很少受到关注，并且可以帮助建立有效的医疗级计算机模型，这些计算机由于隐私考虑而无法使用云服务。索伦（Sauron）比竞争的修剪方法实现了具有更高性能和修剪率的模型。此外，由于Sauron在训练过程中除去过滤器，因此随着时间的推移，其优化加速了。最后，我们证明了Sauron-Prun的模型的特征地图是高度可解释的。 Sauron代码可在https://github.com/jmlipman/sauronunet上公开获得。

translated by 谷歌翻译

Characterizing Graph Datasets for Node Classification: Beyond Homophily-Heterophily Dichotomy

Oleg Platonov , Denis Kuznedelev , Artem Babenko , Liudmila Prokhorenkova

分类：机器学习

2022-09-13

同质性是描述边缘连接相似节点的趋势的图形属性。相反称为异性。尽管同质性对于许多现实世界网络是自然的，但也有没有此属性的网络。人们通常认为，标准消息的图形神经网络（GNNS）在非双性图形上表现不佳，因此此类数据集需要特别注意。尽管为异性图开发图表的学习方法已经付出了很多努力，但尚无普遍同意同质的措施。但是，在文献中使用了几种测量同质性的指标，但是，我们表明所有这些度量都有关键的缺点，以阻止不同数据集之间的同质级别比较。我们将理想的属性形式化，以进行适当的同质度量，并展示如何将有关分类绩效指标属性的现有文献与我们的问题联系起来。在这样做时，我们找到了一种措施，我们称调整后的同质性比现有同质措施更满足所需的特性。有趣的是，该措施与两个分类性能指标有关 - 科恩的kappa和马修斯相关系数。然后，我们超越了同质性的二分法，并提出了一种新的属性，我们称之为标签信息性（LI），该属性表征了邻居标签提供有关节点标签的信息的数量。从理论上讲，我们表明LI在具有不同数量的类和类大小平衡的数据集中相当。通过一系列实验，我们表明LI是对数据集上GNN的性能的更好预测指标，而不是同质性。我们证明了Li解释了为什么GNN有时可以在异性数据集上表现良好 - 这是文献中最近观察到的现象。

translated by 谷歌翻译

Petals: Collaborative Inference and Fine-tuning of Large Models

Alexander Borzunov , Dmitry Baranchuk , Tim Dettmers , Max Ryabinin , Younes Belkada , Artem Chumachenko , Pavel Samygin , Colin Raffel

分类：机器学习

2022-09-02

许多NLP任务受益于使用通常具有超过1000亿参数的大语言模型（LLM）。随着Bloom-176b和Opt-175B的发布，每个人都可以下载该规模的预估计型号。尽管如此，使用这些模型仍需要许多研究人员无法获得高端硬件。在某些情况下，LLM可以通过RAM卸载或托管API更实惠。但是，这些技术具有先天的局限性：对于交互推理而言，卸载太慢，而API的灵活性不足以进行研究。在这项工作中，我们通过加入信任处理客户数据的多个政党的资源来提出花瓣$ - $ $用于推理和微调大型模型的系统。我们证明，这种策略的表现极大地超过了非常大型型号的卸载，以每秒约1美元的价格$ \ $ \ $ \ $ \ $ \ $ \ $ \ $ \ $ 1。与大多数推理API不同，花瓣还本地揭示了服务模型的隐藏状态，从而使其用户可以根据有效的微调方法训练和共享自定义模型扩展。

translated by 谷歌翻译

HTML版本