智能论文笔记

Perceived personality state estimation in dyadic and small group interaction with deep learning methods

Kristian Fenech , Ádám Fodor , Sean P. Bergeron , Rachid R. Saboundji , Catharine Oertel , András Lőrincz

分类：机器学习

2022-11-09

Dyadic and small group collaboration is an evolutionary advantageous behaviour and the need for such collaboration is a regular occurrence in day to day life. In this paper we estimate the perceived personality traits of individuals in dyadic and small groups over thin-slices of interaction on four multimodal datasets. We find that our transformer based predictive model performs similarly to human annotators tasked with predicting the perceived big-five personality traits of participants. Using this model we analyse the estimated perceived personality traits of individuals performing tasks in small groups and dyads. Permutation analysis shows that in the case of small groups undergoing collaborative tasks, the perceived personality of group members clusters, this is also observed for dyads in a collaborative problem solving task, but not in dyads under non-collaborative task settings. Additionally, we find that the group level average perceived personality traits provide a better predictor of group performance than the group level average self-reported personality traits.

translated by 谷歌翻译

Cell-Free Data Power Control Via Scalable Multi-Objective Bayesian Optimisation

Sergey S. Tambovskiy , Gábor Fodor , Hugo Tullberg

分类：机器学习 | (统计)机器学习

2022-12-20

Cell-free multi-user multiple input multiple output networks are a promising alternative to classical cellular architectures, since they have the potential to provide uniform service quality and high resource utilisation over the entire coverage area of the network. To realise this potential, previous works have developed radio resource management mechanisms using various optimisation engines. In this work, we consider the problem of overall ergodic spectral efficiency maximisation in the context of uplink-downlink data power control in cell-free networks. To solve this problem in large networks, and to address convergence-time limitations, we apply scalable multi-objective Bayesian optimisation. Furthermore, we discuss how an intersection of multi-fidelity emulation and Bayesian optimisation can improve radio resource management in cell-free networks.

translated by 谷歌翻译

Multi-Server Over-the-Air Federated Learning

Seyed Mohammad Azimi-Abarghouyi , Viktoria Fodor

分类：人工智能 | 机器学习

2022-11-29

In this work, we propose a communication-efficient two-layer federated learning algorithm for distributed setups including a core server and multiple edge servers with clusters of devices. Assuming different learning tasks, clusters with a same task collaborate. To implement the algorithm over wireless links, we propose a scalable clustered over-the-air aggregation scheme for the uplink with a bandwidth-limited broadcast scheme for the downlink that requires only two single resource blocks for each algorithm iteration, independent of the number of edge servers and devices. This setup is faced with interference of devices in the uplink and interference of edge servers in the downlink that are to be modeled rigorously. We first develop a spatial model for the setup by modeling devices as a Poisson cluster process over the edge servers and quantify uplink and downlink error terms due to the interference. Accordingly, we present a comprehensive mathematical approach to derive the convergence bound for the proposed algorithm including any number of collaborating clusters in the setup and provide important special cases and design remarks. Finally, we show that despite the interference in the proposed uplink and downlink schemes, the proposed algorithm achieves high learning accuracy for a variety of parameters.

translated by 谷歌翻译

Federated Learning Using Three-Operator ADMM

Shashi Kant , José Mairton B. da Silva Jr. , Gabor Fodor , Bo Göransson , Mats Bengtsson , Carlo Fischione

分类：机器学习

2022-11-08

Federated learning (FL) has emerged as an instance of distributed machine learning paradigm that avoids the transmission of data generated on the users' side. Although data are not transmitted, edge devices have to deal with limited communication bandwidths, data heterogeneity, and straggler effects due to the limited computational resources of users' devices. A prominent approach to overcome such difficulties is FedADMM, which is based on the classical two-operator consensus alternating direction method of multipliers (ADMM). The common assumption of FL algorithms, including FedADMM, is that they learn a global model using data only on the users' side and not on the edge server. However, in edge learning, the server is expected to be near the base station and have direct access to rich datasets. In this paper, we argue that leveraging the rich data on the edge server is much more beneficial than utilizing only user datasets. Specifically, we show that the mere application of FL with an additional virtual user node representing the data on the edge server is inefficient. We propose FedTOP-ADMM, which generalizes FedADMM and is based on a three-operator ADMM-type technique that exploits a smooth cost function on the edge server to learn a global model parallel to the edge devices. Our numerical experiments indicate that FedTOP-ADMM has substantial gain up to 33\% in communication efficiency to reach a desired test accuracy with respect to FedADMM, including a virtual user on the edge server.

translated by 谷歌翻译

Industry-Scale Orchestrated Federated Learning for Drug Discovery

Martijn Oldenhof , Gergely Ács , Balázs Pejó , Ansgar Schuffenhauer , Nicholas Holway , Noé Sturm , Arne Dieckmann , Oliver Fortmeier , Eric Boniface , Clément Mayer

分类：机器学习 | (统计)机器学习

2022-10-17

To apply federated learning to drug discovery we developed a novel platform in the context of European Innovative Medicines Initiative (IMI) project MELLODDY (grant n{\deg}831472), which was comprised of 10 pharmaceutical companies, academic research labs, large industrial companies and startups. The MELLODDY platform was the first industry-scale platform to enable the creation of a global federated model for drug discovery without sharing the confidential data sets of the individual partners. The federated model was trained on the platform by aggregating the gradients of all contributing partners in a cryptographic, secure way following each training iteration. The platform was deployed on an Amazon Web Services (AWS) multi-account architecture running Kubernetes clusters in private subnets. Organisationally, the roles of the different partners were codified as different rights and permissions on the platform and administrated in a decentralized way. The MELLODDY platform generated new scientific discoveries which are described in a companion paper.

translated by 谷歌翻译

Knowledge Authoring with Factual English

Yuheng Wang , Giorgian Borca-Tasciuc , Nikhil Goel , Paul Fodor , Michael Kifer

分类：自然语言处理

2022-08-05

知识表示和推理（KRR）系统表示知识作为事实和规则的集合。像数据库一样，KRR系统包含有关工业企业，科学和业务等人类活动领域的信息。 KRR可以代表复杂的概念和关系，它们可以以复杂的方式查询和操纵信息。不幸的是，指定必要的知识需要大多数领域专家没有的技能，而专业知识工程师很难找到，因此KRR技术受到了阻碍。一种解决方案可能是从英语文本中提取知识，并且许多作品都尝试这样做（Openseame，Google的吊索等）。不幸的是，目前，从不受限制的自然语言中提取逻辑事实仍然是不准确的，无法用于推理，而限制语言语法（所谓的受控自然语言或CNL）对于用户来说很难学习和使用。然而，与其他方法相比，一些最近基于CNL的方法，例如知识创作逻辑机（KALM）的精度非常高，并且一个自然的问题是可以在多大程度上取消CNL限制。在本文中，我们通过将KALM框架移植到神经自然语言解析器Mstanza来解决这个问题。在这里，我们将注意力限制在创作事实和查询上，因此我们的重点是我们所说的事实英语陈述。在我们的后续工作中将考虑创作其他类型的知识，例如规则。事实证明，基于神经网络的解析器有自己的问题，并且他们犯的错误范围从言论的一部分标记到lemmatization到依赖性错误。我们介绍了许多解决这些问题并测试新系统KALMFL（即，事实语言的KALM）的技术，这些技术表明KALMFL的正确性超过95％。

translated by 谷歌翻译

Towards Robotic Laboratory Automation Plug & Play: Survey and Concept Proposal on Teaching-free Robot Integration with the LAPP Digital Twin

Ádám Wolf , Stefan Romeder-Finger , Károly Széll , Péter Galambos

分类：机器人

2022-05-17

The Laboratory Automation Plug & Play (LAPP) framework is an over-arching reference architecture concept for the integration of robots in life science laboratories. The plug & play nature lies in the fact that manual configuration is not required, including the teaching of the robots. In this paper a digital twin (DT) based concept is proposed that outlines the types of information that have to be provided for each relevant component of the system. In particular, for the devices interfacing with the robot, the robot positions have to be defined beforehand in a device-attached coordinate system (CS) by the vendor. This CS has to be detectable by the vision system of the robot by means of optical markers placed on the front side of the device. With that, the robot is capable of tending the machine by performing the pick-and-place type transportation of standard sample carriers. This basic use case is the primary scope of the LAPP-DT framework. The hardware scope is limited to simple benchtop and mobile manipulators with parallel grippers at this stage. This paper first provides an overview of relevant literature and state-of-the-art solutions, after which it outlines the framework on the conceptual level, followed by the specification of the relevant DT parameters for the robot, for the devices and for the facility. Finally, appropriate technologies and strategies are identified for the implementation.

translated by 谷歌翻译

A general technique for the estimation of farm animal body part weights from CT scans and its applications in a rabbit breeding program

Ádám Csóka , György Kovács , Virág Ács , Zsolt Matics , Zsolt Gerencsér , Zsolt Szendrő , István Nagy , Örs Petneházy , Imre Repa , Mariann Moizs

分类：计算机视觉 | 机器学习

2021-12-30

农场动物成像的各种应用基于某些身体部位的重量和从动物的CT图像切割的估计。在许多情况下，由于扫描非镇静的活动物，通过CT图像中的姿势的巨大变化来增加问题的复杂性。在本文中，我们提出了一种估计来自（可能）活体动物的CT图像的切割和身体部位的重量的一般和鲁棒方法。我们通过弹性登记和联合功能和用于斗篷的回归分量的模型选择，适应基于多标准的分段以及具有大量特征和较少量的样本。通过兔育种程序中的真实应用来评估和说明所提出的技术，显示R ^ 2比以前的技术和方法高于以前的技术和方法。所提出的技术很容易适应类似的问题，因此，它在开源软件包中共享，以便为社区的利益。

translated by 谷歌翻译

Over-the-Air Federated Learning with Retransmissions (Extended Version)

Henrik Hellström , Viktoria Fodor , Carlo Fischione

分类：机器学习

2021-11-19

通过增加无线设备的计算能力，以及用户和设备生成的数据的前所未有的级别，已经出现了新的分布式机器学习（ML）方法。在无线社区中，由于其通信效率及其处理非IID数据问题的能力，联邦学习（FL）特别有趣。可以通过称为空中计算（AIRCOMP）的无线通信方法加速FL训练，其利用同时上行链路传输的干扰以有效地聚合模型更新。但是，由于Aircomp利用模拟通信，因此它引入了不可避免的估计错误。在本文中，我们研究了这种估计误差对FL的收敛性的影响，并提出了一种改进资源受限无线网络的方法的转移。首先，我们通过静态通道重新传输获得最佳Aircomp电源控制方案。然后，我们调查了传递的空中流体的性能，并在流失函数上找到两个上限。最后，我们提出了一种选择最佳重传的启发式，可以在训练ML模型之前计算。数值结果表明，引入重传可能导致ML性能提高，而不会在通信或计算方面产生额外的成本。此外，我们为我们的启发式提供了模拟结果，表明它可以正确地确定不同无线网络设置和机器学习问题的最佳重传次数。

translated by 谷歌翻译

Towards Robotic Laboratory Automation Plug & Play: The "LAPP" Framework

Ádám Wolf , David Wolton , Josef Trapl , Julien Janda , Stefan Romeder-Finger , Thomas Gatternig , Jean-Baptiste Farcet , Péter Galambos , Károly Széll

分类：机器人

2021-06-18

增加制药实验室和生产设施的自动化水平起着至关重要的作用。然而，这一领域的特殊要求使其挑战适应其他行业中存在的尖端技术。本文概述了相关方法以及如何在制药行业中使用，特别是在发展实验室中。最近的进步包括能够处理能够处理复杂任务的灵活移动机械手。然而，由于接口的多样性，将来自许多不同供应商的设备集成到端到端的自动化系统中是复杂的。因此，在本文中考虑了各种标准化方法，提出了一种概念来进一步服用一步。该概念使具有视觉系统的移动操纵器能够“学习”每个设备的姿势，并利用来自通用云数据库的条形码 - 获取接口信息。该信息包括控制和通信协议定义以及操作设备所需的机器人操作的表示。为了定义与设备相关的动作，设备必须具有 - 除了条形码 - 作为标准的基准标记。在随访论文中的适当研究活动之后，将详细阐述该概念。

translated by 谷歌翻译