智能论文笔记

FedALA: Adaptive Local Aggregation for Personalized Federated Learning

Jianqing Zhang , Yang Hua , Hao Wang , Tao Song , Zhengui Xue , Ruhui Ma , Haibing Guan

分类：机器学习 | 人工智能

2022-12-02

A key challenge in federated learning (FL) is the statistical heterogeneity that impairs the generalization of the global model on each client. To address this, we propose a method Federated learning with Adaptive Local Aggregation (FedALA) by capturing the desired information in the global model for client models in personalized FL. The key component of FedALA is an Adaptive Local Aggregation (ALA) module, which can adaptively aggregate the downloaded global model and local model towards the local objective on each client to initialize the local model before training in each iteration. To evaluate the effectiveness of FedALA, we conduct extensive experiments with five benchmark datasets in computer vision and natural language processing domains. FedALA outperforms eleven state-of-the-art baselines by up to 3.27% in test accuracy. Furthermore, we also apply ALA module to other federated learning methods and achieve up to 24.19% improvement in test accuracy.

translated by 谷歌翻译

DomainATM: Domain Adaptation Toolbox for Medical Data Analysis

Hao Guan , Mingxia Liu

分类：计算机视觉

2022-09-24

域适应性（DA）是基于现代机器学习的医学数据分析的重要技术，旨在减少不同医疗数据集之间的分布差异。适当的域适应方法可以通过从多个站点/中心获取的数据来显着增强统计能力。为此，我们开发了用于医疗数据分析（DomainAtm）的域适应工具箱 - 一个开放式软件包，旨在快速促进和轻松自定义用于医疗数据分析的域适应方法。该域名在MATLAB中实现，并具有用户友好的图形接口，它由一系列流行的数据适应算法组成，这些算法已广泛应用于医学图像分析和计算机视觉。借助域名，研究人员能够促进对医学数据分析的不同适应方法的快速特征级别和图像级适应，可视化和性能评估。更重要的是，域名使用户能够通过脚本编写，大大增强其效用和可扩展性来开发和测试自己的适应方法。三个示例实验表明了概述和域的概述特征和用法，并证明了其有效性，简单性和灵活性。该软件，源代码和手册可在线获得。

translated by 谷歌翻译

Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRI

Hao Guan , Ling Yue , Pew-Thian Yap , Andrea Bozoki , Mingxia Liu

分类：计算机视觉

2022-06-24

主观认知下降（SCD）是阿尔茨海默氏病（AD）的临床前阶段，甚至在轻度认知障碍（MCI）之前就发生。渐进式SCD将转换为MCI，并有可能进一步发展为AD。因此，通过神经成像技术（例如，结构MRI）对进行性SCD的早期鉴定对于AD的早期干预具有巨大的临床价值。但是，现有的基于MRI的机器/深度学习方法通常会遇到小样本大小的问题，这对相关的神经影像学分析构成了巨大挑战。我们旨在解决本文的主要问题是如何利用相关领域（例如AD/NC）协助SCD的进展预测。同时，我们担心哪些大脑区域与进行性SCD的识别更加紧密相关。为此，我们提出了一个注意引导自动编码器模型，以进行有效的跨域适应，以促进知识转移从AD到SCD。所提出的模型由四个关键组成部分组成：1）用于学习不同域的共享子空间表示的功能编码模块，2）用于自动定义大脑中定义的兴趣障碍区域的注意模块，3）用于重构的解码模块原始输入，4）用于鉴定脑疾病的分类模块。通过对这四个模块的联合培训，可以学习域不变功能。同时，注意机制可以强调与脑部疾病相关的区域。公开可用的ADNI数据集和私人CLAS数据集的广泛实验证明了该方法的有效性。提出的模型直接可以在CPU上仅5-10秒进行训练和测试，并且适用于具有小数据集的医疗任务。

translated by 谷歌翻译

Implicit Autoencoder for Point Cloud Self-supervised Representation Learning

Siming Yan , Zhenpei Yang , Haoxiang Li , Li Guan , Hao Kang , Gang Hua , Qixing Huang

分类：计算机视觉

2022-01-03

许多3D表示（例如，点云）是下面连续3D表面的离散样本。该过程不可避免地介绍了底层的3D形状上的采样变化。在学习3D表示中，应忽略应忽略变化，而应捕获基础3D形状的可转换知识。这成为现有代表学习范式的大挑战。本文在点云上自动编码。标准自动编码范例强制编码器捕获这种采样变体，因为解码器必须重建具有采样变化的原始点云。我们介绍了隐式AutoEncoder（IAE），这是一种简单而有效的方法，通过用隐式解码器替换点云解码器来解决这一挑战。隐式解码器输出与相同模型的不同点云采样之间共享的连续表示。在隐式表示下重建可以优先考虑编码器丢弃采样变体，引入更多空间以学习有用的功能。在一个简单的线性AutoEncoder下，理论上理论地证明这一索赔。此外，隐式解码器提供丰富的空间来为不同的任务设计合适的隐式表示。我们展示了IAE对3D对象和3D场景的各种自我监督学习任务的有用性。实验结果表明，IAE在每项任务中始终如一地优于最先进的。

translated by 谷歌翻译

A real-time spatiotemporal AI model analyzes skill in open surgical videos

Emmett D. Goodman , Krishna K. Patel , Yilun Zhang , William Locke , Chris J. Kennedy , Rohan Mehrotra , Stephen Ren , Melody Guan , Maren Downing , Hao Wei Chen

分类：计算机视觉 | 人工智能

2021-12-14

开放程序代表全球手术的主要形式。人工智能（AI）有可能优化手术实践并改善患者结果，但努力主要集中在微创技术上。我们的工作通过策划，从YouTube，从YouTube，Open Surgical视频的最大数据集克服了培训AI模型的现有数据限制：1997年从50个国家上传的23个外科手术的视频。使用此数据集，我们开发了一种能够实时了解外科行为，手和工具的多任务AI模型 - 程序流程和外科医生技能的构建块。我们表明我们的模型推广了各种外科类型和环境。说明这种普遍性，我们直接应用了YouTube培训的模型，分析了在学术医疗中心前瞻性收集的开放式手术，并确定了与手动效率相关的外科技能的运动学描述符。我们的开放外科（AVOS）数据集和培训模式的注释视频将可用于进一步发展外科艾。

translated by 谷歌翻译

Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification

Suncheng Xiang , Guanjie You , Mengyuan Guan , Hao Chen , Binjie Yan , Ting Liu , Yuzhuo Fu

分类：计算机视觉

2021-09-22

人重新识别（RE-ID）在公共安全和视频监控等应用中起着重要作用。最近，从合成数据引擎的普及中获益的合成数据学习，从公众眼中引起了极大的关注。但是，现有数据集数量，多样性和变性有限，并且不能有效地用于重新ID问题。为了解决这一挑战，我们手动构造一个名为FineGPR的大型人数据集，具有细粒度的属性注释。此外，旨在充分利用FineGPR的潜力，并推广从数百万综合数据的高效培训，我们提出了一个名为AOST的属性分析流水线，它动态地学习了真实域中的属性分布，然后消除了合成和现实世界之间的差距因此，自由地部署到新场景。在基准上进行的实验表明，FineGPR具有AOST胜过（或与）现有的实际和合成数据集，这表明其对重新ID任务的可行性，并证明了众所周知的较少的原则。我们的Synthetic FineGPR数据集可公开可用于\ URL {https://github.com/jeremyxsc/finegpr}。

translated by 谷歌翻译

Fed-EINI: An Efficient and Interpretable Inference Framework for Decision Tree Ensembles in Federated Learning

Xiaolin Chen , Shuai Zhou , Bei guan , Kai Yang , Hao Fan , Zejin Feng , Zhong Chen , Hu Wang , Yongji Wang

分类：机器学习 | 人工智能

2021-05-20

关于数据隐私和安全性的越来越多的担忧驱动了从孤立的数据源，即联合学习的隐私保留机学习的新兴领域。一类联合学习，\ Texit {垂直联合学习}，不同的各方对共同用户的不同特征，具有促进许多领域企业之间各种业务合作的潜力。在机器学习中，诸如梯度提升决策树（GBDT）和随机森林等决策树集合被广泛应用强大的型号，具有高的可解释性和建模效率。然而，最先进的垂直联合学习框架适应匿名功能以避免可能的数据泄露，使模型受到损害的可解释性。为了解决推理过程中的这个问题，在本文中，我们首先在垂直联合学习中对客场党的特征披露含义的必要性进行了问题分析。然后，我们发现树的预测结果可以表示为所有各方持有的树的子模型结果的交叉点。利用这种关键观察，我们通过隐藏决策路径来保护数据隐私并允许公开特征含义，并适应推理输出的通信有效的安全计算方法。通过理论分析和广泛的数值结果，将证明FED-EINI的优点。我们通过披露特征的含义来提高模型的可解释性，同时确保效率和准确性。

translated by 谷歌翻译

More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates

Sirui Zhao , Huaying Tang , Xinglong Mao , Shifeng Liu , Hanqing Tao , Hao Wang , Tong Xu , Enhong Chen

分类：计算机视觉

2023-01-03

As one of the most important psychic stress reactions, micro-expressions (MEs), are spontaneous and transient facial expressions that can reveal the genuine emotions of human beings. Thus, recognizing MEs (MER) automatically is becoming increasingly crucial in the field of affective computing, and provides essential technical support in lie detection, psychological analysis and other areas. However, the lack of abundant ME data seriously restricts the development of cutting-edge data-driven MER models. Despite the recent efforts of several spontaneous ME datasets to alleviate this problem, it is still a tiny amount of work. To solve the problem of ME data hunger, we construct a dynamic spontaneous ME dataset with the largest current ME data scale, called DFME (Dynamic Facial Micro-expressions), which includes 7,526 well-labeled ME videos induced by 671 participants and annotated by more than 20 annotators throughout three years. Afterwards, we adopt four classical spatiotemporal feature learning models on DFME to perform MER experiments to objectively verify the validity of DFME dataset. In addition, we explore different solutions to the class imbalance and key-frame sequence sampling problems in dynamic MER respectively on DFME, so as to provide a valuable reference for future research. The comprehensive experimental results show that our DFME dataset can facilitate the research of automatic MER, and provide a new benchmark for MER. DFME will be published via https://mea-lab-421.github.io.

translated by 谷歌翻译

Surveillance Face Anti-spoofing

Hao Fang , Ajian Liu , Jun Wan , Sergio Escalera , Chenxu Zhao , Xu Zhang , Stan Z. Li , Zhen Lei

分类：计算机视觉

2023-01-03

Face Anti-spoofing (FAS) is essential to secure face recognition systems from various physical attacks. However, recent research generally focuses on short-distance applications (i.e., phone unlocking) while lacking consideration of long-distance scenes (i.e., surveillance security checks). In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks. In this scene, low image resolution and noise interference are new challenges faced in surveillance FAS. Together with the SuHiFiMask dataset, we propose a Contrastive Quality-Invariance Learning (CQIL) network to alleviate the performance degradation caused by image quality from three aspects: (1) An Image Quality Variable module (IQV) is introduced to recover image information associated with discrimination by combining the super-resolution network. (2) Using generated sample pairs to simulate quality variance distributions to help contrastive learning strategies obtain robust feature representation under quality variation. (3) A Separate Quality Network (SQN) is designed to learn discriminative features independent of image quality. Finally, a large number of experiments verify the quality of the SuHiFiMask dataset and the superiority of the proposed CQIL.

translated by 谷歌翻译

OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup

Zhijing Yang , Junyang Chen , Yukai Shi , Hao Li , Tianshui Chen , Liang Lin

分类：计算机视觉

2023-01-03

Image Virtual try-on aims at replacing the cloth on a personal image with a garment image (in-shop clothes), which has attracted increasing attention from the multimedia and computer vision communities. Prior methods successfully preserve the character of clothing images, however, occlusion remains a pernicious effect for realistic virtual try-on. In this work, we first present a comprehensive analysis of the occlusions and categorize them into two aspects: i) Inherent-Occlusion: the ghost of the former cloth still exists in the try-on image; ii) Acquired-Occlusion: the target cloth warps to the unreasonable body part. Based on the in-depth analysis, we find that the occlusions can be simulated by a novel semantically-guided mixup module, which can generate semantic-specific occluded images that work together with the try-on images to facilitate training a de-occlusion try-on (DOC-VTON) framework. Specifically, DOC-VTON first conducts a sharpened semantic parsing on the try-on person. Aided by semantics guidance and pose prior, various complexities of texture are selectively blending with human parts in a copy-and-paste manner. Then, the Generative Module (GM) is utilized to take charge of synthesizing the final try-on image and learning to de-occlusion jointly. In comparison to the state-of-the-art methods, DOC-VTON achieves better perceptual quality by reducing occlusion effects.

translated by 谷歌翻译