智能论文笔记

EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks

Liyu Shi , Xiaoyan Li , Weiming Hua , Haoyuan Chen , Jing Chen , Zizhen Fan , Minghe Gao , Yujie Jing , Guotao Lu , Deguo Ma

分类：计算机视觉

2022-12-01

Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when computer technology is used to aid in diagnosis. Methods: This present study provided a new publicly available Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image Dataset for Image Segmentation Tasks (EBHI-Seg). To demonstrate the validity and extensiveness of EBHI-Seg, the experimental results for EBHI-Seg are evaluated using classical machine learning methods and deep learning methods. Results: The experimental results showed that deep learning methods had a better image segmentation performance when utilizing EBHI-Seg. The maximum accuracy of the Dice evaluation metric for the classical machine learning method is 0.948, while the Dice evaluation metric for the deep learning method is 0.965. Conclusion: This publicly available dataset contained 5,170 images of six types of tumor differentiation stages and the corresponding ground truth images. The dataset can provide researchers with new segmentation algorithms for medical diagnosis of colorectal cancer, which can be used in the clinical setting to help doctors and patients.

translated by 谷歌翻译

A new way of video compression via forward-referencing using deep learning

S. M. A. K. Rajin , M. Murshed , M. Paul , S. W. Teng , J. Ma

分类：计算机视觉

2022-08-13

为了利用同一场景的视频框架中的高时间相关性，使用基于块的运动估计和补偿技术从已经编码的参考帧中预测了当前帧。尽管这种方法可以有效利用移动对象的翻译运动，但它容易受到其他类型的仿射运动和对象遮挡/除含量的影响。最近，深度学习已被用来模拟人类姿势的高级结构，以从短视频中的特定动作中进行，然后通过使用生成的对抗网络（GAN）来预测姿势，从而在未来的时间内生成虚拟框架。因此，建模人姿势的高级结构能够通过预测人类的行为并确定其轨迹来利用语义相关性。视频监视应用程序将受益，因为可以通过估算人类姿势轨迹并通过语义相关性产生未来的框架来压缩存储的大监视数据。本文通过从已经编码的框架中对人姿势进行建模并在当前时间使用生成的框架来探讨一种新的视频编码方式。预计所提出的方法可以通过预测包含具有较低残差的移动对象的块来克服传统向后引用框架的局限性。实验结果表明，提出的方法平均可以实现高达2.83 dB PSNR增益和25.93 \％比特率的节省，用于高运动视频序列

translated by 谷歌翻译

SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud

Xiangrui Zhao , Sheng Yang , Tianxin Huang , Jun Chen , Teng Ma , Mingyang Li , Yong Liu

分类：计算机视觉

2022-08-03

电线杆和建筑物边缘经常是城市道路上可观察到的对象，为各种计算机视觉任务提供了可靠的提示。为了重复提取它们作为特征并在离散激光镜头框架之间进行注册，我们提出了第一个基于学习的功能分割和LIDAR点云中3D线的描述模型。为了训练我们的模型，而无需耗时和乏味的数据标记过程，我们首先生成了目标线基本外观的合成原始图，并构建一个迭代线自动标记的过程，以逐步完善真实激光扫描的线路标签。我们的分割模型可以在任意规模的扰动下提取线，我们使用共享的EDGECONV编码层共同训练两个分割和描述符头。基于模型，我们可以在没有初始转换提示的情况下构建一个高度可用的全局注册模块，用于点云注册。实验表明，我们基于线的注册方法对基于最先进的方法的方法具有很高的竞争力。我们的代码可在https://github.com/zxrzju/superline3d.git上找到。

translated by 谷歌翻译

A Piecewise Monotonic Gait Phase Estimation Model for Controlling a Powered Transfemoral Prosthesis in Various Locomotion Modes

Xinxing Chen , Chuheng Chen , Yuxuan Wang , Bowen Yang , Teng Ma , Yuquan Leng , Chenglong Fu

分类：机器人

2022-07-25

基于步态阶段的控制是步行AID机器人的热门研究主题，尤其是机器人下限假体。步态阶段估计是基于步态阶段控制的挑战。先前的研究使用了人类大腿角的整合或差异来估计步态阶段，但是累积的测量误差和噪声可能会影响估计结果。在本文中，提出了一种更健壮的步态相估计方法，使用各种运动模式的分段单调步态相位大角模型的统一形式。步态相仅根据大腿角度估算，这是一个稳定的变量，避免了相位漂移。基于卡尔曼滤波器的平滑液旨在进一步抑制估计步态阶段的突变。基于提出的步态相估计方法，基于步态阶段的关节角跟踪控制器是为跨股骨假体设计的。提出的步态估计方法，步态相和控制器通过在各种运动模式下的步行数据进行离线分析来评估。基于步态阶段的控制器的实时性能在经际假体的实验中得到了验证。

translated by 谷歌翻译

HiSTGNN: Hierarchical Spatio-temporal Graph Neural Networks for Weather Forecasting

Minbo Ma , Peng Xie , Fei Teng , Tianrui Li , Bin Wang , Shenggong Ji , Junbo Zhang

分类：机器学习 | 人工智能

2022-01-22

天气预报是一项有吸引力的挑战性任务，因为它对人类生活和大气运动的复杂性的影响。在大量历史观察到的时间序列数据的支持下，该任务适用于数据驱动的方法，尤其是深层神经网络。最近，基于图神经网络（GNN）方法在时空预测方面取得了出色的性能。但是，基于规范的GNNS方法仅分别对每个站的气象变量的局部图或整个车站的全局图进行建模，从而缺乏不同站点的气象变量之间的信息相互作用。在本文中，我们提出了一种新型的层次时空图形神经网络（Histgnn），以模拟多个站点气象变量之间的跨区域时空相关性。自适应图学习层和空间图卷积用于构建自学习图，并研究可变级别和站点级别图的节点之间的隐藏依赖性。为了捕获时间模式，扩张的成立为GATE时间卷积的主干旨在对长而各种气象趋势进行建模。此外，提出了动态的交互学习来构建在层次图中传递的双向信息。三个现实世界中的气象数据集的实验结果表明，史基元超过7个基准的卓越性能，并且将误差降低了4.2％至11.6％，尤其是与最先进的天气预测方法相比。

translated by 谷歌翻译

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

Xiang Bai , Hanchen Wang , Liya Ma , Yongchao Xu , Jiefeng Gan , Ziwei Fan , Fan Yang , Ke Ma , Jiehua Yang , Song Bai

分类：人工智能

2021-11-18

人工智能（AI）为简化Covid-19诊断提供了有前景的替代。然而，涉及周围的安全和可信度的担忧阻碍了大规模代表性的医学数据，对临床实践中训练广泛的模型造成了相当大的挑战。为了解决这个问题，我们启动了统一的CT-Covid AI诊断计划（UCADI），其中AI模型可以在没有数据共享的联合学习框架（FL）下在每个主机机构下分发和独立地在没有数据共享的情况下在每个主机机构上执行。在这里，我们认为我们的FL模型通过大的产量（中国测试敏感性/特异性：0.973 / 0.951，英国：0.730 / 0.942），与专业放射科医师的面板实现可比性表现。我们进一步评估了持有的模型（从另外两家医院收集，留出FL）和异构（用造影材料获取）数据，提供了模型所做的决策的视觉解释，并分析了模型之间的权衡联邦培训过程中的性能和沟通成本。我们的研究基于来自位于中国和英国的23家医院的3,336名患者的9,573次胸部计算断层扫描扫描（CTS）。统称，我们的工作提出了利用联邦学习的潜在保留了数字健康的前景。

translated by 谷歌翻译

An Information Theory-inspired Strategy for Automatic Network Pruning

Xiawu Zheng , Yuexiao Ma , Teng Xi , Gang Zhang , Errui Ding , Yuchao Li , Jie Chen , Yonghong Tian , Rongrong Ji

分类：计算机视觉

2021-08-19

尽管在许多计算机视觉任务上具有卓越的性能，但深度卷积神经网络众所周知，在具有资源限制的设备上被压缩。大多数现有的网络修剪方法需要艰苦的人类努力和禁止的计算资源，特别是当约束改变时。当需要部署在各种设备上时，这实际上限制了模型压缩的应用。此外，现有的方法仍然受到缺失的理论指导挑战。在本文中，我们提出了一种信息理论启发的自动模型压缩策略。我们的方法背后的原理是信息瓶颈理论，即隐藏的表示应该彼此压缩信息。因此，我们在网络激活中介绍了标准化的Hilbert-Schmidt独立性标准（NHSIC），作为层重要性的稳定和广义指标。当给出某个资源约束时，我们将HSIC指示器与约束将架构搜索问题转换为具有二次约束的线性编程问题。这种问题很容易通过几秒钟的凸优化方法解决。我们还提供严格的证据，揭示优化归一化的HSIC同时最小化不同层之间的相互信息。没有任何搜索过程，我们的方法实现了与最先进的压缩算法相比的更好的压缩权衡。例如，通过Reset-50，我们达到了45.3％的杂志，在想象中有75.75前1个精度。代码是在https://github.com/mac-automl/itpruner/tree/master上的途径。

translated by 谷歌翻译

A deep learning-based remaining useful life prediction approach for bearings

Cheng Cheng , Guijun Ma , Yong Zhang , Mingyang Sun , Fei Teng , Han Ding , Ye Yuan

分类：机器学习 | (统计)机器学习

2018-12-08

在工业应用中，电动机的故障近一半是由于滚动元件轴承（REB）的退化引起的。因此，准确估算REB的剩余使用寿命（RUL）对于确保机械系统的可靠性和安全至关重要。为了应对这一挑战，基于模型的方法通常受到数学建模的复杂性的限制。另一方面，传统的数据驱动方法需要巨大的努力来提取降解功能并构建健康指数。在本文中，提出了一个新颖的在线数据驱动框架，以利用深度卷积神经网络（CNN）的采用来预测轴承的统治。更具体地说，训练轴承的原始振动首先是使用Hilbert-huang变换（HHT）处理的，并将新型的非线性降解指标构建为学习标签。然后使用CNN来识别提取的降解指示器和训练轴承振动之间的隐藏模式，这使得可以自动估计测试轴承的降解。最后，通过使用$ \ epsilon $ -Support向量回归模型来预测测试轴承的规定。与最先进的方法相比，提出的规则估计框架的出色性能通过实验结果证明。提出的CNN模型的一般性也通过转移到经历不同操作条件的轴承来验证。

translated by 谷歌翻译

Hierarchical Explanations for Video Action Recognition

Sadaf Gulshad , Teng Long , Nanne van Noord

分类：计算机视觉 | 人工智能 | 机器学习

2023-01-01

We propose Hierarchical ProtoPNet: an interpretable network that explains its reasoning process by considering the hierarchical relationship between classes. Different from previous methods that explain their reasoning process by dissecting the input image and finding the prototypical parts responsible for the classification, we propose to explain the reasoning process for video action classification by dissecting the input video frames on multiple levels of the class hierarchy. The explanations leverage the hierarchy to deal with uncertainty, akin to human reasoning: When we observe water and human activity, but no definitive action it can be recognized as the water sports parent class. Only after observing a person swimming can we definitively refine it to the swimming action. Experiments on ActivityNet and UCF-101 show performance improvements while providing multi-level explanations.

translated by 谷歌翻译

Theoretical Guarantees for Sparse Principal Component Analysis based on the Elastic Net

Teng Zhang , Haoyi Yang , Lingzhou Xue

分类： (统计)机器学习

2022-12-29

Sparse principal component analysis (SPCA) has been widely used for dimensionality reduction and feature extraction in high-dimensional data analysis. Despite there are many methodological and theoretical developments in the past two decades, the theoretical guarantees of the popular SPCA algorithm proposed by Zou, Hastie & Tibshirani (2006) based on the elastic net are still unknown. We aim to close this important theoretical gap in this paper. We first revisit the SPCA algorithm of Zou et al. (2006) and present our implementation. Also, we study a computationally more efficient variant of the SPCA algorithm in Zou et al. (2006) that can be considered as the limiting case of SPCA. We provide the guarantees of convergence to a stationary point for both algorithms. We prove that, under a sparse spiked covariance model, both algorithms can recover the principal subspace consistently under mild regularity conditions. We show that their estimation error bounds match the best available bounds of existing works or the minimax rates up to some logarithmic factors. Moreover, we demonstrate the numerical performance of both algorithms in simulation studies.

translated by 谷歌翻译