智能论文笔记

BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning

Yi Gu , Yoshito Otake , Keisuke Uemura , Mazen Soufi , Masaki Takao , Nobuhiko Sugano , Yoshinobu Sato

分类：计算机视觉

2022-07-07

我们提出了一种从普通X射线图像中估算骨矿物质密度（BMD）的方法。双能X射线吸收法（DXA）和定量计算机断层扫描（QCT）在诊断骨质疏松症方面具有很高的精度；但是，这些方式需要特殊的设备和扫描协议。测量X射线图像的BMD提供了机会筛查，这对于早期诊断可能有用。先前直接了解X射线图像和BMD之间关系的方法需要大型训练数据集，以实现高精度，因为X射线图像中的强度很大。因此，我们提出了一种使用QCT训练生成对抗网络（GAN）的方法，并将X射线图像分解为骨分割QCT的投影。提出的分层学习提高了定量分解小区域目标的鲁棒性和准确性。使用拟议的方法对200例骨关节炎评估，我们将其命名为BMD-GAN，在预测和地面真实DXA测量的BMD之间显示出Pearson相关系数为0.888。除了不需要大规模训练数据库外，我们方法的另一个优点是它的扩展性对其他解剖区域，例如椎骨和肋骨。

translated by 谷歌翻译

TBI-GAN: An Adversarial Learning Approach for Data Synthesis on Traumatic Brain Segmentation

Xiangyu Zhao , Di Zang , Sheng Wang , Zhenrong Shen , Kai Xuan , Zeyu Wei , Zhe Wang , Ruizhe Zheng , Xuehai Wu , Zheren Li

分类：计算机视觉

2022-08-12

创伤性脑损伤（TBI）患者的脑网络分析对于其意识水平评估和预后评估至关重要，这需要分割某些意识相关的大脑区域。但是，由于很难收集TBI患者的手动注释的MR扫描，因此很难构建TBI分割模型。数据增强技术可用于缓解数据稀缺问题。但是，常规数据增强策略（例如空间和强度转化）无法模仿创伤性大脑中的变形和病变，这限制了后续分割任务的性能。为了解决这些问题，我们提出了一种名为TBIGA的新型医学图像授课模型，以通过配对的脑标签图合成TBI MR扫描。我们的TBIGAN方法的主要优势在于，它可以同时生成TBI图像和相应的标签映射，这在以前的医学图像的先前涂上方法中尚未实现。我们首先按照粗到细节的方式在边缘信息的指导下生成成分的图像，然后将合成强度图像用作标签上填充的先验。此外，我们引入了基于注册的模板增强管道，以增加合成图像对的多样性并增强数据增强能力。实验结果表明，提出的TBIGAN方法可以产生具有高质量和有效标签图的足够合成的TBI图像，这可以大大改善与替代方案相比的2D和3D创伤性脑部分割性能。

translated by 谷歌翻译

Body Composition Assessment with Limited Field-of-view Computed Tomography: A Semantic Image Extension Perspective

Kaiwen Xu , Thomas Li , Mirza S. Khan , Riqiang Gao , Sanja L. Antic , Yuankai Huo , Kim L. Sandler , Fabien Maldonado , Bennett A. Landman

分类：计算机视觉

2022-07-13

肺部以外的视野（FOV）组织截断在常规的肺筛查计算机断层扫描（CT）中很常见。这对机会性CT的身体组成（BC）评估构成了局限性，因为缺少关键的解剖结构。传统上，扩展CT的FOV被认为是使用有限数据的CT重建问题。但是，这种方法依赖于应用程序中可能无法使用的投影域数据。在这项工作中，我们从语义图像扩展角度提出问题，该角度仅需要图像数据作为输入。提出的两阶段方法根据完整体的估计范围识别新的FOV边框，并在截短区域中渗出了缺失的组织。使用在FOV中具有完整主体的CT切片对训练样品进行模拟，从而使模型开发自制。我们使用有限FOV的肺筛选CT评估了所提出的方法在自动BC评估中的有效性。提出的方法有效地恢复了缺失的组织并减少了FOV组织截断引入的BC评估误差。在大规模肺部筛查CT数据集的BC评估中，这种校正既可以提高受试者内的一致性和与人体测量近似值的相关性。已开发的方法可在https://github.com/masilab/s-efov上获得。

translated by 谷歌翻译

Segmentation-guided Domain Adaptation and Data Harmonization of Multi-device Retinal Optical Coherence Tomography using Cycle-Consistent Generative Adversarial Networks

Shuo Chen , Da Ma , Sieun Lee , Timothy T. L. Yu , Gavin Xu , Donghuan Lu , Karteek Popuri , Myeong Jin Ju , Marinko V. Sarunic , Mirza Faisal Beg

分类：计算机视觉 | 机器学习

2022-08-31

光学相干断层扫描（OCT）是一种非侵入性技术，可在微米分辨率中捕获视网膜的横截面区域。它已被广泛用作辅助成像参考，以检测与眼睛有关的病理学并预测疾病特征的纵向进展。视网膜层分割是至关重要的特征提取技术之一，其中视网膜层厚度的变化和由于液体的存在而引起的视网膜层变形高度相关，与多种流行性眼部疾病（如糖尿病性视网膜病）和年龄相关的黄斑疾病高度相关。变性（AMD）。但是，这些图像是从具有不同强度分布或换句话说的不同设备中获取的，属于不同的成像域。本文提出了一种分割引导的域适应方法，以将来自多个设备的图像调整为单个图像域，其中可用的最先进的预训练模型可用。它避免了即将推出的新数据集的手动标签的时间消耗以及现有网络的重新培训。网络的语义一致性和全球特征一致性将最大程度地减少许多研究人员报告的幻觉效果，这些效应对周期矛盾的生成对抗网络（Cyclegan）体系结构。

translated by 谷歌翻译

HTML版本

SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments

Cong Gao , Benjamin D. Killeen , Yicheng Hu , Robert B. Grupp , Russell H. Taylor , Mehran Armand , Mathias Unberath

分类：计算机视觉 | 机器学习

2022-06-13

现在，人工智能（AI）可以自动解释医学图像以供临床使用。但是，AI在介入图像中的潜在用途（相对于参与分类或诊断的图像），例如在手术期间的指导，在很大程度上尚未开发。这是因为目前，使用现场分析对现场手术收集的数据进行了事后分析，这是因为手术AI系统具有基本和实际限制，包括道德考虑，费用，可扩展性，数据完整性以及缺乏地面真相。在这里，我们证明从人类模型中创建逼真的模拟图像是可行的替代方法，并与大规模的原位数据收集进行了补充。我们表明，对现实合成数据的训练AI图像分析模型，结合当代域的概括或适应技术，导致在实际数据上的模型与在精确匹配的真实数据训练集中训练的模型相当地执行的模型。由于从基于人类的模型尺度的合成生成培训数据，因此我们发现我们称为X射线图像分析的模型传输范式（我们称为Syntheex）甚至可以超越实际数据训练的模型，因为训练的有效性较大的数据集。我们证明了合成在三个临床任务上的潜力：髋关节图像分析，手术机器人工具检测和COVID-19肺病变分割。 Synthex提供了一个机会，可以极大地加速基于X射线药物的智能系统的概念，设计和评估。此外，模拟图像环境还提供了测试新颖仪器，设计互补手术方法的机会，并设想了改善结果，节省时间或减轻人为错误的新技术，从实时人类数据收集的道德和实际考虑方面摆脱了人为错误。

translated by 谷歌翻译

2D/3D Deep Image Registration by Learning 3D Displacement Fields for Abdominal Organs

Ryuto Miura , Megumi Nakao , Mitsuhiro Nakamura , Tetsuya Matsuda

分类：计算机视觉

2022-12-11

Deformable registration of two-dimensional/three-dimensional (2D/3D) images of abdominal organs is a complicated task because the abdominal organs deform significantly and their contours are not detected in two-dimensional X-ray images. We propose a supervised deep learning framework that achieves 2D/3D deformable image registration between 3D volumes and single-viewpoint 2D projected images. The proposed method learns the translation from the target 2D projection images and the initial 3D volume to 3D displacement fields. In experiments, we registered 3D-computed tomography (CT) volumes to digitally reconstructed radiographs generated from abdominal 4D-CT volumes. For validation, we used 4D-CT volumes of 35 cases and confirmed that the 3D-CT volumes reflecting the nonlinear and local respiratory organ displacement were reconstructed. The proposed method demonstrate the compatible performance to the conventional methods with a dice similarity coefficient of 91.6 \% for the liver region and 85.9 \% for the stomach region, while estimating a significantly more accurate CT values.

translated by 谷歌翻译

Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling

Fakai Wang , Kang Zheng , Le Lu , Jing Xiao , Min Wu , Chang-Fu Kuo , Shun Miao

分类：计算机视觉

2022-01-05

骨质疏松症是一种常见的慢性代谢骨病，通常是由于对骨矿物密度（BMD）检查有限的有限获得而被诊断和妥善治疗，例如。通过双能X射线吸收测定法（DXA）。在本文中，我们提出了一种方法来预测来自胸X射线（CXR）的BMD，最常见的和低成本的医学成像考试之一。我们的方法首先自动检测来自CXR的局部和全球骨骼结构的感兴趣区域（ROI）。然后，开发了一种具有变压器编码器的多ROI深模型，以利用胸部X射线图像中的本地和全局信息以进行准确的BMD估计。我们的方法在13719 CXR患者病例中进行评估，并通过金标准DXA测量其实际BMD评分。该模型预测的BMD与地面真理（Pearson相关系数0.889腰腰1）具有强烈的相关性。当施用骨质疏松症筛查时，它实现了高分类性能（腰腰1的AUC 0.963）。作为现场使用CXR扫描预测BMD的第一次努力，所提出的算法在早期骨质疏松症筛查和公共卫生促进中具有很强的潜力。

translated by 谷歌翻译

DC-cycleGAN: Bidirectional CT-to-MR Synthesis from Unpaired Data

Jiayuan Wang , Q. M. Jonathan Wu , Farhad Pourpanah

分类：计算机视觉 | 机器学习

2022-11-02

Magnetic resonance (MR) and computer tomography (CT) images are two typical types of medical images that provide mutually-complementary information for accurate clinical diagnosis and treatment. However, obtaining both images may be limited due to some considerations such as cost, radiation dose and modality missing. Recently, medical image synthesis has aroused gaining research interest to cope with this limitation. In this paper, we propose a bidirectional learning model, denoted as dual contrast cycleGAN (DC-cycleGAN), to synthesize medical images from unpaired data. Specifically, a dual contrast loss is introduced into the discriminators to indirectly build constraints between real source and synthetic images by taking advantage of samples from the source domain as negative samples and enforce the synthetic images to fall far away from the source domain. In addition, cross-entropy and structural similarity index (SSIM) are integrated into the DC-cycleGAN in order to consider both the luminance and structure of samples when synthesizing images. The experimental results indicate that DC-cycleGAN is able to produce promising results as compared with other cycleGAN-based medical image synthesis methods such as cycleGAN, RegGAN, DualGAN, and NiceGAN. The code will be available at https://github.com/JiayuanWang-JW/DC-cycleGAN.

translated by 谷歌翻译

Enhanced artificial intelligence-based diagnosis using CBCT with internal denoising: Clinical validation for discrimination of fungal ball, sinusitis, and normal cases in the maxillary sinus

Kyungsu Kim , Chae Yeon Lim , Joong Bo Shin , Myung Jin Chung , Yong Gi Jung

分类：计算机视觉

2022-11-29

The cone-beam computed tomography (CBCT) provides 3D volumetric imaging of a target with low radiation dose and cost compared with conventional computed tomography, and it is widely used in the detection of paranasal sinus disease. However, it lacks the sensitivity to detect soft tissue lesions owing to reconstruction constraints. Consequently, only physicians with expertise in CBCT reading can distinguish between inherent artifacts or noise and diseases, restricting the use of this imaging modality. The development of artificial intelligence (AI)-based computer-aided diagnosis methods for CBCT to overcome the shortage of experienced physicians has attracted substantial attention. However, advanced AI-based diagnosis addressing intrinsic noise in CBCT has not been devised, discouraging the practical use of AI solutions for CBCT. To address this issue, we propose an AI-based computer-aided diagnosis method using CBCT with a denoising module. This module is implemented before diagnosis to reconstruct the internal ground-truth full-dose scan corresponding to an input CBCT image and thereby improve the diagnostic performance. The external validation results for the unified diagnosis of sinus fungal ball, chronic rhinosinusitis, and normal cases show that the proposed method improves the micro-, macro-average AUC, and accuracy by 7.4, 5.6, and 9.6% (from 86.2, 87.0, and 73.4 to 93.6, 92.6, and 83.0%), respectively, compared with a baseline while improving human diagnosis accuracy by 11% (from 71.7 to 83.0%), demonstrating technical differentiation and clinical effectiveness. This pioneering study on AI-based diagnosis using CBCT indicates denoising can improve diagnostic performance and reader interpretability in images from the sinonasal area, thereby providing a new approach and direction to radiographic image reconstruction regarding the development of AI-based diagnostic solutions.

translated by 谷歌翻译

IGCN: Image-to-graph Convolutional Network for 2D/3D Deformable Registration

Megumi Nakao , Mitsuhiro Nakamura , Tetsuya Matsuda

分类：计算机视觉 | 机器学习

2021-10-31

基于治疗期间的单投影图像的器官形状重建具有广泛的临床范围，例如在图像引导放射治疗和手术指导中。我们提出了一种图形卷积网络，该网络实现了用于单视点2D投影图像的3D器官网格的可变形登记。该框架使得能够同时训练两种类型的变换：从2D投影图像到位移图，以及从采样的每周顶点特征到满足网格结构的几何约束的3D位移。假设申请放射治疗，验证了2D / 3D可变形的登记性能，用于尚未瞄准迄今为止，即肝脏，胃，十二指肠和肾脏以及胰腺癌的多个腹部器官。实验结果表明，考虑多个器官之间的关系的形状预测可用于预测临床上可接受的准确性的数字重建射线照片的呼吸运动和变形。

translated by 谷歌翻译

Opportunistic hip fracture risk prediction in Men from X-ray: Findings from the Osteoporosis in Men (MrOS) Study

Lars Schmarje , Stefan Reinhold , Timo Damm , Eric Orwoll , Claus-C. Glüer , Reinhard Koch

分类：计算机视觉

2022-07-22

骨质疏松症是一种常见疾病，可增加骨折风险。髋部骨折，尤其是在老年人中，导致发病率增加，生活质量降低和死亡率增加。骨质疏松症在骨折前是一种沉默的疾病，通常仍未被诊断和治疗。通过双能X射线吸收法（DXA）评估的面骨矿物质密度（ABMD）是骨质疏松诊断的金标准方法，因此也用于未来的骨折预测（Pregnosticic）。但是，所需的特殊设备在任何地方都没有广泛可用，特别是对于发展中国家的患者而言。我们提出了一个深度学习分类模型（形式），该模型可以直接预测计算机断层扫描（CT）数据的普通X光片（X射线）或2D投影图像。我们的方法是完全自动化的，因此非常适合机会性筛查设置，确定了更广泛的人群中的高风险患者而没有额外的筛查。对男性骨质疏松症（MROS）研究的X射线和CT投影进行了训练和评估。使用了3108张X射线（89个事件髋部骨折）或2150 CTS（80个入射髋部骨折），并使用了80/20分。我们显示，表格可以正确预测10年的髋部骨折风险，而验证AUC为81.44 +-3.11％ / 81.04 +-5.54％（平均 +-STD），包括其他信息，例如年龄，BMI，秋季历史和健康背景， X射线和CT队列的5倍交叉验证。我们的方法显着（p <0.01）在X射线队列上分别优于以70.19 +-6.58和74.72 +-7.21为70.19 +-6.58和74.72 +-7.21的\ frax等先前的方法。我们的模型在两个基于髋关节ABMD的预测上都跑赢了。我们有信心形式可以在早期阶段改善骨质疏松症的诊断。

translated by 谷歌翻译

Deformable Image Registration using Unsupervised Deep Learning for CBCT-guided Abdominal Radiotherapy

Huiqiao Xie , Yang Lei , Yabo Fu , Tonghe Wang , Justin Roper , Jeffrey D. Bradley , Pretesh Patel , Tian Liu , Xiaofeng Yang

分类：计算机视觉

2022-08-29

图像引导放射疗法中的CBCT为患者的设置和计划评估提供了关键的解剖学信息。纵向CBCT图像登记可以量化分裂间的解剖变化。这项研究的目的是提出一个无监督的基于深度学习的CBCT-CBCT变形图像登记。提出的可变形注册工作流程包括训练和推理阶段，这些培训和推理阶段通过基于空间转换的网络（STN）共享相同的进率前路。 STN由全球生成对抗网络（Globalgan）和本地GAN（Localgan）组成，分别预测了粗略和细尺度运动。通过最小化图像相似性损失和可变形矢量场（DVF）正则化损失，而无需监督地面真实DVF的训练，对网络进行了训练。在推理阶段，训练有素的Localgan预测了局部DVF的斑块，并融合形成全图像DVF。随后将局部全图像DVF与Globalgan生成的DVF合并以获得最终的DVF。在实验中，使用来自20名腹部癌症患者的100个分数CBCT评估了该方法，并在保持测试中来自21名不同腹部癌症患者的队列中的105个分数CBCT。从定性上讲，注册结果显示了变形的CBCT图像与目标CBCT图像之间的对齐。定量地，在基准标记和手动确定的地标计算的平均目标注册误差（TRE）为1.91+-1.11 mm。变形CBCT和目标CBCT之间的平均平均绝对误差（MAE），归一化的跨相关性（NCC）分别为33.42+-7.48 HU，0.94+-0.04。这种有希望的注册方法可以提供快速准确的纵向CBCT对准，以促进分流的解剖变化分析和预测。

translated by 谷歌翻译

HTML版本

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

Yu-Shian Lin , Rui-Yang Ju , Chih-Chia Chen , Ting-Yu Lin , Jen-Shiun Chiang

分类：计算机视觉

2022-11-29

The efficient segmentation of foreground text information from the background in degraded color document images is a hot research topic. Due to the imperfect preservation of ancient documents over a long period of time, various types of degradation, including staining, yellowing, and ink seepage, have seriously affected the results of image binarization. In this paper, a three-stage method is proposed for image enhancement and binarization of degraded color document images by using discrete wavelet transform (DWT) and generative adversarial network (GAN). In Stage-1, we use DWT and retain the LL subband images to achieve the image enhancement. In Stage-2, the original input image is split into four (Red, Green, Blue and Gray) single-channel images, each of which trains the independent adversarial networks. The trained adversarial network models are used to extract the color foreground information from the images. In Stage-3, in order to combine global and local features, the output image from Stage-2 and the original input image are used to train the independent adversarial networks for document binarization. The experimental results demonstrate that our proposed method outperforms many classical and state-of-the-art (SOTA) methods on the Document Image Binarization Contest (DIBCO) dataset. We release our implementation code at https://github.com/abcpp12383/ThreeStageBinarization.

translated by 谷歌翻译

Automated Precision Localization of Peripherally Inserted Central Catheter Tip through Model-Agnostic Multi-Stage Networks

Subin Park , Yoon Ki Cha , Soyoung Park , Kyung-Su Kim , Myung Jin Chung

分类：计算机视觉

2022-06-14

外围插入的中央导管（PICC）由于其长期的血管内渗透感具有低感染率，因此已被广泛用作代表性的中央静脉线（CVC）之一。但是，PICC的尖端错位频率很高，增加了刺穿，栓塞和心律不齐等并发症的风险。为了自动，精确地检测到它，使用最新的深度学习（DL）技术进行了各种尝试。但是，即使采用了这些方法，实际上仍然很难确定尖端位置，因为多个片段现象（MFP）发生在预测和提取PICC线之前预测尖端之前所需的PICC线的过程。这项研究旨在开发一种通常应用于现有模型的系统，并通过删除模型输出的MF来更准确地恢复PICC线路，从而精确地定位了检测其处置的实际尖端位置。为此，我们提出了一个基于多阶段DL的框架后处理，以后处理现有技术的PICC线提取结果。根据是否将MFCN应用于五个常规模型，将每个均方根误差（RMSE）和MFP发病率比较性能。在内部验证中，当将MFCN应用于现有单个模型时，MFP平均提高了45％。 RMSE从平均26.85mm（17.16至35.80mm）到9.72mm（9.37至10.98mm）的平均增长了63％以上。在外部验证中，当应用MFCN时，MFP的发病率平均下降32％，RMSE平均下降了65 \％。因此，通过应用提出的MFCN，我们观察到与现有模型相比，PICC尖端位置的显着/一致检测性能提高。

translated by 谷歌翻译

Lesion-Specific Prediction with Discriminator-Based Supervised Guided Attention Module Enabled GANs in Multiple Sclerosis

Jueqi Wang , Derek Berger , Erin Mazerolle , Jean-Alexis Delamer , Jacob Levman

分类：计算机视觉

2022-08-30

多发性硬化症（MS）是一种慢性神经系统疾病，其特征是大脑白质病变的发展。相对于其他MRI模态，T2流体减弱的反转恢复（FLAIR）脑磁共振成像（MRI）提供了MS病变的卓越可视化和表征。 MS中的后续大脑FLAIR MRI为临床医生提供了有用的信息，以监测疾病进展。在这项研究中，我们提出了对生成对抗网络（GAN）的新颖修饰，以预测MS以固定时间间隔的MS预测未来病变特异性MRI。我们在鉴别器中使用受监督的引导注意力和扩张卷积，该歧视者支持对生成图像是否实现的明智预测，这是基于对病变区域的关注，这反过来又有可能帮助改善生成器以预测病变区域将来的考试更准确。我们将我们的方法与几个基线和一种最先进的CF-Sagan模型进行了比较[1]。总之，我们的结果表明，与其他总体性能相似的模型相比，所提出的方法可实现更高的准确性，并减少病变区域预测误差的标准偏差。

translated by 谷歌翻译

Adaptive Contrast for Image Regression in Computer-Aided Disease Assessment

Weihang Dai , Xiaomeng Li , Wan Hang Keith Chiu , Michael D. Kuo , Kwang-Ting Cheng

分类：计算机视觉

2021-12-22

图像回归任务，如骨矿物密度（BMD）估计和左心室喷射分数（LVEF）预测，在计算机辅助疾病评估中起重要作用。大多数深度回归方法用单一的回归损耗函数训练神经网络，如MSE或L1损耗。在本文中，我们提出了一种用于深度图像回归的第一个对比学习框架，即adacon，其包括通过新颖的自适应边缘对比损耗和回归预测分支的特征学习分支组成。我们的方法包含标签距离关系作为学习特征表示的一部分，这允许在下游回归任务中进行更好的性能。此外，它可以用作即插即用模块，以提高现有回归方法的性能。我们展示了adacon对来自X射线图像的骨矿物密度估计和来自超声心动图象的X射线图像和左心室喷射分数预测的骨矿物密度估计的有效性。 Adacon分别导致MAE在最先进的BMD估计和LVEF预测方法中相对提高3.3％和5.9％。

translated by 谷歌翻译

Human Treelike Tubular Structure Segmentation: A Comprehensive Review and Future Perspectives

Hao Li , Zeyu Tang , Yang Nan , Guang Yang

分类：计算机视觉 | 机器学习

2022-07-12

人类生理学中的各种结构遵循特异性形态，通常在非常细的尺度上表达复杂性。这种结构的例子是胸前气道，视网膜血管和肝血管。可以观察到可以观察到可以观察到可以观察到可以观察到空间排列的磁共振成像（MRI），计算机断层扫描（CT），光学相干断层扫描（OCT）等医学成像模式（MRI），计算机断层扫描（CT），可以观察到空间排列的大量2D和3D图像的集合。这些结构在医学成像中的分割非常重要，因为对结构的分析提供了对疾病诊断，治疗计划和预后的见解。放射科医生手动标记广泛的数据通常是耗时且容易出错的。结果，在过去的二十年中，自动化或半自动化的计算模型已成为医学成像的流行研究领域，迄今为止，许多计算模型已经开发出来。在这项调查中，我们旨在对当前公开可用的数据集，细分算法和评估指标进行全面审查。此外，讨论了当前的挑战和未来的研究方向。

translated by 谷歌翻译

Computer Vision on X-ray Data in Industrial Production and Security Applications: A survey

Mehdi Rafiei , Jenni Raitoharju , Alexandros Iosifidis

分类：计算机视觉

2022-11-10

X-ray imaging technology has been used for decades in clinical tasks to reveal the internal condition of different organs, and in recent years, it has become more common in other areas such as industry, security, and geography. The recent development of computer vision and machine learning techniques has also made it easier to automatically process X-ray images and several machine learning-based object (anomaly) detection, classification, and segmentation methods have been recently employed in X-ray image analysis. Due to the high potential of deep learning in related image processing applications, it has been used in most of the studies. This survey reviews the recent research on using computer vision and machine learning for X-ray analysis in industrial production and security applications and covers the applications, techniques, evaluation metrics, datasets, and performance comparison of those techniques on publicly available datasets. We also highlight some drawbacks in the published research and give recommendations for future research in computer vision-based X-ray analysis.

translated by 谷歌翻译

De-Noising of Photoacoustic Microscopy Images by Deep Learning

Da He , Jiasheng Zhou , Xiaoyu Shang , Jiajia Luo , Sung-Liang Chen

分类：机器学习

2022-01-12

作为混合成像技术，光声显微镜（PAM）成像由于激光强度的最大允许暴露，组织中超声波的衰减以及换能器的固有噪声而受到噪声。去噪是降低噪声的后处理方法，并且可以恢复PAM图像质量。然而，之前的去噪技术通常严重依赖于数学前导者以及手动选择的参数，导致对不同噪声图像的不令人满意和慢的去噪能，这极大地阻碍了实用和临床应用。在这项工作中，我们提出了一种基于深度学习的方法，可以从PAM图像中除去复杂的噪声，没有数学前导者，并手动选择不同输入图像的设置。注意增强的生成对抗性网络用于提取图像特征并去除各种噪声。在合成和实际数据集上证明了所提出的方法，包括幻影（叶静脉）和体内（小鼠耳血管和斑马鱼颜料）实验。结果表明，与先前的PAM去噪方法相比，我们的方法在定性和定量上恢复图像时表现出良好的性能。此外，为256次\ times256 $像素的图像实现了0.016 s的去噪速度。我们的方法对于PAM图像的去噪有效和实用。

translated by 谷歌翻译

Self-Attention Generative Adversarial Network for Iterative Reconstruction of CT Images

Ruiwen Xing , Thomas Humphries , Dong Si

分类：计算机视觉

2021-12-23

计算机断层扫描（CT）使用从身体周围的传感器取出的X射线测量以产生人体的断层图像。如果X射线数据充分采样和高质量，则可以使用传统的重建算法;然而，诸如将剂量减少给患者的问题，或数据采集的几何限制可能导致低质量或不完整的数据。由于噪声和其他伪像，使用传统方法从这些数据重建的图像具有差的质量。本研究的目的是训练单个神经网络，从嘈杂或不完全CT扫描数据重建高质量CT图像，包括低剂量，稀疏视图和有限的角度场景。为了完成这项任务，我们将生成的对冲网络（GaN）作为信号训练，以与CT数据的迭代同步代数重建技术（SART）结合使用。网络包括自我关注块，以模拟数据中的远程依赖性。我们将我们的自我关注GaN进行CT图像重建，包括几种最先进的方法，包括去噪循环GaN，Circle GaN和总变化的校长算法。我们的方法被证明是可以相当的整体性能来圈出GaN，同时优于其他两种方法。

translated by 谷歌翻译