Smart Paper Notes

An Image Processing approach to identify solar plages observed at 393.37 nm by Kodaikanal Solar Observatory

Sarvesh Gharat , Bhaskar Bose

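Category: Computer Vision

2022-09-21

Solar plages are bright chromospheric features seen in Ca II K photographic observations of the Sun. They are regions of high magnetic field concentration and therefore tracers of solar magnetic activity. Because Ca II K images have been recorded for more than a century, plages are among the most important features for studying the Sun's long-term variability. However, detecting plages in a century-long database is a non-trivial task that requires significant human resources when done manually. In this study we therefore propose an image processing algorithm that identifies solar plages in Ca II K photographic observations. The study has been implemented on archival data from the Kodaikanal Solar Observatory. To check that the algorithm works regardless of noise level, brightness, and other image properties, we test it on a sample of images drawn at random from the data archive.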


A Weakly-Supervised Surface Crack Segmentation Method using Localisation with a Classifier and Thresholding

Jacob König , Mark Jenkins , Mike Mannion , Peter Barrie , Gordon Morison

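Category: Computer Vision

2021-09-01

Surface cracks are a common sight on public infrastructure nowadays. Recent work addresses this problem by supporting structural maintenance with machine learning methods that segment surface cracks from their background so that they are easy to localise. A common issue with these methods, however, is that a well-functioning algorithm needs training data with detailed annotations of the pixels that belong to cracks. Our work proposes a weakly supervised approach that leverages a CNN classifier to create surface crack segmentation maps. We use the classifier to build a rough crack localisation map from its class activation maps and a patch-based classification approach, and we fuse this map with a thresholding-based approach that segments the mostly darker crack pixels. The classifier helps suppress noise from background regions, which standard thresholding methods commonly highlight as cracks by mistake. We focus on ease of implementation, and the method is shown to perform well on several surface crack datasets, segmenting cracks efficiently even though the only data used for training were simple classification labels.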
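The fusion step described in this abstract can be illustrated with a minimal sketch. The abstract does not state the exact fusion rule, so the logical AND used here, the function name `fuse_crack_mask`, and both threshold values are assumptions made purely for illustration.

```python
def fuse_crack_mask(gray, localisation, dark_threshold=80, score_threshold=0.5):
    """Combine a dark-pixel threshold with a coarse classifier localisation map.

    gray:         2D list of grey levels in 0..255
    localisation: 2D list of classifier scores in 0..1 (same shape as gray)
    Both thresholds are hypothetical values, not taken from the paper.
    """
    mask = []
    for gray_row, score_row in zip(gray, localisation):
        mask.append([
            # Keep a pixel only if it is dark AND the classifier thinks
            # a crack is nearby; this suppresses dark background noise.
            1 if g < dark_threshold and s >= score_threshold else 0
            for g, s in zip(gray_row, score_row)
        ])
    return mask


if __name__ == "__main__":
    gray = [[30, 200], [40, 35]]
    scores = [[0.9, 0.9], [0.1, 0.8]]
    print(fuse_crack_mask(gray, scores))
```

In this toy input the pixel at row 1, column 0 is dark but lies in a region the classifier scores low, so it is dropped, which is the noise-suppression effect the abstract attributes to the classifier.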


Identification of chicken egg fertility using SVM classifier based on first-order statistical feature extraction

Shoffan Saifullah , Andiko Putro Suryotomo

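Category: Computer Vision

2022-01-10

This study aims to identify the fertility of chicken eggs using a support vector machine (SVM) classifier. The classification uses first-order statistical (FOS) parameters as the features extracted during identification. The work is motivated by the fact that the identification process is still carried out manually (conventionally). Although many identification technologies exist today, they still need further development, and this study is one such development in the field of image processing. The sample data come from a dataset used in previous research and comprise 100 egg images, each showing a single egg: 50 images of fertile eggs and 50 of infertile eggs. The egg images are fed into the image processing pipeline, whose first step is segmentation, which produces an image cropped to the object. The cropped image is then improved by preprocessing with grayscale conversion and image enhancement. The enhancement combines two methods: contrast-limited adaptive histogram equalization (CLAHE) and histogram equalization (HE). The enhanced image is the input to FOS feature extraction, which uses five parameters: mean, entropy, variance, skewness, and kurtosis. These five parameters are passed to the SVM classifier to identify egg fertility. In the experiments, the proposed method achieved an identification success rate of 84.57%. The method can therefore serve as a reference for improvement in future research. In addition, second-order feature extraction and improved supervised-learning classifiers could be used to raise its accuracy.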
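The five first-order statistical features named in this abstract can be computed from the grey-level histogram. The following is a minimal sketch, assuming the usual textbook definitions (kurtosis is the plain fourth standardised moment, not excess kurtosis, and entropy is in bits). The abstract does not say which variants the authors used, and the function name `first_order_stats` is hypothetical.

```python
import math
from collections import Counter


def first_order_stats(pixels):
    """Return mean, variance, skewness, kurtosis and entropy of grey levels.

    pixels: flat list of integer grey levels from a preprocessed image.
    """
    n = len(pixels)
    # Normalised histogram: probs[level] is the probability of that level.
    probs = {level: count / n for level, count in Counter(pixels).items()}

    mean = sum(level * p for level, p in probs.items())
    variance = sum((level - mean) ** 2 * p for level, p in probs.items())
    std = math.sqrt(variance)

    if std == 0:
        # A constant image has no spread; define the shape moments as 0.
        skewness = 0.0
        kurtosis = 0.0
    else:
        skewness = sum((level - mean) ** 3 * p for level, p in probs.items()) / std ** 3
        kurtosis = sum((level - mean) ** 4 * p for level, p in probs.items()) / std ** 4

    entropy = -sum(p * math.log2(p) for p in probs.values())

    return {
        "mean": mean,
        "variance": variance,
        "skewness": skewness,
        "kurtosis": kurtosis,
        "entropy": entropy,
    }


if __name__ == "__main__":
    print(first_order_stats([0, 0, 255, 255]))
```

The resulting five-element feature vector is what a classifier such as an SVM would receive for each egg image.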


Saliency-based segmentation of dermoscopic images using color information

Giuliana Ramella

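Category: Computer Vision

2020-11-26

Skin lesion segmentation is one of the crucial steps in efficient, non-invasive, computer-aided early diagnosis of melanoma. This paper investigates how colour information, in addition to saliency, can be used to determine the pigmented lesion region automatically. Unlike most existing segmentation methods, which use saliency alone to discriminate the skin lesion from the surrounding region, we propose a new method that employs a binarization process coupled with new perceptual criteria, inspired by human visual perception, related to the properties of saliency and of the colour of the input image data distribution. To improve the accuracy of the proposed method, the segmentation step is preceded by preprocessing aimed at reducing the computational burden, removing artefacts, and improving contrast. We evaluated the method on two public databases comprising 1497 dermoscopic images. We also compared its performance with classical and recent saliency-based methods designed explicitly for dermoscopic images. The qualitative and quantitative evaluation indicates that the method is promising: it produces accurate skin lesion segmentation and performs satisfactorily compared with other saliency-based segmentation methods.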


Exploring the Effects of Data Augmentation for Drivable Area Segmentation

Srinjoy Bhuiya , Ayushman Kumar , Sankalok Sen

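Category: Computer Vision | Artificial Intelligence

2022-08-06

Real-time segmentation of the drivable area plays a vital role in autonomous perception for cars. Deep learning based image segmentation models have advanced rapidly in recent years, but most of that progress has come from model architecture design. In any supervised deep learning problem related to segmentation, the success of the model depends on the amount and quality of the input training data. These data should contain a good variety of images for the segmentation model to work well. Problems with the annotations in a dataset can lead the model to an overwhelming number of Type I and Type II errors in testing and validation, which causes serious problems when trying to tackle real-world tasks. To address this and make models more accurate, dynamic, and robust, data augmentation is used, because it expands the sample training data and makes it better and more diverse overall. In our study we therefore investigate the benefits of data augmentation by analysing pre-existing image datasets and augmenting them accordingly. Our results show that the performance and robustness of existing state-of-the-art (SOTA) models can be increased substantially without any increase in model complexity or inference time. The augmentations used in this paper were chosen only after a thorough study of several other augmentation methods and strategies in wide use today and of their corresponding effects. All results are reported on the widely used Cityscapes dataset.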


Towards Automatic Cetacean Photo-Identification: A Framework for Fine-Grain, Few-Shot Learning in Marine Ecology

Cameron Trotter , Nick Wright , A. Stephen McGough , Matt Sharpe , Barbara Cheney , Mònica Arso Civil , Reny Tyson Moore , Jason Allen , Per Berggren

Category: Computer Vision | Artificial Intelligence | Machine Learning

2022-12-07

Photo-identification (photo-id) is one of the main non-invasive capture-recapture methods utilised by marine researchers for monitoring cetacean (dolphin, whale, and porpoise) populations. This method has historically been performed manually, resulting in high workload and cost due to the vast number of images collected. Recently, automated aids have been developed to help speed up photo-id, although they are often disjoint in their processing and do not utilise all available identifying information. Work presented in this paper aims to create a fully automatic photo-id aid capable of providing most likely matches based on all available information without the need for data pre-processing such as cropping. This is achieved through a pipeline of computer vision models and post-processing techniques aimed at detecting cetaceans in unedited field imagery before passing them downstream for individual-level catalogue matching. The system is capable of handling previously uncatalogued individuals and flagging these for investigation thanks to catalogue similarity comparison. We evaluate the system against multiple real-life photo-id catalogues, achieving mAP@IOU[0.5] of 0.91 and 0.96 for the task of dorsal fin detection on catalogues from Tanzania and the UK respectively, and top-10 accuracy of 83.1% and 97.5% for the task of individual classification on catalogues from the UK and USA.
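The detection metric quoted above, mAP@IOU[0.5], rests on the intersection over union (IoU) between a predicted and a ground-truth box. The sketch below shows only that building block, assuming axis-aligned boxes given as `(x_min, y_min, x_max, y_max)`. The function names are hypothetical and the full mAP computation (ranking detections and averaging precision over recall) is not shown.

```python
def box_iou(a, b):
    """Intersection over union of two axis-aligned boxes.

    Each box is a tuple (x_min, y_min, x_max, y_max).
    """
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    intersection = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - intersection

    return intersection / union if union > 0 else 0.0


def matches_at_threshold(predicted, truth, threshold=0.5):
    """A detection counts as a match when its IoU reaches the threshold."""
    return box_iou(predicted, truth) >= threshold


if __name__ == "__main__":
    print(box_iou((0, 0, 2, 2), (1, 1, 3, 3)))
    print(matches_at_threshold((0, 0, 2, 2), (0, 0, 2, 3)))
```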


A comprehensive survey on computer-aided diagnostic systems in diabetic retinopathy screening

Meysam Tavakoli , Patrick Kelley

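Category: Computer Vision

2022-08-03

Diabetes mellitus (DM) can lead to significant microvascular damage that eventually results in diabetic retinopathy (DR), an eye complication caused by diabetes. If left unchecked, the disease progresses over time and eventually causes complete vision loss. The general way to detect such optical developments is to examine the vessels, optic nerve head, microaneurysms, haemorrhages, exudates, and so on in retinal images. Ultimately this is limited by the number of experienced ophthalmologists and the vast number of DM cases. To enable early and efficient DR diagnosis, ophthalmology needs robust computer-aided diagnosis (CAD) systems. Our review is intended for anyone, from students to established researchers, who wants to understand what CAD systems and their algorithms can accomplish, how they are modelled, and where the field of retinal image processing in computer vision and pattern recognition is heading. For newcomers, we place special emphasis on the logic, strengths, and shortcomings of different databases and algorithmic frameworks, with a focus on recent approaches.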


Artifact Reduction in Fundus Imaging using Cycle Consistent Adversarial Neural Networks

Sai Koushik S S , K. G. Srinivasa

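Category: Computer Vision | Machine Learning

2021-12-25

Fundus images are very useful for identifying various ophthalmic diseases. However, artefacts severely affect the visibility of the retina, which may lead to misdiagnosis and, in turn, to more complicated problems. Because deep learning is a powerful tool for extracting patterns from data without human intervention, it can be applied to image-to-image translation problems. This paper attempts to correct the artefacts present in fundus images automatically. We use a CycleGAN-based model composed of residual blocks to reduce the artefacts in the images. A significant improvement over existing techniques is observed.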


A scalable framework for annotating photovoltaic cell defects in electroluminescence images

Urtzi Otamendi , Inigo Martinez , Igor G. Olaizola , Marco Quartulli

Category: Computer Vision | Artificial Intelligence | Machine Learning

2022-12-15

The correct functioning of photovoltaic (PV) cells is critical to ensuring the optimal performance of a solar plant. Anomaly detection techniques for PV cells can result in significant cost savings in operation and maintenance (O&M). Recent research has focused on deep learning techniques for automatically detecting anomalies in electroluminescence (EL) images. Automated anomaly annotations can improve current O&M methodologies and help develop decision-making systems to extend the life cycle of PV cells and predict failures. This paper addresses the lack of anomaly segmentation annotations in the literature by proposing a combination of state-of-the-art data-driven techniques to create a Golden Standard benchmark. The proposed method stands out for (1) its adaptability to new PV cell types, (2) its cost-efficient fine-tuning, and (3) its use of public datasets to generate advanced annotations. The methodology has been validated by annotating a widely used dataset, reducing the annotation cost by 60%.


Chromosome Segmentation Analysis Using Image Processing Techniques and Autoencoders

Amritha S Pallavoor , Prajwal A , Sundareshan TS , Sreekanth K Pallavoor

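Category: Computer Vision

2022-09-12

Chromosome analysis and identification from metaphase images is a critical part of cytogenetics-based medical diagnosis. It is mainly used to identify constitutional, prenatal, and acquired abnormalities in the diagnosis of genetic diseases and disorders. Identifying chromosomes from metaphase is a tedious process that requires trained personnel and several hours of work. The challenge lies especially in handling touching, overlapping, and clustered chromosomes in metaphase images. If these are not segmented properly, the result is wrong classification. We propose a method to automate the detection and segmentation of chromosomes from a given metaphase image and to classify them with a deep CNN architecture to determine the chromosome type. We use two methods to separate the overlapping chromosomes found in metaphases: one applies the watershed algorithm followed by autoencoders, and the other relies purely on the watershed algorithm. These methods combine automation with very minimal manual effort to perform the segmentation and produce the output. The manual effort ensures that human intuition is taken into account, especially when handling touching, overlapping, and clustered chromosomes. After segmentation, individual chromosome images are classified into their respective classes with 95.75% accuracy using a deep CNN model. Further, we apply a distribution strategy that assigns the chromosomes in a given output (which typically consists of 46 individual images in a normal human case) to their individual classes with an accuracy of 98%. Our study supports the conclusion that the purely manual effort involved in chromosome segmentation can be automated to a very good level with image processing techniques, producing reliable and satisfying results.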


Single image deep defocus estimation and its applications

Fernando J. Galetto , Guang Deng

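Category: Computer Vision

2021-07-30

Depth information is useful in many image processing applications. However, because taking a picture projects a 3D scene onto a 2D imaging sensor, the depth information is embedded in the image, and extracting it is a challenging task. The guiding principle is that the level of blur caused by defocus is related to the distance between the object and the focal plane. Based on this principle and the widely used assumption that Gaussian blur is a good model for defocus blur, we formulate the estimation of spatially varying defocus blur as a Gaussian blur classification problem. We solve it by training a deep neural network to classify an image patch into one of 20 levels of blurriness. We created a dataset of more than 500,000 image patches of size 32 × 32, which we used to train and test several well-known network models. We find that MobileNetV2 is suitable for this application because of its low memory requirements and high accuracy. The trained model determines the patch blurriness, which is then refined by applying an iterative weighted guided filter. The result is a defocus map that carries the degree of blur for each pixel. We compare the proposed method with state-of-the-art techniques and demonstrate its successful application to adaptive image enhancement, defocus magnification, and multi-focus image fusion.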


Improving Warped Planar Object Detection Network For Automatic License Plate Recognition

Nguyen Dinh Tra , Nguyen Cong Tri , Phan Duy Hung

Category: Computer Vision | Artificial Intelligence

2022-12-14

This paper aims to improve the accuracy of the Warped Planar Object Detection Network (WPOD-Net) through feature engineering. Specifically, we add knowledge about the edges in the image to enrich the information that the original WPOD-Net model uses to determine the license plate contour. The Sobel filter was selected experimentally and acts as a convolutional neural network layer. The edge information is combined with the existing information of the original network to create the final embedding vector. The proposed model was compared with the original model on a dataset that we collected for evaluation. The results are evaluated with the Quadrilateral Intersection over Union value and demonstrate that the model improves performance significantly.
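A Sobel filter can act as a convolutional layer because it is simply a fixed 3 × 3 kernel slid over the image. The sketch below illustrates that idea in plain Python, assuming "valid" padding and the cross-correlation convention used by CNN layers. How the paper merges the edge map with WPOD-Net's own features is not described in the abstract and is not shown here. All function names are hypothetical.

```python
import math

# Standard Sobel kernels for horizontal and vertical intensity changes.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def correlate3x3(image, kernel):
    """Slide a fixed 3x3 kernel over a 2D list (no padding, stride 1)."""
    height, width = len(image), len(image[0])
    output = []
    for r in range(height - 2):
        row = []
        for c in range(width - 2):
            acc = 0
            for i in range(3):
                for j in range(3):
                    acc += kernel[i][j] * image[r + i][c + j]
            row.append(acc)
        output.append(row)
    return output


def sobel_magnitude(image):
    """Gradient magnitude from the two Sobel responses."""
    gx = correlate3x3(image, SOBEL_X)
    gy = correlate3x3(image, SOBEL_Y)
    return [
        [math.hypot(x, y) for x, y in zip(row_x, row_y)]
        for row_x, row_y in zip(gx, gy)
    ]


if __name__ == "__main__":
    # A vertical step edge between the second and third columns.
    step = [[0, 0, 1, 1] for _ in range(4)]
    print(sobel_magnitude(step))
```

In a deep learning framework the same effect would typically be obtained with a convolution layer whose weights are set to these kernels and frozen, so the edge map can be concatenated with learned feature maps.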


Human Treelike Tubular Structure Segmentation: A Comprehensive Review and Future Perspectives

Hao Li , Zeyu Tang , Yang Nan , Guang Yang

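Category: Computer Vision | Machine Learning

2022-07-12

Various structures in human physiology follow a treelike morphology, which often expresses complexity at very fine scales. Examples of such structures are the intrathoracic airways, retinal blood vessels, and hepatic blood vessels. Medical imaging modalities such as magnetic resonance imaging (MRI), computed tomography (CT), and optical coherence tomography (OCT) have made available large collections of 2D and 3D images in which the spatial arrangement of these structures can be observed. Segmenting these structures in medical images is of great importance, because analysing them provides insight into disease diagnosis, treatment planning, and prognosis. Manual labelling of extensive data by radiologists is often time-consuming and error-prone. As a result, automated or semi-automated computational models have become a popular research field in medical imaging over the past two decades, and many have been developed to date. In this survey we aim to provide a comprehensive review of currently publicly available datasets, segmentation algorithms, and evaluation metrics. Current challenges and future research directions are also discussed.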


Learning to See in the Dark

Chen Chen , Qifeng Chen , Jia Xu , Vladlen Koltun

Category:

2018-05-04

Figure 1. Extreme low-light imaging with a convolutional network. Dark indoor environment. The illuminance at the camera is < 0.1 lux. The Sony α7S II sensor is exposed for 1/30 second. (a) Image produced by the camera with ISO 8,000. (b) Image produced by the camera with ISO 409,600. The image suffers from noise and color bias. (c) Image produced by our convolutional network applied to the raw sensor data from (a).


Digitizing Historical Balance Sheet Data: A Practitioner's Guide

Sergio Correia , Stephan Luck

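Category: Computer Vision

2022-03-31

This paper discusses how to digitize large-scale historical micro-data successfully by augmenting optical character recognition (OCR) engines with pre-processing and post-processing methods. Although OCR software has improved dramatically in recent years thanks to advances in machine learning, off-the-shelf OCR applications still show high error rates, which limits their use for accurately extracting structured information. Complementing OCR with additional methods, however, can raise its success rate dramatically, making it a powerful and cost-efficient tool for economic historians. This paper showcases these methods and explains why they are useful. We apply them to two large balance sheet datasets and introduce Quipucamayoc, a Python package that contains these methods in a unified framework.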


Guided interactive image segmentation using machine learning and color based data set clustering

Adrian Friebel , Tim Johann , Dirk Drasdo , Stefan Hoehme

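Category: Computer Vision

2020-05-15

We present a novel approach that combines machine learning based interactive image segmentation using superpixels with a clustering method for automatically identifying similarly coloured images in large datasets, which enables guided reuse of classifiers. Our approach addresses the problem of colour variability, which is prevalent and often unavoidable in biological and medical images and typically degrades segmentation and quantification accuracy, and thereby greatly reduces the necessary training effort. This gain in efficiency facilitates the quantification of much larger numbers of images, enabling interactive image analysis to keep pace with recent technological advances in high-throughput imaging. The approach is applicable to almost any image type and is a useful tool for image analysis tasks in general.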


Global contrast based salient region detection

Category:

Reliable estimation of visual saliency allows appropriate processing of images without prior knowledge of their contents, and thus remains an important step in many computer vision tasks including image segmentation, object recognition, and adaptive compression. We propose a regional contrast based saliency extraction algorithm, which simultaneously evaluates global contrast differences and spatial coherence. The proposed algorithm is simple, efficient, and yields full resolution saliency maps. Our algorithm consistently outperformed existing saliency detection methods, yielding higher precision and better recall rates, when evaluated using one of the largest publicly available data sets. We also demonstrate how the extracted saliency map can be used to create high quality segmentation masks for subsequent image processing.
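The notion of global contrast in this abstract can be illustrated with a deliberately simplified sketch: a grey level is salient when it differs strongly from the other grey levels in the image, weighted by how often they occur. This is an assumption-laden illustration on greyscale values only. It does not implement the regional contrast or spatial coherence terms the abstract mentions, and the function name is hypothetical.

```python
from collections import Counter


def global_contrast_saliency(pixels):
    """Per-pixel saliency from histogram-weighted grey-level contrast.

    pixels: flat list of integer grey levels.
    Returns a list of saliency values scaled to 0..1.
    """
    n = len(pixels)
    histogram = Counter(pixels)

    # Contrast of one level = frequency-weighted distance to all levels.
    # Computing it per level (not per pixel) keeps the cost low.
    raw = {
        level: sum(count * abs(level - other) for other, count in histogram.items()) / n
        for level in histogram
    }

    top = max(raw.values())
    if top == 0:
        # A uniform image has no contrast anywhere.
        return [0.0] * n
    return [raw[p] / top for p in pixels]


if __name__ == "__main__":
    # Nine dark background pixels and one bright outlier.
    print(global_contrast_saliency([10] * 9 + [200]))
```

The rare bright pixel receives the highest score because it is far from the dominant grey level, which is the intuition behind contrast-based saliency.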


Segmentation of Blood Vessels, Optic Disc Localization, Detection of Exudates and Diabetic Retinopathy Diagnosis from Digital Fundus Images

Soham Basu , Sayantan Mukherjee , Ankit Bhattacharya , Anindya Sen

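Category: Computer Vision

2022-07-09

Diabetic retinopathy (DR) is a complication of long-standing, unchecked diabetes and one of the leading causes of blindness in the world. This paper focuses on improved and robust methods for extracting some of the features of DR, namely blood vessels and exudates. Blood vessels are segmented using multiple morphological and thresholding operations. For the segmentation of exudates, k-means clustering and contour detection on the original images are used. Extensive noise reduction is performed to remove false positives from the output of the vessel segmentation algorithm. Localization of the optic disc using k-means clustering and template matching is also performed. Finally, the paper presents a deep convolutional neural network (DCNN) model with 14 convolutional layers and 2 fully connected layers for automatic, binary diagnosis of DR. The vessel segmentation, optic disc localization, and DCNN achieve accuracies of 95.93%, 98.77%, and 75.73%, respectively. The source code and pre-trained model are available at https://github.com/sohambasu07/dr_2021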


A Segmentation Method for fluorescence images without a machine learning approach

Giuseppe Giacopelli , Michele Migliore , Domenico Tegolo

Category: Computer Vision | Artificial Intelligence

2022-12-28

Background: Image analysis applications in digital pathology include various methods for segmenting regions of interest. Identifying these regions is one of the most complex steps, so robust methods that do not necessarily rely on a machine learning (ML) approach are of great interest. Method: A fully automatic and optimized segmentation process for different datasets is a prerequisite for classifying and diagnosing Indirect ImmunoFluorescence (IIF) raw data. This study describes a deterministic computational neuroscience approach for identifying cells and nuclei. It is far from the conventional neural network approach, yet it matches the quantitative and qualitative performance of such networks and is also robust to adverse noise. The method is robust, based on formally correct functions, and does not need tuning on specific datasets. Results: This work demonstrates the robustness of the method against the variability of parameters such as image size, mode, and signal-to-noise ratio. We validated the method on two datasets (Neuroblastoma and NucleusSegData) using images annotated by independent medical doctors. Conclusions: The definition of deterministic and formally correct methods, from a functional to a structural point of view, guarantees the achievement of optimized and functionally correct results. The excellent performance of our deterministic method (NeuronalAlg) in segmenting cells and nuclei from fluorescence images was measured with quantitative indicators and compared with that achieved by three published ML approaches.


Contrast Enhancement of Brightness-Distorted Images by Improved Adaptive Gamma Correction

Gang Cao , Lihui Huang , Huawei Tian , Xianglin Huang , Yongbin Wang , Ruicong Zhi

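Category: Computer Vision

2017-09-13

Adaptive gamma correction (AGC) was previously proposed as an efficient image contrast enhancement (CE) tool, in which the gamma parameter is tied to the cumulative distribution function (CDF) of the pixel gray levels within an image. AGC deals well with most dimmed images but fails for globally bright images and for dimmed images with local bright regions. These two categories of brightness-distorted images are common in real scenarios, for example under improper exposure or with white object regions. To attenuate these deficiencies, we propose an improved AGC algorithm. A novel negative-image strategy is used to realize CE of bright images, and gamma correction modulated by a truncated CDF is used to enhance the dimmed ones. Local over-enhancement and structure distortion can thereby be alleviated. Both qualitative and quantitative experimental results show that the proposed method consistently yields good CE results.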
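The two ideas in this abstract, a gamma tied to the CDF and a negative-image route for bright inputs, can be sketched as follows. This is a minimal illustration under stated assumptions: the per-level exponent is taken as `1 - cdf(level)`, a common AGC formulation. The bright/dim decision by mean grey level is a hypothetical rule, and the truncated-CDF modulation from the paper is not implemented. Function names are invented for the example.

```python
def cumulative_distribution(pixels, levels=256):
    """CDF of grey levels: cdf[l] is the fraction of pixels with value <= l."""
    histogram = [0] * levels
    for p in pixels:
        histogram[p] += 1
    cdf, running = [], 0
    for count in histogram:
        running += count
        cdf.append(running / len(pixels))
    return cdf


def adaptive_gamma(pixels, levels=256):
    """Brighten each level with exponent gamma = 1 - cdf(level) (assumed form)."""
    max_level = levels - 1
    cdf = cumulative_distribution(pixels, levels)
    output = []
    for p in pixels:
        if p == 0:
            output.append(0)  # keep black at black (avoids 0 ** 0 == 1)
            continue
        gamma = 1.0 - cdf[p]
        output.append(round(max_level * (p / max_level) ** gamma))
    return output


def enhance(pixels, levels=256):
    """Use the negative image for globally bright inputs (hypothetical rule)."""
    max_level = levels - 1
    if sum(pixels) / len(pixels) > max_level / 2:
        negative = [max_level - p for p in pixels]
        corrected = adaptive_gamma(negative, levels)
        return [max_level - p for p in corrected]
    return adaptive_gamma(pixels, levels)


if __name__ == "__main__":
    print(enhance([10, 20, 40, 80]))      # dim input is brightened
    print(enhance([200, 220, 240, 250]))  # bright input is darkened
```

Note that in this plain form the brightest occupied level gets an exponent of 0 and is pushed to full scale, which shows how aggressive an unmodified CDF-driven gamma can be.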
