Smart Paper Notes

Recovery of Continuous 3D Refractive Index Maps from Discrete Intensity-Only Measurements using Neural Fields

Renhao Liu , Yu Sun , Jiabei Zhu , Lei Tian , Ulugbek Kamilov

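Category: Computer Vision

2021-11-27

Intensity diffraction tomography (IDT) refers to a class of optical microscopy techniques for imaging the 3D refractive index (RI) distribution of a sample from a set of 2D intensity-only measurements. The reconstruction of artifact-free RI maps is a fundamental challenge in IDT due to the loss of phase information and the missing cone problem. Neural fields (NF) have recently emerged as a new deep learning (DL) approach for learning continuous representations of physical fields. NF uses a coordinate-based neural network to represent the field by mapping spatial coordinates to the corresponding physical quantities, in our case the complex-valued refractive index values. We present DECAF as the first NF-based IDT method that can learn a high-quality continuous representation of an RI volume from its intensity-only and limited-angle measurements. The representation in DECAF is learned directly from the measurements of the test sample by using the IDT forward model, without any ground-truth RI maps. We qualitatively and quantitatively evaluate DECAF on simulated and experimentally collected biological samples. Our results show that DECAF can generate high-contrast and artifact-free RI maps and leads to a 2.1× reduction in MSE over existing methods.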
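
The abstract above describes a coordinate-based network that maps a spatial coordinate to a complex-valued refractive index. The following is a minimal sketch of that idea only, not the DECAF architecture: the Fourier-feature encoding, the layer sizes, the random untrained weights, and the names `fourier_features` and `TinyField` are all illustrative assumptions.

```python
import math
import random


def fourier_features(coord, num_freqs=4):
    """Encode each coordinate with sines and cosines at doubling frequencies."""
    feats = []
    for x in coord:
        for k in range(num_freqs):
            feats.append(math.sin((2 ** k) * math.pi * x))
            feats.append(math.cos((2 ** k) * math.pi * x))
    return feats


class TinyField:
    """Hypothetical one-hidden-layer field: (x, y, z) -> complex value.

    Weights are random and untrained; in a real method they would be fitted
    by pushing the field through a forward model and comparing to measurements.
    """

    def __init__(self, in_dim, hidden=16, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.gauss(0.0, 1.0 / math.sqrt(in_dim)) for _ in range(in_dim)]
                   for _ in range(hidden)]
        self.b1 = [0.0] * hidden
        self.w2 = [[rng.gauss(0.0, 1.0 / math.sqrt(hidden)) for _ in range(hidden)]
                   for _ in range(2)]
        self.b2 = [0.0, 0.0]

    def __call__(self, coord):
        x = fourier_features(coord)
        # Hidden layer with ReLU activation.
        h = [max(0.0, sum(w * v for w, v in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        # Two outputs: real and imaginary parts of the field value.
        re, im = (sum(w * v for w, v in zip(row, h)) + b
                  for row, b in zip(self.w2, self.b2))
        return complex(re, im)


# 3 coordinates x 4 frequencies x (sin, cos) = 24 input features.
field = TinyField(in_dim=3 * 4 * 2)
value = field((0.1, -0.3, 0.5))
print(value)
```

Because the field is a function of continuous coordinates, it can be queried at any point rather than only on a fixed voxel grid, which is what makes the representation continuous.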

translated by 谷歌翻译

Tensorial tomographic differential phase-contrast microscopy

Shiqi Xu , Xiang Dai , Xi Yang , Kevin C. Zhou , Kanghyun Kim , Vinayak Pathak , Carolyn Glass , Roarke Horstmeyer

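Category: Computer Vision

2022-04-25

We report tensorial tomographic differential phase-contrast microscopy (T2DPC), a quantitative label-free tomographic imaging method for simultaneous measurement of phase and anisotropy. T2DPC extends differential phase-contrast microscopy, a quantitative phase imaging technique, to highlight the vectorial nature of light. The method solves for the permittivity tensor of anisotropic samples from intensity measurements acquired with a standard microscope equipped with an LED matrix, a circular polarizer, and a polarization-sensitive camera. We demonstrate accurate volumetric reconstructions of refractive index, birefringence, and orientation for various validation samples, and show that the reconstructed polarization structures of a biological specimen are predictive of pathology.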


FourierNets enable the design of highly non-local optical encoders for computational imaging

Diptodip Deb , Zhenfei Jiao , Ruth Sims , Alex B. Chen , Michael Broxton , Misha B. Ahrens , Kaspar Podgorski , Srinivas C. Turaga

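Category: Computer Vision | Machine Learning

2021-04-21

Differentiable simulations of optical systems can be combined with deep learning-based reconstruction networks to enable high-performance computational imaging via end-to-end (E2E) optimization of both the optical encoder and the deep decoder. This has enabled imaging applications such as 3D localization microscopy, depth estimation, and lensless photography via the optimization of local optical encoders. More challenging computational imaging applications, such as 3D snapshot microscopy, which compresses 3D volumes into single 2D images, require a highly non-local optical encoder. We show that existing deep network decoders have a locality bias that prevents the optimization of such highly non-local optical encoders. We address this with a decoder based on a shallow neural network architecture using global kernel Fourier convolutional neural networks (FourierNets). We show that FourierNets surpass existing deep network-based decoders at reconstructing photographs captured by the highly non-local DiffuserCam optical encoder. Further, we show that FourierNets enable E2E optimization of highly non-local optical encoders for 3D snapshot microscopy. By combining FourierNets with a large-scale multi-GPU differentiable optical simulation, we are able to optimize non-local optical encoders $170\times$ to $7372\times$ larger than the previous state of the art, and demonstrate the potential for ROI-type specific optical encoding with a programmable microscope.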


Deformation-Compensated Learning for Image Reconstruction without Ground Truth

Weijie Gan , Yu Sun , Cihat Eldeniz , Jiaming Liu , Hongyu An , Ulugbek S. Kamilov

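Category: Computer Vision

2021-07-12

Deep neural networks for medical image reconstruction are traditionally trained using high-quality ground-truth images as training targets. Recent work on Noise2Noise (N2N) has shown the potential of using multiple noisy measurements of the same object as an alternative to having a ground truth. However, existing N2N-based methods are not suitable for learning from measurements of an object undergoing nonrigid deformation. This paper addresses this issue by proposing the deformation-compensated learning (DecoLearn) method for training deep reconstruction networks by compensating for object deformations. A key component of DecoLearn is a deep registration module, which is jointly trained with the deep reconstruction network without any ground-truth supervision. We validate DecoLearn on both simulated and experimentally collected magnetic resonance imaging (MRI) data and show that it significantly improves imaging quality.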


Programmable Spectral Filter Arrays using Phase Spatial Light Modulator

Vishwanath Saragadam , Vijay Rengarajan , Ryuichi Tadano , Tuo Zhuang , Hideki Oyaizu , Jun Murayama , Aswin C. Sankaranarayanan

Category: Computer Vision

2021-09-29

Spatially varying spectral modulation can be implemented using a liquid crystal spatial light modulator (SLM) since it provides an array of liquid crystal cells, each of which can be purposed to act as a programmable spectral filter array. However, such an optical setup suffers from strong optical aberrations due to the unintended phase modulation, precluding spectral modulation at high spatial resolutions. In this work, we propose a novel computational approach for the practical implementation of phase SLMs for implementing spatially varying spectral filters. We provide a careful and systematic analysis of the aberrations arising out of phase SLMs for the purposes of spatially varying spectral modulation. The analysis naturally leads us to a set of "good patterns" that minimize the optical aberrations. We then train a deep network that overcomes any residual aberrations, thereby achieving ideal spectral modulation at high spatial resolution. We show a number of unique operating points with our prototype including dynamic spectral filtering, material classification, and single- and multi-image hyperspectral imaging.


Neural Fields in Visual Computing and Beyond

Yiheng Xie , Towaki Takikawa , Shunsuke Saito , Or Litany , Shiqin Yan , Numair Khan , Federico Tombari , James Tompkin , Vincent Sitzmann , Srinath Sridhar

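Category: Computer Vision | Machine Learning

2021-11-22

Recent advances in machine learning have created increasing interest in solving visual computing problems using a class of coordinate-based neural networks that parametrize physical properties of scenes or objects across space and time. These methods, which we call neural fields, have seen successful application in the synthesis of 3D shapes and images, animation of human bodies, 3D reconstruction, and pose estimation. However, due to rapid progress in a short time, many papers exist but a comprehensive review and formulation of the problem has not yet emerged. In this report, we address this limitation by providing context, mathematical grounding, and an extensive review of the literature on neural fields. This report covers research along two dimensions. In Part I, we focus on techniques in neural fields by identifying common components of neural field methods, including different representations, architectures, forward mapping, and generalization methods. In Part II, we focus on applications of neural fields to different problems in visual computing and beyond (e.g., robotics, audio). Our review shows the breadth of topics already covered in visual computing, both historically and in current incarnations, demonstrating the improved quality, flexibility, and capability brought by neural field methods. Finally, we present a companion website that contributes a living version of this review that can be continually updated by the community.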


Diffractive lensless imaging with optimized Voronoi-Fresnel phase

Qiang Fu , Dong-Ming Yan , Wolfgang Heidrich

Category: Computer Vision

2021-09-28

Lensless cameras are a class of imaging devices that shrink the physical dimensions to the very close vicinity of the image sensor by replacing conventional compound lenses with integrated flat optics and computational algorithms. Here we report a diffractive lensless camera with spatially-coded Voronoi-Fresnel phase to achieve superior image quality. We propose a design principle of maximizing the acquired information in optics to facilitate the computational reconstruction. By introducing an easy-to-optimize Fourier domain metric, Modulation Transfer Function volume (MTFv), which is related to the Strehl ratio, we devise an optimization framework to guide the optimization of the diffractive optical element. The resulting Voronoi-Fresnel phase features an irregular array of quasi-Centroidal Voronoi cells containing a base first-order Fresnel phase function. We demonstrate and verify the imaging performance for photography applications with a prototype Voronoi-Fresnel lensless camera on a 1.6-megapixel image sensor in various illumination conditions. Results show that the proposed design outperforms existing lensless cameras, and could benefit the development of compact imaging systems that work in extreme physical conditions.


Neural Implicit k-Space for Binning-free Non-Cartesian Cardiac MR Imaging

Wenqi Huang , Hongwei Li , Gastao Cruz , Jiazhen Pan , Daniel Rueckert , Kerstin Hammernik

Category: Computer Vision | Machine Learning

2022-12-16

In this work, we propose a novel image reconstruction framework that directly learns a neural implicit representation in k-space for ECG-triggered non-Cartesian Cardiac Magnetic Resonance Imaging (CMR). While existing methods bin acquired data from neighboring time points to reconstruct one phase of the cardiac motion, our framework allows for a continuous, binning-free, and subject-specific k-space representation. We assign a unique coordinate that consists of time, coil index, and frequency domain location to each sampled k-space point. We then learn the subject-specific mapping from these unique coordinates to k-space intensities using a multi-layer perceptron with frequency domain regularization. During inference, we obtain a complete k-space for Cartesian coordinates and an arbitrary temporal resolution. A simple inverse Fourier transform recovers the image, eliminating the need for density compensation and costly non-uniform Fourier transforms for non-Cartesian data. This novel imaging framework was tested on 42 radially sampled datasets from 6 subjects. The proposed method outperforms other techniques qualitatively and quantitatively using data from four and one heartbeat(s) and 30 cardiac phases. Our results for one heartbeat reconstruction of 50 cardiac phases show improved artifact removal and spatio-temporal resolution, leveraging the potential for real-time CMR.


Differentiable Microscopy Designs an All Optical Quantitative Phase Microscope

Kithmini Herath , Udith Haputhanthri , Ramith Hettiarachchi , Hasindu Kariyawasam , Raja N. Ahmad , Azeem Ahmad , Balpreet S. Ahluwalia , Chamira U. S. Edussooriya , Dushan Wadduwage

Category: Computer Vision

2022-03-28

Ever since the first microscope by Zacharias Janssen in the late 16th century, scientists have been inventing new types of microscopes for various tasks. Inventing a novel architecture demands years, if not decades, worth of scientific experience and creativity. In this work, we introduce Differentiable Microscopy ($\partial\mu$), a deep learning-based design paradigm, to help scientists design new interpretable microscope architectures. Differentiable microscopy first models a common physics-based optical system, but with trainable optical elements at key locations on the optical path. Using pre-acquired data, we then train the model end-to-end for a task of interest. The learnt design proposal can then be simplified by interpreting the learnt optical elements. As a first demonstration, based on the optical 4-$f$ system, we present an all-optical quantitative phase microscope (QPM) design that requires no computational post-reconstruction. A follow-up literature survey suggested that the learnt architecture is similar to the generalized phase contrast method developed two decades ago. Our extensive experiments on multiple datasets that include biological samples show that our learnt all-optical QPM designs consistently outperform existing methods. We experimentally verify the functionality of the optical 4-$f$ system based QPM design using a spatial light modulator. Furthermore, we also demonstrate that similar results can be achieved by an uninterpretable learning-based method, namely diffractive deep neural networks (D2NN). The proposed differentiable microscopy framework supplements the creative process of designing new optical systems and would perhaps lead to unconventional but better optical designs.


PS$^2$F: Polarized Spiral Point Spread Function for Single-Shot 3D Sensing

Bhargav Ghanekar , Vishwanath Saragadam , Dushyant Mehra , Anna-Karin Gustavsson , Aswin Sankaranarayanan , Ashok Veeraraghavan

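Category: Computer Vision

2022-07-03

We propose a compact snapshot monocular depth estimation technique that relies on an engineered point spread function (PSF). Traditional approaches used in microscopic super-resolution imaging, such as the Double-Helix PSF (DHPSF), are ill-suited for scenes that are more complex than a sparse set of point light sources. We show, using the Cramér-Rao lower bound (CRLB), that separating the two lobes of the DHPSF, and thereby capturing two separate images, leads to a dramatic increase in depth accuracy. A unique property of the phase mask used for generating the DHPSF is that separating the phase mask into two halves leads to a spatial separation of the two lobes. We leverage this property to build a compact polarization-based optical setup, in which we place two orthogonal linear polarizers on each half of the DHPSF phase mask and then capture the resulting image with a polarization-sensitive camera. Results from simulations and a lab prototype demonstrate that our technique achieves up to $50\%$ lower depth error compared to state-of-the-art designs including the DHPSF and the Tetrapod PSF, with little to no loss in spatial resolution.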


Advances in Neural Rendering

Ayush Tewari , Justus Thies , Ben Mildenhall , Pratul Srinivasan , Edgar Tretschk , Yifan Wang , Christoph Lassner , Vincent Sitzmann , Ricardo Martin-Brualla , Stephen Lombardi

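Category: Computer Vision

2021-11-10

Synthesizing photo-realistic images and videos is at the heart of computer graphics and has been the focus of decades of research. Traditionally, synthetic images of a scene are generated using rendering algorithms such as rasterization or ray tracing, which take representations of geometry and material properties as input. Collectively, these inputs define the actual scene and what is rendered, and are referred to as the scene representation (where a scene consists of one or more objects). Example scene representations are triangle meshes with accompanying textures (e.g., created by an artist), point clouds (e.g., from a depth sensor), volumetric grids (e.g., from a CT scan), or implicit surface functions (e.g., truncated signed distance fields). The reconstruction of such a scene representation from observations using differentiable rendering losses is known as inverse graphics or inverse rendering. Neural rendering is closely related, and combines ideas from classical computer graphics and machine learning to create algorithms for synthesizing images from real-world observations. Neural rendering is a leap forward towards the goal of synthesizing photo-realistic image and video content. In recent years, we have seen immense progress in this field through hundreds of publications that show different ways to inject learnable components into the rendering pipeline. This state-of-the-art report on advances in neural rendering focuses on methods that combine classical rendering principles with learned 3D scene representations, now often referred to as neural scene representations. A key advantage of these methods is that they are 3D-consistent by design, enabling applications such as novel viewpoint synthesis of a captured scene. In addition to methods that handle static scenes, we cover neural scene representations for modeling non-rigidly deforming objects...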


Spatiotemporal implicit neural representation for unsupervised dynamic MRI reconstruction

Jie Feng , Ruimin Feng , Qing Wu , Zhiyong Zhang , Yuyao Zhang , Hongjiang Wei

Category: Computer Vision

2022-12-31

Supervised Deep-Learning (DL)-based reconstruction algorithms have shown state-of-the-art results for highly-undersampled dynamic Magnetic Resonance Imaging (MRI) reconstruction. However, the requirement of excessive high-quality ground-truth data hinders their applications due to the generalization problem. Recently, Implicit Neural Representation (INR) has appeared as a powerful DL-based tool for solving the inverse problem by characterizing the attributes of a signal as a continuous function of corresponding coordinates in an unsupervised manner. In this work, we propose an INR-based method to improve dynamic MRI reconstruction from highly undersampled k-space data, which only takes spatiotemporal coordinates as inputs. Specifically, the proposed INR represents the dynamic MRI images as an implicit function and encodes them into neural networks. The weights of the network are learned from sparsely-acquired (k, t)-space data itself only, without external training datasets or prior images. Benefiting from the strong implicit continuity regularization of INR together with explicit regularization for low-rankness and sparsity, our proposed method outperforms the compared scan-specific methods at various acceleration factors. E.g., experiments on retrospective cardiac cine datasets show an improvement of 5.5 ~ 7.1 dB in PSNR for extremely high accelerations (up to 41.6-fold). The high quality and inner continuity of the images provided by INR have great potential to further improve the spatiotemporal resolution of dynamic MRI, without the need for any training data.


Projection-Domain Self-Supervision for Volumetric Helical CT Reconstruction

Onni Kosomaa , Samuli Laine , Tero Karras , Miika Aittala , Jaakko Lehtinen

Category: Computer Vision | Machine Learning | Neural and Evolutionary Computing

2022-12-14

We propose a deep learning method for three-dimensional reconstruction in low-dose helical cone-beam computed tomography. We reconstruct the volume directly, i.e., not from 2D slices, guaranteeing consistency along all axes. In a crucial step beyond prior work, we train our model in a self-supervised manner in the projection domain using noisy 2D projection data, without relying on 3D reference data or the output of a reference reconstruction method. This means the fidelity of our results is not limited by the quality and availability of such data. We evaluate our method on real helical cone-beam projections and simulated phantoms. Our reconstructions are sharper and less noisy than those of previous methods, and several decibels better in quantitative PSNR measurements. When applied to full-dose data, our method produces high-quality results orders of magnitude faster than iterative techniques.


Physics-informed neural networks for diffraction tomography

Amirhossein Saba , Carlo Gigli , Ahmed B. Ayoub , Demetri Psaltis

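Category: Artificial Intelligence

2022-07-28

We propose a physics-informed neural network as the forward model for tomographic reconstructions of biological samples. We demonstrate that by training this network with the Helmholtz equation as a physical loss, we can predict the scattered field accurately. We show that a pretrained network can be fine-tuned for different samples and used to solve the scattering problem much faster than other numerical solutions. We evaluate our methodology with numerical and experimental results. Our physics-informed neural networks can be generalized to any forward and inverse scattering problem.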
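
To illustrate what using the Helmholtz equation as a physical loss can mean, here is a minimal sketch under strong simplifying assumptions: a real-valued 1D field, a uniform grid, and a second-order finite-difference Laplacian. The function name `helmholtz_residual_loss` and all parameters are hypothetical, and the paper's actual loss (3D, complex-valued, evaluated on network outputs) is not reproduced here.

```python
import math


def helmholtz_residual_loss(u, n, k0, dx):
    """Mean squared residual of u'' + (k0 * n)^2 * u = 0 on interior grid points."""
    total = 0.0
    count = 0
    for i in range(1, len(u) - 1):
        # Second-order central difference approximation of the second derivative.
        laplacian = (u[i - 1] - 2.0 * u[i] + u[i + 1]) / dx ** 2
        residual = laplacian + (k0 * n[i]) ** 2 * u[i]
        total += residual * residual
        count += 1
    return total / count


k0 = 2.0 * math.pi                      # wavenumber in the background medium
dx = 1e-3                               # grid spacing
xs = [i * dx for i in range(1001)]
n = [1.0] * len(xs)                     # homogeneous refractive index

good = [math.sin(k0 * x) for x in xs]        # satisfies the equation
bad = [math.sin(2.0 * k0 * x) for x in xs]   # wrong wavenumber, violates it

loss_good = helmholtz_residual_loss(good, n, k0, dx)
loss_bad = helmholtz_residual_loss(bad, n, k0, dx)
print(loss_good, loss_bad)
```

A field that satisfies the equation yields a residual close to zero (limited only by discretization error), while a field with the wrong wavenumber is penalized heavily; minimizing such a residual is what steers a network toward physically consistent predictions.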


Self-Supervised Coordinate Projection Network for Sparse-View Computed Tomography

Qing Wu , Ruimin Feng , Hongjiang Wei , Jingyi Yu , Yuyao Zhang

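Category: Computer Vision | Machine Learning

2022-09-12

In the present work, we propose a Self-supervised COordinate Projection nEtwork (SCOPE) to reconstruct an artifact-free CT image from a single sparse-view (SV) sinogram by solving the inverse tomography imaging problem. Compared with recent related works that solve similar problems using implicit neural representation (INR) networks, our essential contribution is an effective and simple re-projection strategy that pushes the tomography image reconstruction quality beyond that of supervised deep learning CT reconstruction works. The proposed strategy is inspired by the simple relationship between linear algebra and inverse problems. To solve the underdetermined linear equation system, we first introduce INR to constrain the solution space via an image continuity prior and achieve a rough solution. Second, we propose to generate a dense-view sinogram that improves the rank of the linear equation system and produces a more stable CT image solution space. Our experimental results demonstrate that the re-projection strategy significantly improves the image reconstruction quality (at least +3 dB in PSNR). In addition, we integrate the recent hash encoding into our SCOPE model, which greatly accelerates model training. Finally, we evaluate SCOPE on parallel and fan X-ray beam SVCT reconstruction tasks. Experimental results indicate that the proposed SCOPE model outperforms two INR-based methods and two popular supervised DL methods.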


NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Ben Mildenhall , Pratul P. Srinivasan , Matthew Tancik , Jonathan T. Barron , Ravi Ramamoorthi , Ren Ng

Category:

2020-03-19

We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully-connected (nonconvolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x, y, z) and viewing direction (θ, φ)) and whose output is the volume density and view-dependent emitted radiance at that spatial location. We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses. We describe how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrate results that outperform prior work on neural rendering and view synthesis. View synthesis results are best viewed as videos, so we urge readers to view our supplementary video for convincing comparisons.
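
The step of projecting output colors and densities into an image with classic volume rendering can be sketched as front-to-back alpha compositing along a single ray. This is a simplified illustration in plain Python, assuming the densities and colors have already been sampled; the function name `composite_ray` and the sample values are hypothetical, and ray sampling and the network itself are omitted.

```python
import math


def composite_ray(densities, colors, deltas):
    """Alpha-composite samples along one ray, ordered front to back.

    densities: volume density at each sample
    colors:    (r, g, b) at each sample
    deltas:    distance between adjacent samples
    Returns the accumulated color and the per-sample weights.
    """
    transmittance = 1.0          # fraction of light that has not been absorbed yet
    rgb = [0.0, 0.0, 0.0]
    weights = []
    for sigma, color, delta in zip(densities, colors, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)   # opacity of this segment
        weight = transmittance * alpha
        weights.append(weight)
        rgb = [acc + weight * c for acc, c in zip(rgb, color)]
        transmittance *= 1.0 - alpha
    return rgb, weights


# Two samples: a thin red layer in front of a denser green layer.
pixel, sample_weights = composite_ray(
    densities=[0.5, 4.0],
    colors=[(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)],
    deltas=[0.5, 0.5],
)
print(pixel, sample_weights)
```

Every operation here is differentiable with respect to the densities and colors, which is why a set of posed images is enough to optimize the underlying scene function.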


Unsupervised Deep Learning Methods for Biological Image Reconstruction and Enhancement

Mehmet Akçakaya , Burhaneddin Yaman , Hyungjin Chung , Jong Chul Ye

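Category: Computer Vision | Machine Learning

2021-05-17

Recently, deep learning approaches have become the main research frontier for biological image reconstruction and enhancement problems thanks to their high performance and ultra-fast inference times. However, due to the difficulty of obtaining matched reference data for supervised learning, there has been increasing interest in unsupervised learning approaches that do not need paired reference data. In particular, self-supervised learning and generative models have been successfully used for various biological imaging applications. In this paper, we give an overview of these approaches from a coherent perspective in the context of classical inverse problems, and discuss their applications to biological imaging, including electron, fluorescence and deconvolution microscopy, optical diffraction tomography, and functional neuroimaging.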


SiSPRNet: End-to-End Learning for Single-Shot Phase Retrieval

Qiuliang Ye , Li-Wen Wang , Daniel P. K. Lun

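Category: Computer Vision | Machine Learning

2022-05-23

With the success of deep learning methods in many image processing tasks, deep learning approaches have also recently been introduced to the phase retrieval problem. These approaches differ from traditional iterative optimization methods in that they usually require only one intensity measurement and can reconstruct phase images in real time. However, because of the tremendous domain discrepancy, the quality of the reconstructed images given by these approaches still has much room for improvement to meet general application requirements. In this paper, we design a novel deep neural network structure named SiSPRNet for phase retrieval based on a single Fourier intensity measurement. To effectively utilize the spectral information of the measurements, we propose a new feature extraction unit using a multi-layer perceptron (MLP) as the front end. It allows all pixels of the input intensity image to be considered together for exploring their global representation. The size of the MLP is carefully designed to facilitate the extraction of representative features while reducing noise and outliers. A dropout layer is also included to mitigate possible overfitting when training the MLP. To promote global correlation in the reconstructed images, a self-attention mechanism is introduced into the up-sampling and reconstruction (UR) blocks of the proposed SiSPRNet. These UR blocks are inserted into a residual learning structure to prevent the weak information flow and vanishing gradient problems caused by their complex layer structure. The proposed model is extensively evaluated using different testing datasets of phase-only images and images with linearly related magnitude and phase. Experiments were conducted on an optical experimentation platform to understand the performance of different deep learning methods when working in a practical environment.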


Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Erez Yosef , Shay Elmalem , Raja Giryes

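Category: Computer Vision

2021-12-28

Video reconstruction from a single motion-blurred image is a challenging problem whose solution can enhance the capabilities of existing cameras. Recently, several works have addressed this task using conventional imaging and deep learning. Yet such purely digital methods are inherently limited, due to direction ambiguity and noise sensitivity. Some works have proposed to address these limitations using non-conventional image sensors; however, such sensors are extremely rare and expensive. To circumvent these limitations with simpler means, we propose a hybrid optical-digital method for video reconstruction that requires only simple modifications to existing optical systems. During image acquisition, we use a learned dynamic phase coding in the lens aperture to encode the motion trajectories, which serve as prior information for the video reconstruction process. Using an image-to-video convolutional neural network, the proposed computational camera generates a sharp frame burst of the scene at various frame rates from a single coded motion-blurred image. We present advantages and improved performance compared to existing methods, using both simulations and a real-world camera prototype.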


Imaging dynamics beneath turbid media via parallelized single-photon detection

Shiqi Xu , Xi Yang , Wenhui Liu , Joakim Jonsson , Ruobing Qian , Pavan Chandra Konda , Kevin C. Zhou , Lucas Kreiss , Qionghai Dai , Haoqian Wang

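Category: Computer Vision

2021-07-03

Noninvasive optical imaging through dynamic scattering media has numerous important biomedical applications but remains a challenging task. While standard diffuse imaging methods measure optical absorption or fluorescent emission, it is also well established that the temporal correlation of scattered coherent light diffuses through tissue much like optical intensity. Few works to date, however, have aimed to experimentally measure and process such temporal correlation data to demonstrate deep-tissue video reconstruction of decorrelation dynamics. In this work, we utilize a single-photon avalanche diode (SPAD) array camera to simultaneously monitor the temporal dynamics of speckle fluctuations at the single-photon level from 12 different locations on a phantom tissue, delivered via a customized fiber bundle array. We then apply a deep neural network to convert the acquired single-photon measurements into video of scattering dynamics beneath rapidly decorrelating tissue phantoms. We demonstrate the ability to reconstruct images of transient (0.1-0.4 s) dynamic events occurring beneath a decorrelating tissue phantom with millimeter-scale resolution, and highlight how our model can flexibly extend to monitor flow speed within buried phantom vessels.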
