Smart Paper Notes

Necessary and sufficient graphical conditions for optimal adjustment sets in causal graphical models with hidden variables

Jakob Runge

Category: Machine Learning

2021-02-20

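This paper addresses the problem of selecting optimal backdoor adjustment sets for estimating causal effects in graphical models with hidden and conditioned variables. Previous work defined optimality as achieving the smallest asymptotic estimation variance and derived an optimal set for the case without hidden variables. With hidden variables, there can be settings in which no optimal set exists, and so far only a sufficient graphical optimality criterion of limited applicability has been derived. In the present work, optimality is characterized as maximizing a certain adjustment information, which makes it possible to derive a necessary and sufficient graphical criterion for the existence of an optimal adjustment set, together with a definition and an algorithm to construct it. Further, the optimal set is valid if and only if a valid adjustment set exists, and for any graph it has higher (or equal) adjustment information than the adjustment set proposed by Perković et al. [Journal of Machine Learning Research, 18: 1--62, 2018]. The results translate to minimal asymptotic estimation variance for a class of estimators whose asymptotic variance follows a certain information-theoretic relation. Numerical experiments indicate that the asymptotic results also hold for relatively small sample sizes, and that the optimal adjustment set or minimized variants of it often yield better variance beyond that estimator class as well. Surprisingly, more than 90% of the randomly created setups fulfill the optimality conditions, indicating that graphical optimality may also hold in many real-world scenarios. Code is available as part of the Python package https://github.com/jakobrunge/tigramite.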
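
As a rough illustration of what a backdoor adjustment set does, the sketch below simulates a linear model with one observed confounder and compares the estimated effect with and without adjustment. The graph (Z → X, Z → Y, X → Y), the coefficients, and every function name are assumptions made for this example only. It does not implement the optimality criterion from the paper and does not use the tigramite API.

```python
import random


def cov(a, b):
    """Sample covariance of two equally long sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((u - ma) * (v - mb) for u, v in zip(a, b)) / (len(a) - 1)


def simulate(n, beta, seed=0):
    """Draw n samples from the assumed model Z -> X, Z -> Y, X -> Y."""
    rng = random.Random(seed)
    z = [rng.gauss(0, 1) for _ in range(n)]
    x = [zi + rng.gauss(0, 1) for zi in z]
    y = [beta * xi + 2.0 * zi + rng.gauss(0, 1) for xi, zi in zip(x, z)]
    return x, y, z


def effect_unadjusted(x, y):
    """Slope of Y regressed on X alone (empty adjustment set)."""
    return cov(x, y) / cov(x, x)


def effect_adjusted(x, y, z):
    """Coefficient of X when Y is regressed on X and Z (adjustment set {Z})."""
    sxx, szz, sxz = cov(x, x), cov(z, z), cov(x, z)
    sxy, szy = cov(x, y), cov(z, y)
    return (szz * sxy - sxz * szy) / (sxx * szz - sxz ** 2)


beta = 0.5  # true causal effect of X on Y in the simulated model
x, y, z = simulate(n=5000, beta=beta)
naive = effect_unadjusted(x, y)    # biased by the open backdoor path via Z
adjusted = effect_adjusted(x, y, z)  # close to beta
print(f"unadjusted: {naive:.3f}, adjusted for Z: {adjusted:.3f}")
```

In this toy model the unadjusted slope converges to beta + 1 because of the confounding path through Z, while adjusting for Z recovers beta. When several valid sets exist, they can differ in estimation variance, which is the question the abstract is concerned with.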

translated by Google Translate

Discovering contemporaneous and lagged causal relations in autocorrelated nonlinear time series datasets

Jakob Runge

Categories: Machine Learning | Machine Learning (Statistics)

2020-03-07

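This paper introduces a novel conditional independence (CI) based method for linear and nonlinear, lagged and contemporaneous causal discovery from time series in the causally sufficient case. Existing CI-based methods such as the PC algorithm, as well as common methods from other frameworks, suffer from low recall and partially inflated false positives under strong autocorrelation, which is a ubiquitous challenge in time series. The novel method, PCMCI$^+$, extends PCMCI [Runge et al., 2019b] to include the discovery of contemporaneous links. PCMCI$^+$ improves the reliability of CI tests by optimizing the choice of conditioning sets and even benefits from autocorrelation. The method is order-independent and consistent in the oracle case. Extensive numerical experiments show that, compared to other methods, PCMCI$^+$ has higher adjacency detection power and especially higher contemporaneous orientation recall, while better controlling false positives. The optimized conditioning sets also lead to shorter runtimes than the PC algorithm. PCMCI$^+$ can be of considerable use in many real-world application scenarios where the time resolution is often too coarse to resolve time delays and strong autocorrelation is present.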

ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports

Katharina Jeblick , Balthasar Schachtner , Jakob Dexl , Andreas Mittermeier , Anna Theresa Stüber , Johanna Topalis , Tobias Weber , Philipp Wesp , Bastian Sabel , Jens Ricke

Categories: Natural Language Processing | Machine Learning

2022-12-30

The release of ChatGPT, a language model capable of generating text that appears human-like and authentic, has gained significant attention beyond the research community. We expect that the convincing performance of ChatGPT incentivizes users to apply it to a variety of downstream tasks, including prompting the model to simplify their own medical reports. To investigate this phenomenon, we conducted an exploratory case study. In a questionnaire, we asked 15 radiologists to assess the quality of radiology reports simplified by ChatGPT. Most radiologists agreed that the simplified reports were factually correct, complete, and not potentially harmful to the patient. Nevertheless, instances of incorrect statements, missed key medical findings, and potentially harmful passages were reported. While further studies are needed, the initial insights of this study indicate a great potential in using large language models like ChatGPT to improve patient-centered care in radiology and other medical domains.

High-fidelity Direct Contrast Synthesis from Magnetic Resonance Fingerprinting

Ke Wang , Mariya Doneva , Jakob Meineke , Thomas Amthor , Ekin Karasan , Fei Tan , Jonathan I. Tamir , Stella X. Yu , Michael Lustig

Category: Computer Vision

2022-12-21

Magnetic Resonance Fingerprinting (MRF) is an efficient quantitative MRI technique that can extract important tissue and system parameters such as T1, T2, B0, and B1 from a single scan. This property also makes it attractive for retrospectively synthesizing contrast-weighted images. In general, contrast-weighted images like T1-weighted, T2-weighted, etc., can be synthesized directly from parameter maps through spin-dynamics simulation (i.e., Bloch or Extended Phase Graph models). However, these approaches often exhibit artifacts due to imperfections in the mapping, the sequence modeling, and the data acquisition. Here we propose a supervised learning-based method that directly synthesizes contrast-weighted images from the MRF data without going through the quantitative mapping and spin-dynamics simulation. To implement our direct contrast synthesis (DCS) method, we deploy a conditional Generative Adversarial Network (GAN) framework and propose a multi-branch U-Net as the generator. The input MRF data are used to directly synthesize T1-weighted, T2-weighted, and fluid-attenuated inversion recovery (FLAIR) images through supervised training on paired MRF and target spin echo-based contrast-weighted scans. In-vivo experiments demonstrate excellent image quality compared to simulation-based contrast synthesis and previous DCS methods, both visually as well as by quantitative metrics. We also demonstrate cases where our trained model is able to mitigate in-flow and spiral off-resonance artifacts that are typically seen in MRF reconstructions and thus more faithfully represent conventional spin echo-based contrast-weighted images.

Medical Diagnosis with Large Scale Multimodal Transformers -- Leveraging Diverse Data for More Accurate Diagnosis

Firas Khader , Gustav Mueller-Franzes , Tianci Wang , Tianyu Han , Soroosh Tayebi Arasteh , Christoph Haarburger , Johannes Stegmaier , Keno Bressem , Christiane Kuhl , Sven Nebelung

Categories: Machine Learning | Artificial Intelligence

2022-12-18

Multimodal deep learning has been used to predict clinical endpoints and diagnoses from clinical routine data. However, these models suffer from scaling issues: they have to learn pairwise interactions between each piece of information in each data type, thereby escalating model complexity beyond manageable scales. This has so far precluded a widespread use of multimodal deep learning. Here, we present a new technical approach of "learnable synergies", in which the model only selects relevant interactions between data modalities and keeps an "internal memory" of relevant data. Our approach is easily scalable and naturally adapts to multimodal data inputs from clinical routine. We demonstrate this approach on three large multimodal datasets from radiology and ophthalmology and show that it outperforms state-of-the-art models in clinically relevant diagnosis tasks. Our new approach is transferable and will allow the application of multimodal deep learning to a broad set of clinically relevant problems.

DUIDD: Deep-Unfolded Interleaved Detection and Decoding for MIMO Wireless Systems

Reinhard Wiesmayr , Chris Dick , Jakob Hoydis , Christoph Studer

Category: Machine Learning

2022-12-15

Iterative detection and decoding (IDD) is known to achieve near-capacity performance in multi-antenna wireless systems. We propose deep-unfolded interleaved detection and decoding (DUIDD), a new paradigm that reduces the complexity of IDD while achieving even lower error rates. DUIDD interleaves the inner stages of the data detector and channel decoder, which expedites convergence and reduces complexity. Furthermore, DUIDD applies deep unfolding to automatically optimize algorithmic hyperparameters, soft-information exchange, message damping, and state forwarding. We demonstrate the efficacy of DUIDD using NVIDIA's Sionna link-level simulator in a 5G-near multi-user MIMO-OFDM wireless system with a novel low-complexity soft-input soft-output data detector, an optimized low-density parity-check decoder, and channel vectors from a commercial ray-tracer. Our results show that DUIDD outperforms classical IDD both in terms of block error rate and computational complexity.

Diffusion Probabilistic Models beat GANs on Medical Images

Gustav Müller-Franzes , Jan Moritz Niehues , Firas Khader , Soroosh Tayebi Arasteh , Christoph Haarburger , Christiane Kuhl , Tianci Wang , Tianyu Han , Sven Nebelung , Jakob Nikolas Kather

Category: Computer Vision

2022-12-14

The success of Deep Learning applications critically depends on the quality and scale of the underlying training data. Generative adversarial networks (GANs) can generate arbitrarily large datasets, but their diversity and fidelity are limited. This limitation has recently been addressed by denoising diffusion probabilistic models (DDPMs), whose superiority has been demonstrated on natural images. In this study, we propose Medfusion, a conditional latent DDPM for medical images. We compare our DDPM-based model against GAN-based models, which constitute the current state-of-the-art in the medical domain. Medfusion was trained and compared with (i) StyleGan-3 on n=101,442 images from the AIROGS challenge dataset to generate fundoscopies with and without glaucoma, (ii) ProGAN on n=191,027 images from the CheXpert dataset to generate radiographs with and without cardiomegaly, and (iii) wGAN on n=19,557 images from the CRCMS dataset to generate histopathological images with and without microsatellite stability. In the AIROGS, CRCMS, and CheXpert datasets, Medfusion achieved lower (=better) FID than the GANs (11.63 versus 20.43, 30.03 versus 49.26, and 17.28 versus 84.31). Also, fidelity (precision) and diversity (recall) were higher (=better) for Medfusion in all three datasets. Our study shows that DDPMs are a superior alternative to GANs for image synthesis in the medical domain.

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Benjamin Ellis , Skander Moalla , Mikayel Samvelyan , Mingfei Sun , Anuj Mahajan , Jakob N. Foerster , Shimon Whiteson

Category: Machine Learning

2022-12-14

The availability of challenging benchmarks has played a key role in the recent progress of machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi-Agent Challenge (SMAC) has become a popular testbed for centralised training with decentralised execution. However, after years of sustained improvement on SMAC, algorithms now achieve near-perfect performance. In this work, we conduct new analysis demonstrating that SMAC is not sufficiently stochastic to require complex closed-loop policies. In particular, we show that an open-loop policy conditioned only on the timestep can achieve non-trivial win rates for many SMAC scenarios. To address this limitation, we introduce SMACv2, a new version of the benchmark where scenarios are procedurally generated and require agents to generalise to previously unseen settings (from the same distribution) during evaluation. We show that these changes ensure the benchmark requires the use of closed-loop policies. We evaluate state-of-the-art algorithms on SMACv2 and show that it presents significant challenges not present in the original benchmark. Our analysis illustrates that SMACv2 addresses the discovered deficiencies of SMAC and can help benchmark the next generation of MARL methods. Videos of training are available at https://sites.google.com/view/smacv2

Image Compression with Product Quantized Masked Image Modeling

Alaaeldin El-Nouby , Matthew J. Muckley , Karen Ullrich , Ivan Laptev , Jakob Verbeek , Hervé Jégou

Category: Computer Vision

2022-12-14

Recent neural compression methods have been based on the popular hyperprior framework. It relies on Scalar Quantization and offers a very strong compression performance. This contrasts with recent advances in image generation and representation learning, where Vector Quantization is more commonly employed. In this work, we attempt to bring these lines of research closer by revisiting vector quantization for image compression. We build upon the VQ-VAE framework and introduce several modifications. First, we replace the vanilla vector quantizer with a product quantizer. This intermediate solution between vector and scalar quantization allows for a much wider set of rate-distortion points: it implicitly defines high-quality quantizers that would otherwise require intractably large codebooks. Second, inspired by the success of Masked Image Modeling (MIM) in the context of self-supervised learning and generative image models, we propose a novel conditional entropy model which improves entropy coding by modelling the co-dependencies of the quantized latent codes. The resulting PQ-MIM model is surprisingly effective: its compression performance is on par with recent hyperprior methods. It also outperforms HiFiC in terms of FID and KID metrics when optimized with perceptual losses (e.g. adversarial). Finally, since PQ-MIM is compatible with image generation frameworks, we show qualitatively that it can operate under a hybrid mode between compression and generation, with no further training or finetuning. As a result, we explore the extreme compression regime where an image is compressed into 200 bytes, i.e., less than a tweet.
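
To make the remark about "intractably large codebooks" concrete, here is a minimal sketch of generic product quantization. A vector is split into M sub-vectors, each quantized with its own small codebook of K codewords, so the quantizer implicitly spans K^M combinations while storing only M·K codewords. The function names and the toy codebooks are hypothetical, and this is not the PQ-MIM model or its entropy coder.

```python
from typing import List, Sequence

Codebook = List[List[float]]  # K codewords, each of sub-vector length d / M


def nearest(sub: Sequence[float], codebook: Codebook) -> int:
    """Index of the codeword with the smallest squared Euclidean distance."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((a - b) ** 2 for a, b in zip(sub, codebook[k])),
    )


def pq_encode(vec: Sequence[float], codebooks: List[Codebook]) -> List[int]:
    """Quantize each of the M sub-vectors independently; return M indices."""
    m = len(codebooks)
    d_sub = len(vec) // m
    return [
        nearest(vec[i * d_sub:(i + 1) * d_sub], codebooks[i]) for i in range(m)
    ]


def pq_decode(codes: Sequence[int], codebooks: List[Codebook]) -> List[float]:
    """Concatenate the selected codewords to reconstruct the vector."""
    out: List[float] = []
    for idx, codebook in zip(codes, codebooks):
        out.extend(codebook[idx])
    return out


# Toy setup (assumed values): M = 2 sub-quantizers with K = 2 codewords each.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
vec = [0.9, 0.8, 0.1, 0.7]
codes = pq_encode(vec, codebooks)
recon = pq_decode(codes, codebooks)
effective_codewords = len(codebooks[0]) ** len(codebooks)  # K ** M
print(codes, recon, effective_codewords)
```

With M = 1 this reduces to plain vector quantization, and with one sub-quantizer per dimension it behaves like scalar quantization, which is the sense in which a product quantizer sits between the two.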

Co-training $2^L$ Submodels for Visual Recognition

Hugo Touvron , Matthieu Cord , Maxime Oquab , Piotr Bojanowski , Jakob Verbeek , Hervé Jégou

Category: Computer Vision

2022-12-09

We introduce submodel co-training, a regularization method related to co-training, self-distillation and stochastic depth. Given a neural network to be trained, for each sample we implicitly instantiate two altered networks, "submodels", with stochastic depth: we activate only a subset of the layers. Each network serves as a soft teacher to the other, by providing a loss that complements the regular loss provided by the one-hot label. Our approach, dubbed cosub, uses a single set of weights, and does not involve a pre-trained external model or temporal averaging. Experimentally, we show that submodel co-training is effective for training backbones for recognition tasks such as image classification and semantic segmentation. Our approach is compatible with multiple architectures, including RegNet, ViT, PiT, XCiT, Swin and ConvNext. Our training strategy improves their results in comparable settings. For instance, a ViT-B pretrained with cosub on ImageNet-21k obtains 87.4% top-1 accuracy @448 on ImageNet-val.
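
The idea of two stochastic-depth submodels teaching each other can be sketched in a few lines of plain Python. Everything below is a toy under stated assumptions: the "layers" are fixed offset vectors on a residual path rather than learned blocks, and the keep probability, the 0.5 loss weight, and the cross-entropy form of the teacher term are illustrative choices, not values taken from the paper. In a real training loop the teacher distribution would typically be treated as a constant when differentiating; this sketch has no gradients at all.

```python
import math
import random
from typing import List


def softmax(logits: List[float]) -> List[float]:
    """Numerically stable softmax."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(target: List[float], pred: List[float]) -> float:
    """Cross-entropy between a target distribution and a prediction."""
    return -sum(t * math.log(p) for t, p in zip(target, pred))


def sample_mask(num_layers: int, keep_prob: float, rng: random.Random) -> List[bool]:
    """Stochastic depth: keep each layer independently with keep_prob."""
    return [rng.random() < keep_prob for _ in range(num_layers)]


def forward(x: List[float], layers: List[List[float]], mask: List[bool]) -> List[float]:
    """Toy residual network: a kept layer adds its offset, a dropped one is skipped."""
    h = list(x)
    for offset, keep in zip(layers, mask):
        if keep:
            h = [a + b for a, b in zip(h, offset)]
    return softmax(h)


def cosub_loss(x, onehot, layers, rng, keep_prob=0.5, weight=0.5):
    """Label loss plus a soft-teacher loss from the other submodel (assumed form)."""
    p_a = forward(x, layers, sample_mask(len(layers), keep_prob, rng))
    p_b = forward(x, layers, sample_mask(len(layers), keep_prob, rng))
    loss_a = (1 - weight) * cross_entropy(onehot, p_a) + weight * cross_entropy(p_b, p_a)
    loss_b = (1 - weight) * cross_entropy(onehot, p_b) + weight * cross_entropy(p_a, p_b)
    return loss_a + loss_b


layers = [[0.5, -0.5], [1.0, 0.0], [-0.2, 0.3]]  # L = 3 toy layers, shared weights
num_submodels = 2 ** len(layers)  # each layer is either kept or dropped
loss = cosub_loss([0.1, 0.2], [1.0, 0.0], layers, random.Random(0))
print(num_submodels, round(loss, 4))
```

Because each of the L layers is independently kept or dropped, a single set of weights defines 2^L possible submodels, which is where the $2^L$ in the title comes from.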
