智能论文笔记

Spatio-Temporal Variational Gaussian Processes

Oliver Hamelijnck , William J. Wilkinson , Niki A. Loppi , Arno Solin , Theodoros Damoulas

分类：机器学习 | (统计)机器学习

2021-11-02

我们介绍了一种可扩展的方法来实现高斯工艺推断，它将时空滤波与自然梯度变化推断相结合，导致用于多变量数据的非共轭GP方法，其相对于时间线性缩放。我们的自然梯度方法可以应用并行滤波和平滑，进一步降低时间跨度复杂性在时间步长的对数。我们得出了稀疏近似，该稀疏近似值在减少的空间诱导点上构造一个状态空间模型，并且显示用于可分离的马尔可夫内核，完整和稀疏的情况完全恢复标准变分GP，同时表现出有利的计算特性。为了进一步改善空间缩放，我们提出了一种平均场景假设空间位置之间的独立性，当与稀疏性和平行化连接时，这导致了大规模的时空问题的有效和准确的方法。

translated by 谷歌翻译

Bayes-Newton Methods for Approximate Bayesian Inference with PSD Guarantees

William J. Wilkinson , Simo Särkkä , Arno Solin

分类： (统计)机器学习 | 机器学习

2021-11-02

我们制定自然梯度变推理（VI），期望传播（EP），和后线性化（PL）作为牛顿法用于优化贝叶斯后验分布的参数扩展。这种观点明确地把数值优化框架下的推理算法。我们表明，通用近似牛顿法从优化文献，即高斯 - 牛顿和准牛顿方法（例如，该BFGS算法），仍然是这种“贝叶斯牛顿”框架下有效。这导致了一套这些都保证以产生半正定协方差矩阵，不像标准VI和EP新颖算法。我们统一的观点提供了新的见解各种推理方案之间的连接。所有提出的方法适用于具有高斯事先和非共轭的可能性，这是我们与（疏）高斯过程和状态空间模型展示任何模型。

translated by 谷歌翻译

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting

Su Wang , Chitwan Saharia , Ceslee Montgomery , Jordi Pont-Tuset , Shai Noy , Stefano Pellegrini , Yasumasa Onoe , Sarah Laszlo , David J. Fleet , Radu Soricut

分类：计算机视觉 | 人工智能

2022-12-13

Text-guided image editing can have a transformative impact in supporting creative applications. A key challenge is to generate edits that are faithful to input text prompts, while consistent with input images. We present Imagen Editor, a cascaded diffusion model built, by fine-tuning Imagen on text-guided image inpainting. Imagen Editor's edits are faithful to the text prompts, which is accomplished by using object detectors to propose inpainting masks during training. In addition, Imagen Editor captures fine details in the input image by conditioning the cascaded pipeline on the original high resolution image. To improve qualitative and quantitative evaluation, we introduce EditBench, a systematic benchmark for text-guided image inpainting. EditBench evaluates inpainting edits on natural and generated images exploring objects, attributes, and scenes. Through extensive human evaluation on EditBench, we find that object-masking during training leads to across-the-board improvements in text-image alignment -- such that Imagen Editor is preferred over DALL-E 2 and Stable Diffusion -- and, as a cohort, these models are better at object-rendering than text-rendering, and handle material/color/size attributes better than count/shape attributes.

translated by 谷歌翻译

Exploring Randomly Wired Neural Networks for Climate Model Emulation

William Yik , Sam J. Silva , Andrew Geiss , Duncan Watson-Parris

分类：机器学习

2022-12-06

Exploring the climate impacts of various anthropogenic emissions scenarios is key to making informed decisions for climate change mitigation and adaptation. State-of-the-art Earth system models can provide detailed insight into these impacts, but have a large associated computational cost on a per-scenario basis. This large computational burden has driven recent interest in developing cheap machine learning models for the task of climate model emulation. In this manuscript, we explore the efficacy of randomly wired neural networks for this task. We describe how they can be constructed and compare them to their standard feedforward counterparts using the ClimateBench dataset. Specifically, we replace the serially connected dense layers in multilayer perceptrons, convolutional neural networks, and convolutional long short-term memory networks with randomly wired dense layers and assess the impact on model performance for models with 1 million and 10 million parameters. We find average performance improvements of 4.2% across model complexities and prediction tasks, with substantial performance improvements of up to 16.4% in some cases. Furthermore, we find no significant difference in prediction speed between networks with standard feedforward dense layers and those with randomly wired layers. These findings indicate that randomly wired neural networks may be suitable direct replacements for traditional dense layers in many standard models.

translated by 谷歌翻译

SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies

Zehao Yu , Xi Yang , Chong Dang , Prakash Adekkanattu , Braja Gopal Patra , Yifan Peng , Jyotishman Pathak , Debbie L. Wilson , Ching-Yuan Chang , Wei-Hsuan Lo-Ciganic

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-06

Objective: We aim to develop an open-source natural language processing (NLP) package, SODA (i.e., SOcial DeterminAnts), with pre-trained transformer models to extract social determinants of health (SDoH) for cancer patients, examine the generalizability of SODA to a new disease domain (i.e., opioid use), and evaluate the extraction rate of SDoH using cancer populations. Methods: We identified SDoH categories and attributes and developed an SDoH corpus using clinical notes from a general cancer cohort. We compared four transformer-based NLP models to extract SDoH, examined the generalizability of NLP models to a cohort of patients prescribed with opioids, and explored customization strategies to improve performance. We applied the best NLP model to extract 19 categories of SDoH from the breast (n=7,971), lung (n=11,804), and colorectal cancer (n=6,240) cohorts. Results and Conclusion: We developed a corpus of 629 cancer patients notes with annotations of 13,193 SDoH concepts/attributes from 19 categories of SDoH. The Bidirectional Encoder Representations from Transformers (BERT) model achieved the best strict/lenient F1 scores of 0.9216 and 0.9441 for SDoH concept extraction, 0.9617 and 0.9626 for linking attributes to SDoH concepts. Fine-tuning the NLP models using new annotations from opioid use patients improved the strict/lenient F1 scores from 0.8172/0.8502 to 0.8312/0.8679. The extraction rates among 19 categories of SDoH varied greatly, where 10 SDoH could be extracted from >70% of cancer patients, but 9 SDoH had a low extraction rate (<70% of cancer patients). The SODA package with pre-trained transformer models is publicly available at https://github.com/uf-hobiinformatics-lab/SDoH_SODA.

translated by 谷歌翻译

Temporally Extended Successor Representations

Matthew J. Sargent , Peter J. Bentley , Caswell Barry , William de Cothi

分类：机器学习 | 人工智能

2022-09-25

我们提出了连续表示的时间扩展变化，我们称其为t-SR。 T-SR通过在原始动作重复序列上构造后继表示，捕获了时间扩展动作的预期状态过渡动力学。这种时间抽象的这种形式不能学习相关任务结构的自上而下的层次结构，而是对耦合动作和动作重复的自下而上的组成。这减少了在没有学习层次政策的情况下控制中所需的决策数量。因此，T-SR直接考虑了时间扩展的动作序列的时间范围，而无需预定义或域特异性选项。我们表明，在具有动态奖励结构的环境中，T-SR能够利用后继表示的灵活性和时间扩展的动作提供的抽象。因此，在一系列稀疏的网格世界环境中，T-SR最佳地适应策略远比基于可比的无模型的强化学习方法快得多。我们还表明，T-SR学到的解决这些任务的方式要求学习的策略的始终如一的频率比非临时扩展的策略少。

translated by 谷歌翻译

Deep learning at the edge enables real-time streaming ptychographic imaging

Anakha V Babu , Tao Zhou , Saugat Kandel , Tekin Bicer , Zhengchun Liu , William Judge , Daniel J. Ching , Yi Jiang , Sinisa Veseli , Steven Henke

分类：机器学习

2022-09-20

相干显微镜技术提供了跨科学和技术领域的材料的无与伦比的多尺度视图，从结构材料到量子设备，从综合电路到生物细胞。在构造更明亮的来源和高速探测器的驱动下，连贯的X射线显微镜方法（如Ptychography）有望彻底改变纳米级材料的特征。但是，相关的数据和计算需求显着增加意味着，常规方法不再足以从高速相干成像实验实时恢复样品图像。在这里，我们演示了一个工作流程，该工作流利用边缘的人工智能和高性能计算，以实现直接从检测器直接从检测器流出的X射线ptychography数据实时反演。拟议的AI支持的工作流程消除了传统的Ptychography施加的采样约束，从而使用比传统方法所需的数据较少的数据级允许低剂量成像。

translated by 谷歌翻译

Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

Tiffany J. Callahan , Adrianne L. Stefanski , Jordan M. Wyrwa , Chenjie Zeng , Anna Ostropolets , Juan M. Banda , William A. Baumgartner Jr. , Richard D. Boyce , Elena Casiraghi , Ben D. Coleman

分类：人工智能

2022-09-10

通用数据模型解决了标准化电子健康记录（EHR）数据的许多挑战，但无法将其集成深度表型所需的资源。开放的生物学和生物医学本体论（OBO）铸造本体论提供了可用于生物学知识的语义计算表示，并能够整合多种生物医学数据。但是，将EHR数据映射到OBO Foundry本体论需要大量的手动策展和域专业知识。我们介绍了一个框架，用于将观察性医学成果合作伙伴关系（OMOP）标准词汇介绍给OBO铸造本体。使用此框架，我们制作了92,367条条件，8,615种药物成分和10,673个测量结果的映射。域专家验证了映射准确性，并且在24家医院进行检查时，映射覆盖了99％的条件和药物成分和68％的测量结果。最后，我们证明OMOP2OBO映射可以帮助系统地识别可能受益于基因检测的未诊断罕见病患者。

translated by 谷歌翻译

Developing moral AI to support antimicrobial decision making

William J Bolton , Cosmin Badea , Pantelis Georgiou , Alison Holmes , Timothy M Rawson

分类：人工智能 | 机器学习

2022-08-12

辅助抗菌处方的人工智能（AI）提出了重大的道德问题。利用与AI驱动的系统一起利用道德框架，同时考虑特定的复杂性，可以支持道德决策以应对抗菌抗性。

translated by 谷歌翻译

Knowledge-Driven Mechanistic Enrichment of the Preeclampsia Ignorome

Tiffany J. Callahan , Adrianne L. Stefanski , Jin-Dong Kim , William A. Baumgartner Jr. , Jordan M. Wyrwa , Lawrence E. Hunter

分类：人工智能

2022-07-28

子痫前期是孕产妇和胎儿发病率和死亡率的主要原因。目前，先兆子痫的唯一明确治疗方法是胎盘的递送，这对于疾病的发病机理至关重要。已经广泛地进行了鉴定出差异表达的基因（DEGS），已经进行了广泛的先兆子痫对人胎盘的转录分析。使用无偏见的测定法确定了DEG，但是，在实验上研究DEG的决策受到许多因素的偏见，导致许多DEGS仍未被评估。一组与疾病在实验上相关的DEG，但与文献中的疾病尚无相关性，被称为无知组。先兆子痫具有广泛的科学文献，大量的DEG数据库，只有一种确定的治疗方法。促进基于知识的分析的工具能够将许多来源的不同数据结合起来，以提出基本的行动机制，可能是支持发现并提高我们对这种疾病的理解的宝贵资源。在这项工作中，我们证明了如何使用生物医学知识图（KG）来识别新型的先兆子痫分子机制。现有的开源生物医学资源和公开可用的高通量转录分析数据用于识别和注释当前未经资助的先兆子痫相关的DEG的功能。使用文本挖掘方法从PubMed摘要中鉴定出与先兆子痫相关的基因。文本媒介和荟萃分析衍生的列表的相对补体被确定为未经投票的前启示性脱位相关的DEG（n = 445），即先前的无知组。使用KG研究相关的DEG，揭示了53种新型临床相关和生物学作用的机械关联。

translated by 谷歌翻译