智能论文笔记

Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

Runzhe Wan , Yingying Li , Wenbin Lu , Rui Song

分类：机器学习

2022-12-25

Latent factor model estimation typically relies on either using domain knowledge to manually pick several observed covariates as factor proxies, or purely conducting multivariate analysis such as principal component analysis. However, the former approach may suffer from the bias while the latter can not incorporate additional information. We propose to bridge these two approaches while allowing the number of factor proxies to diverge, and hence make the latent factor model estimation robust, flexible, and statistically more accurate. As a bonus, the number of factors is also allowed to grow. At the heart of our method is a penalized reduced rank regression to combine information. To further deal with heavy-tailed data, a computationally attractive penalized robust reduced rank regression method is proposed. We establish faster rates of convergence compared with the benchmark. Extensive simulations and real examples are used to illustrate the advantages.

translated by 谷歌翻译

AutoLV: Automatic Lecture Video Generator

Wenbin Wang , Yang Song , Sanjay Jha

分类：计算机视觉

2022-09-19

我们提出了一个端到端的讲座视频生成系统，该系统可以直接从注释的幻灯片，讲师的参考语音和讲师的参考肖像视频中生成现实和完整的讲座视频。我们的系统主要由语音合成模块组成，具有很少的扬声器适应器和基于对抗性学习的说话头生成模块。它不仅能够减少讲师的工作量，还可以改变语言和口音，这可以帮助学生更轻松地跟随讲座，并能够更广泛地传播讲座内容。我们的实验结果表明，所提出的模型在真实性，自然性和准确性方面优于其他当前方法。这是一个视频演示，展示了我们的系统的工作原理以及评估和比较的结果：https：//youtu.be/cy6tyki0cog。

translated by 谷歌翻译

Jump Interval-Learning for Individualized Decision Making

Hengrui Cai , Chengchun Shi , Rui Song , Wenbin Lu

分类：机器学习 | (统计)机器学习

2021-11-17

个性化决定规则（IDR）是一个决定函数，可根据他/她观察到的特征分配给定的治疗。文献中的大多数现有工作考虑使用二进制或有限的许多治疗方案的设置。在本文中，我们专注于连续治疗设定，并提出跳跃间隔 - 学习，开发一个最大化预期结果的个性化间隔值决定规则（I2DR）。与推荐单一治疗的IDRS不同，所提出的I2DR为每个人产生了一系列治疗方案，使其在实践中实施更加灵活。为了获得最佳I2DR，我们的跳跃间隔学习方法估计通过跳转惩罚回归给予治疗和协变量的结果的条件平均值，并基于估计的结果回归函数来衍生相应的最佳I2DR。允许回归线是用于清晰的解释或深神经网络的线性，以模拟复杂的处理 - 协调会相互作用。为了实现跳跃间隔学习，我们开发了一种基于动态编程的搜索算法，其有效计算结果回归函数。当结果回归函数是处理空间的分段或连续功能时，建立所得I2DR的统计特性。我们进一步制定了一个程序，以推断（估计）最佳政策下的平均结果。进行广泛的模拟和对华法林研究的真实数据应用，以证明所提出的I2DR的经验有效性。

translated by 谷歌翻译

A Probit Tensor Factorization Model For Relational Learning

Ye Liu , Rui Song , Wenbin Lu , Yanghua Xiao

分类：机器学习 | (统计)机器学习

2021-11-06

随着知识图的扩散，具有复杂多界结构的建模数据在统计关系学习领域获得了越来越大的关注。统计关系学习最重要的目标之一是链路预测，即，预测知识图中是否存在某些关系。已经提出了大量模型和算法来执行链路预测，其中张量分解方法已经证明在计算效率和预测准确性方面实现了最先进的性能。然而，现有张量分解模型的共同缺点是缺失的关系和非现有关系是以相同的方式对待，这导致信息丢失。为了解决这个问题，我们提出了一种具有探测链路的二进制张量分解模型，其不仅继承了来自经典张量分解模型的计算效率，还占关联数据的二进制性质。我们所提出的探测张量分解（PTF）模型显示了预测准确性和可解释性的优点

translated by 谷歌翻译

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks

Jianhong Wang , Wangkun Xu , Yunjie Gu , Wenbin Song , Tim C. Green

分类：机器学习

2021-10-27

本文介绍了电力网络的问题，可以为应用多功能增强学习（Marl）创造一个令人兴奋和挑战的现实情景。脱碳的新出现趋势在配电网络上放置过大的压力。主动电压控制被视为有希望的解决方案，以减轻电力拥塞和改善电压质量，无需额外的硬件投资，利用网络中的可控装置，例如屋顶光伏（PVS）和静态VAR补偿器（SVC）。这些可控设备出现在大量广大数字中，并分布在宽的地理区域中，使Marl成为自然候选者。本文在DEC-POMDP框架中制定了主动电压控制问题，并建立了开源环境。它旨在弥合电力社区与马尔社区之间的差距，并成为马尔算法实际应用的驱动力。最后，我们分析了主动电压控制问题的特殊特征，导致最先进的Marl方法挑战，并总结了潜在的方向。

translated by 谷歌翻译

CODA: Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

Hengrui Cai , Wenbin Lu , Rui Song

分类： (统计)机器学习

2021-04-21

我们考虑在具有多个可用的多个辅助来源的主要兴趣样本中最佳决策问题。感兴趣的结果是有限的，因为它仅在主要样本中观察到。实际上，这种多个数据源可能属于异质研究，因此不能直接组合。本文提出了一种新的框架来处理异构研究，并通过新的校准最佳决策（CODA）方法同时解决有限的结果，通过利用多种数据来源的常见中间结果来解决。具体地，CODA允许跨不同样品的基线协变量具有均匀或异质的分布。在温和和可测试的假设下，不同样本中的中间结果的条件方法等于基线协变量和治疗信息，我们表明，条件平均结果的提议CODA估计是渐近正常的和更有效的，而不是使用主要样品。此外，由于速率双重稳健性，可以使用简单的插件方法轻松获得CODA估计器的方差。对模拟数据集的广泛实验显示了使用CoDa的经验有效性和提高效率，然后是与来自Eicu的辅助数据的主要样本是MIMIC-III数据集的真实应用程序。

translated by 谷歌翻译

Learning to Shape Rewards using a Game of Two Partners

David Mguni , Taher Jafferjee , Jianhong Wang , Nicolas Perez-Nieves , Yaodong Yang , Tianpei Yang , Matthew Taylor , Wenbin Song , Feifei Tong , Hui Chen

分类：机器学习 | 人工智能

2021-03-16

奖励成型（RS）是克服稀疏或不信息奖励问题的强大方法（RL）。但是，RS通常依赖于手动设计的成型奖励功能，其构造耗时且容易出错。它还需要与自主学习目标相反的领域知识。我们介绍了增强学习优化塑造算法（ROSA）的增强型，这是一个自动化的RS框架，其中塑造奖励函数是在两个代理之间的新型马尔可夫游戏中构建的。奖励塑料代理（Shaper）使用切换控件来确定在其他代理（控制器）使用这些形状奖励的任务中学习任务的最佳策略，以确定要添加形状奖励及其最佳值的状态。我们证明，Rosa很容易采用现有的RL算法，学会了构建针对任务的塑造奖励功能，从而确保有效地收敛到高性能策略。我们在三个经过精心设计的实验中展示了罗莎（Rosa）在挑战稀疏奖励环境中对最先进的RS算法的优越性能。

translated by 谷歌翻译

Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings

Hengrui Cai , Chengchun Shi , Rui Song , Wenbin Lu

分类： (统计)机器学习 | 机器学习

2020-10-29

我们认为离政策在连续处理设置，如个性化的剂量调查评价（OPE）。在OPE，一个目标来估算下使用不同的决策规则产生的历史数据的新的治疗决策规则中的平均结果。离散处理设置上OPE焦点大多数现有的作品。为了应对持续的治疗，我们开发使用OPE深跳学习一种新的估计方法。我们的方法在于在使用深离散化，通过利用深度学习和多尺度变化点检测自适应离散化治疗领域的主要成分。这使我们能够应用在离散处理现有OPE方法来处理连续治疗。我们的方法是通过理论计算结果，模拟和实际应用程序，以华法林给药进一步合理的。

translated by 谷歌翻译

A Class-wise Non-salient Region Generalized Framework for Video Semantic Segmentation

Yuhang Zhang , Shishun Tian , Muxin Liao , Zhengyu Zhang , Wenbin Zou , Chen Xu

分类：计算机视觉

2022-12-29

Video semantic segmentation (VSS) is beneficial for dealing with dynamic scenes due to the continuous property of the real-world environment. On the one hand, some methods alleviate the predicted inconsistent problem between continuous frames. On the other hand, other methods employ the previous frame as the prior information to assist in segmenting the current frame. Although the previous methods achieve superior performances on the independent and identically distributed (i.i.d) data, they can not generalize well on other unseen domains. Thus, we explore a new task, the video generalizable semantic segmentation (VGSS) task that considers both continuous frames and domain generalization. In this paper, we propose a class-wise non-salient region generalized (CNSG) framework for the VGSS task. Concretely, we first define the class-wise non-salient feature, which describes features of the class-wise non-salient region that carry more generalizable information. Then, we propose a class-wise non-salient feature reasoning strategy to select and enhance the most generalized channels adaptively. Finally, we propose an inter-frame non-salient centroid alignment loss to alleviate the predicted inconsistent problem in the VGSS task. We also extend our video-based framework to the image-based generalizable semantic segmentation (IGSS) task. Experiments demonstrate that our CNSG framework yields significant improvement in the VGSS and IGSS tasks.

translated by 谷歌翻译

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

Jinze Bai , Rui Men , Hao Yang , Xuancheng Ren , Kai Dang , Yichang Zhang , Xiaohuan Zhou , Peng Wang , Sinan Tan , An Yang

分类：计算机视觉 | 人工智能 | 自然语言处理 | 机器学习

2022-12-08

Generalist models, which are capable of performing diverse multi-modal tasks in a task-agnostic way within a single model, have been explored recently. Being, hopefully, an alternative to approaching general-purpose AI, existing generalist models are still at an early stage, where modality and task coverage is limited. To empower multi-modal task-scaling and speed up this line of research, we release a generalist model learning system, OFASys, built on top of a declarative task interface named multi-modal instruction. At the core of OFASys is the idea of decoupling multi-modal task representations from the underlying model implementations. In OFASys, a task involving multiple modalities can be defined declaratively even with just a single line of code. The system automatically generates task plans from such instructions for training and inference. It also facilitates multi-task training for diverse multi-modal workloads. As a starting point, we provide presets of 7 different modalities and 23 highly-diverse example tasks in OFASys, with which we also develop a first-in-kind, single model, OFA+, that can handle text, image, speech, video, and motion data. The single OFA+ model achieves 95% performance in average with only 16% parameters of 15 task-finetuned models, showcasing the performance reliability of multi-modal task-scaling provided by OFASys. Available at https://github.com/OFA-Sys/OFASys

translated by 谷歌翻译