DeeProb-kit is a unified library written in Python consisting of a collection of deep probabilistic models (DPMs) that are tractable and exact representations of the modelled probability distributions. The availability of a representative selection of DPMs in a single library makes it possible to combine them in a straightforward manner, a common practice in deep learning research nowadays. In addition, it includes efficiently implemented learning techniques, inference routines, and statistical algorithms, and provides high-quality, fully documented APIs. The development of DeeProb-kit will help the community to accelerate research on DPMs, to standardise their evaluation, and to better understand how they are related based on their expressivity.
Objective: Accurate visual classification of bladder tissue during Trans-Urethral Resection of Bladder Tumor (TURBT) procedures is essential to improve early cancer diagnosis and treatment. During TURBT interventions, White Light Imaging (WLI) and Narrow Band Imaging (NBI) techniques are used for lesion detection. Each imaging technique provides diverse visual information that allows clinicians to identify and classify cancerous lesions. Computer vision methods that use both imaging techniques could improve endoscopic diagnosis. We address the challenge of tissue classification when annotations are available only in one domain, in our case WLI, and the endoscopic images correspond to an unpaired dataset, i.e. there is no exact equivalent for every image in both NBI and WLI domains. Method: We propose a semi-supervised Generative Adversarial Network (GAN)-based method composed of three main components: a teacher network trained on the labeled WLI data; a cycle-consistency GAN to perform unpaired image-to-image translation; and a multi-input student network. To ensure the quality of the synthetic images generated by the proposed GAN, we perform a detailed quantitative and qualitative analysis with the help of specialists. Conclusion: The overall average classification accuracy, precision, and recall obtained with the proposed method for tissue classification are 0.90, 0.88, and 0.89 respectively, while the same metrics obtained in the unlabeled domain (NBI) are 0.92, 0.64, and 0.94 respectively. The quality of the generated images is reliable enough to deceive specialists. Significance: This study shows the potential of using semi-supervised GAN-based classification to improve bladder tissue classification when annotations are limited in multi-domain data.
In this paper, we introduce MINTIME, a video deepfake detection approach that captures spatial and temporal anomalies and handles instances of multiple people in the same video and variations in face sizes. Previous approaches disregard such information either by using simple a-posteriori aggregation schemes, i.e., average or max operation, or using only one identity for the inference, i.e., the largest one. On the contrary, the proposed approach builds on a Spatio-Temporal TimeSformer combined with a Convolutional Neural Network backbone to capture spatio-temporal anomalies from the face sequences of multiple identities depicted in a video. This is achieved through an Identity-aware Attention mechanism that attends to each face sequence independently based on a masking operation and facilitates video-level aggregation. In addition, two novel embeddings are employed: (i) the Temporal Coherent Positional Embedding that encodes each face sequence's temporal information and (ii) the Size Embedding that encodes the size of the faces as a ratio to the video frame size. These extensions allow our system to adapt particularly well in the wild by learning how to aggregate information of multiple identities, which is usually disregarded by other methods in the literature. It achieves state-of-the-art results on the ForgeryNet dataset with an improvement of up to 14% AUC in videos containing multiple people and demonstrates ample generalization capabilities in cross-forgery and cross-dataset settings. The code is publicly available at
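The masking idea described in this abstract can be illustrated with a minimal sketch. It is not the MINTIME implementation: it assumes a plain single-head scaled dot-product attention in NumPy, and the function name `identity_masked_attention` and the `identity_ids` array are hypothetical names introduced here. It shows only how a mask can restrict each token to attend to tokens of the same face sequence.

```python
import numpy as np

def identity_masked_attention(Q, K, V, identity_ids):
    """Scaled dot-product attention where tokens attend only within their identity.

    Hypothetical illustration; the abstract does not specify the actual mechanism.
    """
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                               # (n, n) similarities
    same_identity = identity_ids[:, None] == identity_ids[None, :]
    scores = np.where(same_identity, scores, -np.inf)           # block cross-identity pairs
    scores = scores - scores.max(axis=-1, keepdims=True)        # numerical stability
    weights = np.exp(scores)                                    # masked entries become 0
    weights = weights / weights.sum(axis=-1, keepdims=True)     # row-wise softmax
    return weights @ V, weights

rng = np.random.default_rng(0)
identity_ids = np.array([0, 0, 0, 1, 1])   # 3 tokens from face 0, 2 tokens from face 1
tokens = rng.normal(size=(5, 16))          # toy token embeddings
out, weights = identity_masked_attention(tokens, tokens, tokens, identity_ids)
```

In this toy setup the attention weights between tokens of different identities are exactly zero, so each face sequence is processed independently; how the per-identity results are then aggregated at video level is not covered by this sketch.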
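Learning algorithms for deep neural networks are typically based on supervised end-to-end Stochastic Gradient Descent (SGD) training with error backpropagation (backprop). Backprop algorithms require a large number of labelled training samples to achieve high performance. However, in many realistic applications, even when plenty of image samples are available, very few of them are labelled, and semi-supervised, sample-efficient training strategies have to be used. Hebbian learning represents a possible approach towards sample-efficient training; however, in current solutions, it does not scale well to large datasets. In this paper, we present FastHebb, an efficient and scalable solution for Hebbian learning that achieves higher efficiency by 1) merging update computation and aggregation over a batch of inputs, and 2) leveraging efficient matrix multiplication algorithms on GPU. We validate our approach on different computer vision benchmarks in a semi-supervised learning scenario. FastHebb outperforms previous solutions in terms of training speed and, notably, for the first time, we are able to bring Hebbian algorithms to ImageNet scale.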
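The two ideas named in this abstract, merging update computation with batch aggregation and expressing the result as a matrix multiplication, can be sketched as follows. This is an illustrative sketch under stated assumptions, not the FastHebb code: it assumes the Hebbian rule Δw_i = η · y_i · (x − w_i), which is one common variant and is not given in the abstract, and it uses NumPy on CPU rather than a GPU library. The function names are hypothetical.

```python
import numpy as np

def hebbian_update_loop(W, X, Y, lr):
    """Reference version: one update per sample, aggregated afterwards."""
    delta = np.zeros_like(W)
    for x, y in zip(X, Y):                          # x: (d,), y: (k,)
        delta += y[:, None] * (x[None, :] - W)      # per-sample update, shape (k, d)
    return W + lr * delta / len(X)                  # average over the batch

def hebbian_update_batched(W, X, Y, lr):
    """Same update, with computation and aggregation merged into one matmul."""
    # sum_b y[b, i] * x[b, :]  is  (Y^T X)[i, :]
    # sum_b y[b, i] * W[i, :]  is  (sum_b y[b, i]) * W[i, :]
    delta = Y.T @ X - Y.sum(axis=0)[:, None] * W
    return W + lr * delta / len(X)

rng = np.random.default_rng(0)
X = rng.normal(size=(32, 8))        # batch of 32 inputs with 8 features
W = rng.normal(size=(4, 8))         # 4 neurons
Y = np.maximum(X @ W.T, 0.0)        # assumed ReLU activations, shape (32, 4)

W_loop = hebbian_update_loop(W, X, Y, lr=0.01)
W_fast = hebbian_update_batched(W, X, Y, lr=0.01)
```

Both functions produce the same weights up to floating-point error. The batched form never materialises the per-sample update tensors, which is the kind of saving the abstract attributes to merging computation and aggregation.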
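This paper presents the methodology and results of a series of educational events conducted in 2021 that used a robot swarm to teach high-school and university students about epidemiological models and how they can inform societal and governmental policies. The events, which focused specifically on the COVID-19 pandemic, consisted of 4 online and 3 in-person workshops in which students had the chance to interact with a swarm of 20 custom-built brushbots, small-scale vibration-driven robots optimized for portability and robustness. Through the analysis of data collected in a post-event survey, the paper shows how these events positively impacted students' views on the scientific method as a guide for real-world decisions, as well as their interest in robotics.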
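The Technology Innovation Institute in Abu Dhabi, United Arab Emirates, has recently completed the production and testing of a new unmanned surface vehicle (USV), called Nukhada, specifically designed for autonomous survey, inspection, and support of underwater operations. This manuscript describes the main characteristics of the Nukhada USV, as well as some of the trials conducted during its development.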