Smart Paper Notes

CEG4N: Counter-Example Guided Neural Network Quantization Refinement

João Batista P. Matos Jr. , Iury Bessa , Edoardo Manino , Xidan Song , Lucas C. Cordeiro

Categories: Machine Learning | Artificial Intelligence

2022-07-09

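Neural networks are essential components of learning-based software systems. However, their high compute, memory, and power requirements make them challenging to use in low-resource domains. For this reason, neural networks are often quantized before deployment. Existing quantization techniques tend to degrade network accuracy. We propose Counter-Example Guided Neural Network Quantization Refinement (CEG4N). This technique combines search-based quantization and equivalence verification: the former minimizes the computational requirements, while the latter guarantees that the network's output does not change after quantization. We evaluate CEG4N on a diverse set of benchmarks, including large and small networks. Our technique successfully quantizes the networks in our evaluation while producing models with up to 72% better accuracy than state-of-the-art techniques.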
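
The loop described above, alternating a search step with an equivalence check, can be sketched as follows. This is only an illustration under strong assumptions, not the authors' implementation: the "network" is a toy linear classifier, the search is a linear scan over bit widths, and the verifier is replaced by an exhaustive check over a small finite input grid. All function names are hypothetical.

```python
from itertools import product


def quantize(weights, bits):
    """Uniform symmetric quantization of a weight list to `bits` bits."""
    scale = max(abs(w) for w in weights) or 1.0
    levels = 2 ** (bits - 1) - 1  # largest representable integer level
    return [round(w / scale * levels) / levels * scale for w in weights]


def predict(weights, x):
    """Toy 'network': a linear classifier returning class 1 or 0."""
    return 1 if sum(w * xi for w, xi in zip(weights, x)) > 0 else 0


def search_bits(original, known_inputs, min_bits=2, max_bits=16):
    """Search step: smallest bit width that agrees on all known inputs."""
    for bits in range(min_bits, max_bits + 1):
        candidate = quantize(original, bits)
        if all(predict(original, x) == predict(candidate, x) for x in known_inputs):
            return bits, candidate
    raise ValueError("no bit width in range preserves the known outputs")


def find_counterexample(original, candidate, domain):
    """Verification stand-in: exhaustive check over a finite input domain."""
    for x in domain:
        if predict(original, x) != predict(candidate, x):
            return x
    return None


def ceg_quantize(original, seed_inputs, domain):
    """Alternate search and verification until no counterexample remains."""
    known = list(seed_inputs)
    while True:
        bits, candidate = search_bits(original, known)
        cex = find_counterexample(original, candidate, domain)
        if cex is None:
            return bits, candidate
        known.append(cex)  # the counterexample constrains the next search


if __name__ == "__main__":
    weights = [0.9, -0.35, 0.1]
    domain = list(product([-1, 0, 1], repeat=3))
    bits, quantized = ceg_quantize(weights, [(1, 0, 0)], domain)
    print(bits, quantized)
```

Each counterexample rules out the current candidate, so the next search step must return a larger bit width. A real verifier would reason about the whole input space rather than a finite grid.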

Translated by Google Translate

Video Segmentation Learning Using Cascade Residual Convolutional Neural Network

Daniel F. S. Santos , Rafael G. Pires , Danilo Colombo , João P. Papa

Categories: Computer Vision | Machine Learning

2022-12-20

Video segmentation consists of a frame-by-frame selection process of meaningful areas related to foreground moving objects. Some applications include traffic monitoring, human tracking, action recognition, efficient video surveillance, and anomaly detection. In these applications, it is not rare to face challenges such as abrupt changes in weather conditions, illumination issues, shadows, subtle dynamic background motions, and camouflage effects. In this work, we address such shortcomings by proposing a novel deep learning video segmentation approach that incorporates residual information into the foreground detection learning process. The main goal is to provide a method capable of generating an accurate foreground detection given a grayscale video. Experiments conducted on the Change Detection 2014 dataset and on the private PetrobrasROUTES dataset from Petrobras support the effectiveness of the proposed approach compared with some state-of-the-art video segmentation techniques, with overall F-measures of 0.9535 and 0.9636 on the Change Detection 2014 and PetrobrasROUTES datasets, respectively. Such a result places the proposed technique amongst the top 3 state-of-the-art video segmentation methods, while using approximately seven times fewer parameters than its top-ranked counterpart.
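
For readers unfamiliar with the metric quoted above, the following is a minimal sketch of the standard F-measure (harmonic mean of precision and recall) on binary foreground masks. It assumes flattened masks of 0/1 pixel labels. The abstract does not state the paper's evaluation protocol (for example, how scores are averaged across videos), and the function name is illustrative.

```python
def f_measure(predicted, truth):
    """F-measure of a binary foreground mask against the ground truth.

    Both arguments are equal-length sequences of 0/1 pixel labels.
    """
    tp = sum(1 for p, t in zip(predicted, truth) if p and t)
    fp = sum(1 for p, t in zip(predicted, truth) if p and not t)
    fn = sum(1 for p, t in zip(predicted, truth) if not p and t)
    if tp == 0:
        return 0.0  # no correctly detected foreground pixel
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(f_measure([1, 1, 0, 0], [1, 0, 1, 0]))
```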

Scene Change Detection Using Multiscale Cascade Residual Convolutional Neural Networks

Daniel F. S. Santos , Rafael G. Pires , Danilo Colombo , João P. Papa

Categories: Computer Vision | Machine Learning

2022-12-20

Scene change detection is an image processing problem related to partitioning pixels of a digital image into foreground and background regions. Most visual knowledge-based intelligent computer systems, such as traffic monitoring, video surveillance, and anomaly detection, need to use change detection techniques. Amongst the most prominent detection methods are the learning-based ones, which, besides sharing similar training and testing protocols, differ from each other in their architecture design strategies. Such architecture design directly impacts the quality of the detection results and also the device resources required, such as memory. In this work, we propose a novel Multiscale Cascade Residual Convolutional Neural Network that integrates a multiscale processing strategy through a Residual Processing Module with a Segmentation Convolutional Neural Network. Experiments conducted on two different datasets support the effectiveness of the proposed approach, achieving average overall F-measure results of 0.9622 and 0.9664 on the Change Detection 2014 and PetrobrasROUTES datasets, respectively, while using approximately eight times fewer parameters. These results place the proposed technique amongst the top four state-of-the-art scene change detection methods.

DDIPNet and DDIPNet+: Discriminant Deep Image Prior Networks for Remote Sensing Image Classification

Daniel F. S. Santos , Rafael G. Pires , Leandro A. Passos , João P. Papa

Categories: Computer Vision | Machine Learning

2022-12-20

Research on remote sensing image classification significantly impacts essential human routine tasks such as urban planning and agriculture. Nowadays, the rapid advance in technology and the availability of many high-quality remote sensing images create a demand for reliable automation methods. The current paper proposes two novel deep learning-based architectures for image classification purposes, i.e., the Discriminant Deep Image Prior Network and the Discriminant Deep Image Prior Network+, which combine Deep Image Prior and Triplet Networks learning strategies. Experiments conducted over three well-known public remote sensing image datasets achieved state-of-the-art results, evidencing the effectiveness of using deep image priors for remote sensing image classification.
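
As background on the Triplet Network strategy mentioned above, here is a minimal sketch of the commonly used triplet margin loss on plain embedding vectors. It is the generic textbook form with Euclidean distance, not necessarily the loss used in the paper. The margin value and function name are assumptions.

```python
import math


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Generic triplet margin loss on embedding vectors.

    The loss is zero once the negative is farther from the anchor than
    the positive by at least `margin` (Euclidean distance).
    """
    d_pos = math.dist(anchor, positive)
    d_neg = math.dist(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


if __name__ == "__main__":
    # Negative already far away: no loss.
    print(triplet_loss([0, 0], [0, 1], [3, 4]))
    # Positive and negative swapped: large loss.
    print(triplet_loss([0, 0], [3, 4], [0, 1]))
```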

FEMa-FS: Finite Element Machines for Feature Selection

Lucas Biaggi , João P. Papa , Kelton A. P Costa , Danillo R. Pereira , Leandro A. Passos

Categories: Machine Learning | Artificial Intelligence

2022-12-05

Identifying anomalies has become one of the primary strategies for security and protection procedures in computer networks. In this context, machine learning-based methods emerge as an elegant solution to identify such scenarios and learn which information is irrelevant, so that a reduction in identification time and a possible gain in accuracy can be obtained. This paper proposes a novel feature selection approach called Finite Element Machines for Feature Selection (FEMa-FS), which uses the framework of finite elements to identify the most relevant information from a given dataset. Although FEMa-FS can be applied to any application domain, it has been evaluated in the context of anomaly detection in computer networks. The outcomes over two datasets showed promising results.

Active learning using adaptable task-based prioritisation

Shaheer U. Saeed , João Ramalhinho , Mark Pinnock , Ziyi Shen , Yunguan Fu , Nina Montaña-Brown , Ester Bonmati , Dean C. Barratt , Stephen P. Pereira , Brian Davidson

Categories: Computer Vision

2022-12-03

Supervised machine learning-based medical image computing applications necessitate expert label curation, while unlabelled image data might be relatively abundant. Active learning methods aim to prioritise a subset of available image data for expert annotation, for label-efficient model training. We develop a controller neural network that measures priority of images in a sequence of batches, as in batch-mode active learning, for multi-class segmentation tasks. The controller is optimised by rewarding positive task-specific performance gain, within a Markov decision process (MDP) environment that also optimises the task predictor. In this work, the task predictor is a segmentation network. A meta-reinforcement learning algorithm is proposed with multiple MDPs, such that the pre-trained controller can be adapted to a new MDP that contains data from different institutes and/or requires segmentation of different organs or structures within the abdomen. We present experimental results using multiple CT datasets from more than one thousand patients, with segmentation tasks of nine different abdominal organs, to demonstrate the efficacy of the learnt prioritisation controller function and its cross-institute and cross-organ adaptability. We show that the proposed adaptable prioritisation metric yields converging segmentation accuracy for the novel class of kidney, unseen in training, using approximately 40% to 60% of the labels otherwise required with other heuristic or random prioritisation metrics. For clinical datasets of limited size, the proposed adaptable prioritisation offers a performance improvement of 22.6% and 10.2% in Dice score, for tasks of kidney and liver vessel segmentation, respectively, compared to random prioritisation and alternative active sampling strategies.
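
The Dice score used to report these gains can be sketched as below for multi-class label maps. This is the standard definition computed per class on flattened label sequences. The abstract does not specify how the paper aggregates scores across classes or patients, and the function name is hypothetical.

```python
def dice_per_class(predicted, truth, classes):
    """Dice score 2|A∩B| / (|A| + |B|) for each class label.

    `predicted` and `truth` are equal-length sequences of integer labels.
    A class absent from both maps is scored 1.0 by convention here.
    """
    scores = {}
    for c in classes:
        pred_c = [p == c for p in predicted]
        true_c = [t == c for t in truth]
        overlap = sum(1 for p, t in zip(pred_c, true_c) if p and t)
        total = sum(pred_c) + sum(true_c)
        scores[c] = 1.0 if total == 0 else 2 * overlap / total
    return scores


if __name__ == "__main__":
    # Labels: 0 = background, 1 and 2 = two hypothetical organ classes.
    print(dice_per_class([0, 1, 1, 2], [0, 1, 2, 2], classes=[0, 1, 2]))
```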

From Actions to Events: A Transfer Learning Approach Using Improved Deep Belief Networks

Mateus Roder , Jurandy Almeida , Gustavo H. de Rosa , Leandro A. Passos , André L. D. Rossi , João P. Papa

Categories: Computer Vision | Artificial Intelligence

2022-11-30

In the last decade, exponential data growth has boosted the capacity of machine learning-based algorithms and enabled their usage in daily-life activities. Such an improvement is also partially explained by the advent of deep learning techniques, i.e., stacks of simple architectures that end up in more complex models. Although both factors produce outstanding results, they also pose drawbacks regarding the learning process, as training complex models over large datasets is expensive and time-consuming. Such a problem is even more evident when dealing with video analysis. Some works have considered transfer learning or domain adaptation, i.e., approaches that map the knowledge from one domain to another, to ease the training burden, yet most of them operate over individual frames or small blocks of frames. This paper proposes a novel approach to map the knowledge from action recognition to event recognition using an energy-based model, denoted as Spectral Deep Belief Network. Such a model can process all frames simultaneously, carrying spatial and temporal information through the learning process. The experimental results obtained over two public video datasets, HMDB-51 and UCF-101, show the effectiveness of the proposed model and its reduced computational burden when compared to traditional energy-based models, such as Restricted Boltzmann Machines and Deep Belief Networks.

Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation

Sérgio Jesus , José Pombal , Duarte Alves , André Cruz , Pedro Saleiro , Rita P. Ribeiro , João Gama , Pedro Bizarro

Categories: Machine Learning

2022-11-24

Evaluating new techniques on realistic datasets plays a crucial role in the development of ML research and its broader adoption by practitioners. In recent years, there has been a significant increase in publicly available unstructured data resources for computer vision and NLP tasks. However, tabular data, which is prevalent in many high-stakes domains, has been lagging behind. To bridge this gap, we present Bank Account Fraud (BAF), the first publicly available privacy-preserving, large-scale, realistic suite of tabular datasets. The suite was generated by applying state-of-the-art tabular data generation techniques on an anonymized, real-world bank account opening fraud detection dataset. This setting carries a set of challenges that are commonplace in real-world applications, including temporal dynamics and significant class imbalance. Additionally, to allow practitioners to stress test both performance and fairness of ML methods, each dataset variant of BAF contains specific types of data bias. With this resource, we aim to provide the research community with a more realistic, complete, and robust test bed to evaluate novel and existing methods.

Chronic pain patient narratives allow for the estimation of current pain intensity

Diogo A. P. Nunes , Joana Ferreira-Gomes , Carlos Vaz , Daniela Oliveira , Sofia Pimenta , Fani Neto , David Martins de Matos

Categories: Natural Language Processing

2022-10-31

Chronic pain is a multi-dimensional experience, and pain intensity plays an important part, impacting the patient's emotional balance, psychology, and behaviour. Standard self-reporting tools, such as the Visual Analogue Scale for pain, fail to capture this burden. Moreover, this type of tool is susceptible to a degree of subjectivity, dependent on the patient's clear understanding of how to use it, social biases, and their ability to translate a complex experience to a scale. To overcome these and other self-reporting challenges, pain intensity estimation has been previously studied based on facial expressions, electroencephalograms, brain imaging, and autonomic features. However, to the best of our knowledge, no attempt has been made to base this estimation on patients' narratives of their personal experience of chronic pain, which is what we propose in this work. Indeed, in the clinical assessment and management of chronic pain, verbal communication is essential to convey information to physicians that would otherwise not be easily accessible through standard reporting tools, since language, sociocultural, and psychosocial variables are intertwined. We show that language features from patient narratives indeed convey information relevant for pain intensity estimation, and that our computational models can take advantage of that. Specifically, our results show that patients with mild pain focus more on the use of verbs, whilst moderate and severe pain patients focus on adverbs, and on nouns and adjectives, respectively, and that these differences allow for the distinction between these three pain classes.

ComplexWoundDB: A Database for Automatic Complex Wound Tissue Categorization

Talita A. Pereira , Regina C. Popim , Leandro A. Passos , Danillo R. Pereira , Clayton R. Pereira , João P. Papa

Categories: Computer Vision | Machine Learning

2022-09-26

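Complex wounds usually involve partial or total loss of skin thickness and heal by secondary intention. They can be acute or chronic, and may involve infection, ischemia, and tissue necrosis, as well as association with systemic diseases. Research institutes around the globe report countless cases, which amount to a severe public health problem, since they consume human resources (e.g., physicians and health care professionals) and negatively impact quality of life. This paper presents a new database for automatically categorizing complex wounds into five categories, i.e., non-wound area, granulation, fibrinoid tissue, dry necrosis, and hematoma. The images cover different scenarios with complex wounds caused by pressure, vascular ulcers, diabetes, burns, and complications after surgical interventions. The dataset, called ComplexWoundDB, is unique because it provides pixel-level classifications for 27 images obtained in the wild, i.e., images collected at the patients' homes, labeled by four health professionals. Further experiments with distinct machine learning techniques show the challenges in addressing the problem of computer-aided complex wound tissue categorization. The manuscript sheds light on future directions in the area, with a detailed comparison against other databases widely used in the literature.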
