智能论文笔记

Scalable Machine Learning Architecture for Neonatal Seizure Detection on Ultra-Edge Devices

Vishal Nagarajan , Ashwini Muralidharan , Deekshitha Sriraman , Pravin Kumar S

分类：机器学习

2021-11-29

新生儿癫痫发作是一种通常遇到的神经系统条件。它们是严重神经障碍的第一个临床迹象。因此，需要快速识别和治疗以防止严重的死亡。在神经学领域中使用脑电图（EEG）允许精确地诊断几种医疗条件。然而，解释EEG信号需要高度专业人员的注意，因为婴儿脑在新生儿期间发育不起。检测癫痫发作可能会妨碍对婴儿的神经认知发展的负面影响。近年来，使用机器学习算法的新生儿癫痫发作检测已经获得牵引力。由于需要在癫痫发作检测的情况下对生物信号进行计算廉价的生物信号，因此本研究提供了一种基于机器学习（ML）的架构，其与以前的模型相当的预测性能，但具有最小级别配置。拟议的分类器在赫尔辛基大学医院录制的尼古尔缉获量的公共数据数据上进行了培训和测试。我们的架构实现了87％的最佳敏感性，比本研究中选择的标准ML型号的6％增加了6％。 ML分类器的模型大小优化为仅为4.84 kB，最小预测时间为182.61毫秒，从而使其部署在可穿戴的超边设备上，以便快速准确，并避免基于云的需求和其他这种穷举计算方法。

translated by 谷歌翻译

e-Inu: Simulating A Quadruped Robot With Emotional Sentience

Abhiruph Chakravarty , Jatin Karthik Tripathy , Sibi Chakkaravarthy S , Aswani Kumar Cherukuri , S. Anitha , Firuz Kamalov , Annapurna Jonnalagadda

分类：机器人 | 机器学习

2023-01-03

Quadruped robots are currently used in industrial robotics as mechanical aid to automate several routine tasks. However, presently, the usage of such a robot in a domestic setting is still very much a part of the research. This paper discusses the understanding and virtual simulation of such a robot capable of detecting and understanding human emotions, generating its gait, and responding via sounds and expression on a screen. To this end, we use a combination of reinforcement learning and software engineering concepts to simulate a quadruped robot that can understand emotions, navigate through various terrains and detect sound sources, and respond to emotions using audio-visual feedback. This paper aims to establish the framework of simulating a quadruped robot that is emotionally intelligent and can primarily respond to audio-visual stimuli using motor or audio response. The emotion detection from the speech was not as performant as ERANNs or Zeta Policy learning, still managing an accuracy of 63.5%. The video emotion detection system produced results that are almost at par with the state of the art, with an accuracy of 99.66%. Due to its "on-policy" learning process, the PPO algorithm was extremely rapid to learn, allowing the simulated dog to demonstrate a remarkably seamless gait across the different cadences and variations. This enabled the quadruped robot to respond to generated stimuli, allowing us to conclude that it functions as predicted and satisfies the aim of this work.

translated by 谷歌翻译

Chaotic Variational Auto Encoder based One Class Classifier for Insurance Fraud Detection

K. S. N. V. K. Gangadhar , B. Akhil Kumar , Yelleti Vivek , Vadlamani Ravi

分类：机器学习

2022-12-15

Of late, insurance fraud detection has assumed immense significance owing to the huge financial & reputational losses fraud entails and the phenomenal success of the fraud detection techniques. Insurance is majorly divided into two categories: (i) Life and (ii) Non-life. Non-life insurance in turn includes health insurance and auto insurance among other things. In either of the categories, the fraud detection techniques should be designed in such a way that they capture as many fraudulent transactions as possible. Owing to the rarity of fraudulent transactions, in this paper, we propose a chaotic variational autoencoder (C-VAE to perform one-class classification (OCC) on genuine transactions. Here, we employed the logistic chaotic map to generate random noise in the latent space. The effectiveness of C-VAE is demonstrated on the health insurance fraud and auto insurance datasets. We considered vanilla Variational Auto Encoder (VAE) as the baseline. It is observed that C-VAE outperformed VAE in both datasets. C-VAE achieved a classification rate of 77.9% and 87.25% in health and automobile insurance datasets respectively. Further, the t-test conducted at 1% level of significance and 18 degrees of freedom infers that C-VAE is statistically significant than the VAE.

translated by 谷歌翻译

Towards Building Text-To-Speech Systems for the Next Billion Users

Gokul Karthik Kumar , Praveen S V , Pratyush Kumar , Mitesh M. Khapra , Karthik Nandakumar

分类：自然语言处理 | 机器学习

2022-11-17

Deep learning based text-to-speech (TTS) systems have been evolving rapidly with advances in model architectures, training methodologies, and generalization across speakers and languages. However, these advances have not been thoroughly investigated for Indian language speech synthesis. Such investigation is computationally expensive given the number and diversity of Indian languages, relatively lower resource availability, and the diverse set of advances in neural TTS that remain untested. In this paper, we evaluate the choice of acoustic models, vocoders, supplementary loss functions, training schedules, and speaker and language diversity for Dravidian and Indo-Aryan languages. Based on this, we identify monolingual models with FastPitch and HiFi-GAN V1, trained jointly on male and female speakers to perform the best. With this setup, we train and evaluate TTS models for 13 languages and find our models to significantly improve upon existing models in all languages as measured by mean opinion scores. We open-source all models on the Bhashini platform.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Vision-based Perimeter Defense via Multiview Pose Estimation

Elijah S. Lee , Giuseppe Loianno , Dinesh Jayaraman , Vijay Kumar

分类：计算机视觉 | 机器人

2022-09-25

以前在外围防御游戏中的研究主要集中在完全可观察到的环境上，在该环境中，所有玩家都知道真正的玩家状态。但是，这对于实际实施而言是不现实的，因为捍卫者可能必须感知入侵者并估计其国家。在这项工作中，我们在照片真实的模拟器和现实世界中研究外围防御游戏，要求捍卫者从视力中估算入侵者状态。我们通过域随机化训练一个基于机器学习的系统，用于入侵者姿势检测，该系统汇总了多个视图，以减少状态估计错误并适应防御策略来解决此问题。我们新介绍性能指标来评估基于视觉的外围防御。通过广泛的实验，我们表明我们的方法改善了国家的估计，最终在两场比赛中的VS-1-Intruder游戏和2-Fefenders-VS-1-Intruder游戏中最终进行了外围防御性能。

translated by 谷歌翻译

An Overview of Violence Detection Techniques: Current Challenges and Future Directions

Nadia Mumtaz , Naveed Ejaz , Shabana Habib , Syed Muhammad Mohsin , Prayag Tiwari , Shahab S. Band , Neeraj Kumar

分类：计算机视觉 | 人工智能

2022-09-21

当今智能城市中产生的大型视频数据从其有目的的用法角度引起了人们的关注，其中监视摄像机等是最突出的资源，是为大量数据做出贡献的最突出的资源，使其自动化分析成为计算方面的艰巨任务。和精确。暴力检测（VD）在行动和活动识别域中广泛崩溃，用于分析大型视频数据，以了解由于人类而引起的异常动作。传统上，VD文献基于手动设计的功能，尽管开发了基于深度学习的独立模型的进步用于实时VD分析。本文重点介绍了深度序列学习方法以及检测到的暴力的本地化策略。该概述还介入了基于机器学习的初始图像处理和基于机器学习的文献及其可能具有的优势，例如针对当前复杂模型的效率。此外，讨论了数据集，以提供当前模型的分析，并用对先前方法的深入分析得出的VD域中的未来方向解释了他们的利弊。

translated by 谷歌翻译

CoV-TI-Net: Transferred Initialization with Modified End Layer for COVID-19 Diagnosis

Sadia Khanam , Mohammad Reza Chalak Qazani , Subrota Kumar Mondal , H M Dipu Kabir , Abadhan S. Sabyasachi , Houshyar Asadi , Keshav Kumar , Farzin Tabarsinezhad , Shady Mohamed , Abbas Khorsavi

分类：计算机视觉

2022-09-20

本文提议使用修改的完全连接层转移初始化，以进行1900诊断。卷积神经网络（CNN）在图像分类中取得了显着的结果。但是，由于图像识别应用程序的复杂性，培训高性能模型是一个非常复杂且耗时的过程。另一方面，转移学习是一种相对较新的学习方法，已在许多领域使用，以减少计算来实现良好的性能。在这项研究中，Pytorch预训练的模型（VGG19 \ _bn和WideresNet -101）首次在MNIST数据集中应用于初始化，并具有修改的完全连接的层。先前在Imagenet中对使用的Pytorch预培训模型进行了培训。提出的模型在Kaggle笔记本电脑中得到了开发和验证，并且在网络培训过程中没有花费巨大的计算时间，达到了99.77％的出色精度。我们还将相同的方法应用于SIIM-FISABIO-RSNA COVID-19检测数据集，并达到80.01％的精度。相比之下，以前的方法在训练过程中需要大量的压缩时间才能达到高性能模型。代码可在以下链接上找到：github.com/dipuk0506/spinalnet

translated by 谷歌翻译

A Survey on the application of Data Science And Analytics in the field of Organised Sports

Sachin Kumar S , Prithvi HV , C Nandini

分类：机器学习

2022-09-15

在现代世界中，数据科学和分析以优化或预测结果的应用无处不在。数据科学和分析已经优化了市场中存在的几乎所有领域。在我们的调查中，我们专注于如何在体育领域采用分析领域，以及它如何促进游戏的转型，从评估现场玩家及其选择到赢得团队的预测以及大型体育比赛的门票和商业方面的营销。我们将介绍体育分析领域采用的不同运动的分析工具，算法和方法论，并介绍我们对同一体育的看法，我们还将比较和对比这些现有方法。通过这样做，我们还将介绍任何希望尝试体育数据并分析游戏的各个方面的人考虑的最佳工具，算法和分析方法。

translated by 谷歌翻译

Data Science Approach to predict the winning Fantasy Cricket Team Dream 11 Fantasy Sports

Sachin Kumar S , Prithvi HV , C Nandini

分类：机器学习

2022-09-15

数字技术的发展和体育运动的日益普及激发了创新者，通过引入幻想体育平台FSP，将体育倾向的用户带到一个全新的不同层次上。数据科学和分析的应用在现代世界中无处不在。数据科学和分析打开门，以获得更深入的理解和帮助，以帮助决策过程。我们坚信，我们可以采用数据科学来预测FSP上的获胜幻想板球团队，Dream 11.我们建立了一个预测模型，可以预测潜在游戏中玩家的性能。我们结合了贪婪和背包算法的组合，开出了11名球员的组合，创建了一支幻想板球团队，这是最重要的统计赔率，即最大的团队成为最强的团队，从而使我们有更大的机会赢得梦想中的赌注。 11 FSP。我们使用Pycaret Python库来帮助我们理解并采用最佳回归算法来进行问题陈述，以做出精确的预测。此外，我们使用Plotly Python图书馆为我们提供了对团队的视觉见解，并且玩家通过计算前瞻性游戏的统计和主观因素来表演。交互作用图帮助我们提高了我们的预测模型的建议。您要么赢得大，赢得小巧，要么根据预期游戏中为您的幻想团队选出的球员的表现而失去赌注，而我们的模型增加了您赢得大的可能性。

translated by 谷歌翻译