智能论文笔记

5-Star Hotel Customer Satisfaction Analysis Using Hybrid Methodology

Yongmin Yoo , Yeongjoon Park , Dongjin Lim , Deaho Seo

分类：人工智能 | 自然语言处理

2022-09-26

由于由于电晕病毒而迅速开发了非面对面服务，因此通过互联网（例如销售和保留）的商业正在迅速增长。消费者还会在网站上发布有关商品或服务的评论，建议或判断。消费者直接使用的审查数据为消费者提供了积极的反馈和良好的影响，例如创造业务价值。因此，从营销的角度来看，分析审核数据非常重要。我们的研究提出了一种通过审核数据来找到客户满意度因素的新方法。我们采用了一种方法来通过混合和使用数据挖掘技术来找到客户满意度的因素，这是一种大数据分析方法，而自然语言处理技术（我们的研究中）是一种语言处理方法。与过去对客户满意度进行的许多研究不同，我们的研究通过使用各种技术来对论文的新颖性。由于分析，我们的实验结果非常准确。

translated by 谷歌翻译

Cross-Domain Consumer Review Analysis

Aditya Pandey , Kunal Joshi

分类：机器学习

2022-12-23

The paper presents a cross-domain review analysis on four popular review datasets: Amazon, Yelp, Steam, IMDb. The analysis is performed using Hadoop and Spark, which allows for efficient and scalable processing of large datasets. By examining close to 12 million reviews from these four online forums, we hope to uncover interesting trends in sales and customer sentiment over the years. Our analysis will include a study of the number of reviews and their distribution over time, as well as an examination of the relationship between various review attributes such as upvotes, creation time, rating, and sentiment. By comparing the reviews across different domains, we hope to gain insight into the factors that drive customer satisfaction and engagement in different product categories.

translated by 谷歌翻译

Sentiment analysis and opinion mining on E-commerce site

Fatema Tuz Zohra Anny , Oahidul Islam

分类：自然语言处理 | 机器学习

2022-11-28

Sentiment analysis or opinion mining help to illustrate the phrase NLP (Natural Language Processing). Sentiment analysis has been the most significant topic in recent years. The goal of this study is to solve the sentiment polarity classification challenges in sentiment analysis. A broad technique for categorizing sentiment opposition is presented, along with comprehensive process explanations. With the results of the analysis, both sentence-level classification and review-level categorization are conducted. Finally, we discuss our plans for future sentiment analysis research.

translated by 谷歌翻译

Impact of Sentiment Analysis in Fake Review Detection

Amira Yousif , James Buckley

分类：自然语言处理 | 机器学习

2022-12-18

Fake review identification is an important topic and has gained the interest of experts all around the world. Identifying fake reviews is challenging for researchers, and there are several primary challenges to fake review detection. We propose developing an initial research paper for investigating fake reviews by using sentiment analysis. Ten research papers are identified that show fake reviews, and they discuss currently available solutions for predicting or detecting fake reviews. They also show the distribution of fake and truthful reviews through the analysis of sentiment. We summarize and compare previous studies related to fake reviews. We highlight the most significant challenges in the sentiment evaluation process and demonstrate that there is a significant impact on sentiment scores used to identify fake feedback.

translated by 谷歌翻译

Can Machine Learning Tools Support the Identification of Sustainable Design Leads From Product Reviews? Opportunities and Challenges

Michael Saidani , Harrison Kim , Bernard Yannou

分类：机器学习 | 人工智能

2021-12-17

在线发布的产品评论数量越来越多的是设计师的金矿，通过捕捉客户的声音，并相应地改善这些产品，了解他们开发的产品。与此同时，产品设计和开发在创造更可持续的未来方面具有重要作用。随着自然语言处理领域的人工智能技术最近，该研究旨在开发一体化机器学习解决方案，以便自动从线产品评论获得可持续设计的洞察。在本文中，讨论了，说明了现有框架 - 包括Python库，软件包以及伯爵等最先进的算法的机会和挑战。这一贡献讨论了达成的机会和建立机器学习管道的挑战，以便从产品审查中获取有限性，以设计更可持续的产品，包括五个阶段，包括与解释的可持续性相关的审查可持续设计引导：数据收集，数据格式，模型培训，模型评估和模型部署。给出了可持续设计见解的例子，可提供退出产品审查采矿和加工。最后，提供了用于该领域未来研究的有希望的线，包括与可持续替代品的平行标准产品的案例研究，以比较客户价值的特征，并在优质的可持续设计引线中产生。

translated by 谷歌翻译

Using Online Customer Reviews to Classify, Predict, and Learn about Domestic Robot Failures

Shanee Honig , Alon Bartal , Yisrael Parmet , Tal Oron-Gilad

分类：机器人

2022-01-10

关于哪些类型的故障机器人在家庭环境中以及这些失败如何影响客户体验时，存在知识差距。我们在亚马逊上分类了10,072个客户评论，通过它们中描述的机器人失败，将故障分组为十二种类型和三类（技术，互动和服务）。我们确定了先前忽略了文献中忽视的失败的来源和类型，将它们结合到更新的失败分类。我们分析了他们的频率和关系与客户明星评级。结果表明，对于功利主义国内机器人来说，技术故障比互动或服务失败更有害。常常报告任务完成和鲁棒性和恢复力的问题，并具有最大的负面影响。未来的预防和反应战略应解决机器人的技术能力，以满足功能目标，运作和保持结构完整性随着时间的推移。可用性和互动设计对客户体验不利，表明客户可能更宽容影响机器人和实际用途的影响的失败。此外，我们开发了一种能够预测客户审查是否包含描述故障的内容以及它描述的故障类型的自然语言处理模型。借鉴了这些知识，机器人系统的设计者和研究人员可以优先考虑设计和开发努力实现基本问题。

translated by 谷歌翻译

Exploring the Distribution Regularities of User Attention and Sentiment toward Product Aspects in Online Reviews

Chenglei Qin , Chengzhi Zhang , Yi Bu

分类：自然语言处理

2022-09-08

[目的]更好地了解在线评论，并帮助潜在的消费者，商人和产品制造商有效地获得用户对产品方面的评估，本文从在线评论的时间角度来探讨了用户关注和对产品方面的情感分布规律性。 [设计/方法/方法]在线评论的时间特征（购买时间和审核时间之间的购买时间，审核时间和时间间隔），类似的属性聚类以及属性级别的情感计算技术是基于340k智能手机评论来使用的在JD.com（中国著名的在线购物平台）的三种产品中，探讨了本文中用户对产品方面的关注和情感的分布规律。 [调查结果]经验结果表明，幂律分布可以符合用户对产品方面的关注，并且在短时间间隔发布的评论包含更多产品方面。此外，结果表明，在短时间间隔内，产品方面的用户情感值显着更高/较低，这有助于判断产品的优势和弱点。 [研究局限性]本文无法获得更多具有时间特征的产品的在线评论，以验证发现，因为对购物平台的评论的限制限制了。 [原创性/价值]这项工作揭示了用户对产品方面的关注和情感的分布规律，这在协助决策，优化审查演示和改善购物体验方面具有重要意义。

translated by 谷歌翻译

Mining and summarizing customer reviews

分类：

Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in hundreds or even thousands. This makes it difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. For the manufacturer, there are additional difficulties because many merchant sites may sell the same product and the manufacturer normally produces many kinds of products. In this research, we aim to mine and to summarize all the customer reviews of a product. This summarization task is different from traditional text summarization because we only mine the features of the product on which the customers have expressed their opinions and whether the opinions are positive or negative. We do not summarize the reviews by selecting a subset or rewrite some of the original sentences from the reviews to capture the main points as in the classic text summarization. Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using reviews of a number of products sold online demonstrate the effectiveness of the techniques.

translated by 谷歌翻译

Utilizing Textual Reviews in Latent Factor Models for Recommender Systems

Tatev Karen Aslanyan , Flavius Frasincar

分类：机器学习 | (统计)机器学习

2021-11-16

大多数现有推荐系统仅基于评级数据，并且他们忽略了可能会增加建议质量的其他信息来源，例如文本评论或用户和项目特征。此外，这些系统的大多数仅适用于小型数据集（数千个观察）并且无法处理大型数据集（具有数百万观察结果）。我们提出了一种推荐人算法，该算法将评级建模技术（即潜在因子模型）与基于文本评论（即潜在Dirichlet分配）的主题建模方法组合，并且我们扩展了算法，使其允许添加额外的用户和项目 - 对系统的特定信息。我们使用具有不同大小的Amazon.com数据集来评估算法的性能，对应于23个产品类别。将建筑模型与四种其他型号进行比较后，我们发现将患有评级的文本评语相结合，导致更好的建议。此外，我们发现为模型添加额外的用户和项目功能会提高其预测精度，这对于中型和大数据集尤其如此。

translated by 谷歌翻译

Analyzing Folktales of Different Regions Using Topic Modeling and Clustering

Jacob Werzinsky , Zhiyan Zhong , Xuedan Zou

分类：自然语言处理

2022-06-09

本文采用了两种主要的自然语言处理技术，即主题建模和聚类，以在民间故事中找到模式，并揭示区域之间的文化关系。特别是，我们使用潜在的Dirichlet分配和伯托式分配来提取反复出现的元素，以及将K-均值聚类提取到集体民间故事。我们的论文试图回答这个问题，民间故事之间有什么相似之处，以及他们对文化的看法。在这里，我们表明民间故事之间的共同趋势是家庭，食物，传统的性别角色，神话人物和动物。此外，民间主题的主题是根据地理位置而有所不同的，在具有不同动物和环境的不同地区发现的民间故事。我们并不感到惊讶的是，宗教人物和动物是所有文化中的一些共同主题。但是，我们感到惊讶的是，欧洲和亚洲的民间故事经常结合在一起。我们的结果表明，世界各地文化中某些元素的流行率。我们预计我们的工作将成为对民间故事的未来研究的资源，也是使用自然语言处理来分析特定领域中文档的一个例子。此外，由于我们仅根据文件进行分析，因此可以在分析这些民间故事的结构，情感和特征方面做更多的工作。

translated by 谷歌翻译

Investigation of Topic Modelling Methods for Understanding the Reports of the Mining Projects in Queensland

Yasuko Okamoto , Thirunavukarasu Balasubramaniam , Richi Nayak

分类：机器学习

2021-11-05

在矿业行业中，在项目管理过程中产生了许多报告。这些过去的文件是未来成功的知识资源。但是，如果文件未经组织和非结构化，则可以是一个繁琐而挑战的任务是检索必要的信息。文档聚类是一种强大的方法来应对问题，并且在过去的研究中介绍了许多方法。尽管如此，没有银弹可以对任何类型的文件表现最佳。因此，需要探索性研究来应用新数据集的聚类方法。在本研究中，我们将研究多个主题建模（TM）方法。目标是使用昆士兰，资源部，昆士兰州政府部的地质调查的数据集找到采矿项目报告的适当方法，并了解内容，以了解如何组织它们。三种TM方法，潜在的Dirichlet分配（LDA），非负矩阵分解（NMF）和非负张量分解（NTF）在统计和定性地比较。评估后，我们得出结论，LDA对数据集执行最佳;然而，可以通过一些改进来采用其他方法的可能性仍然存在。

translated by 谷歌翻译

Natural Language Processing in Customer Service: A Systematic Review

Malak Mashaabi , Areej Alotaibi , Hala Qudaih , Raghad Alnashwan , Hend Al-Khalifa

分类：自然语言处理 | 人工智能

2022-12-16

Artificial intelligence and natural language processing (NLP) are increasingly being used in customer service to interact with users and answer their questions. The goal of this systematic review is to examine existing research on the use of NLP technology in customer service, including the research domain, applications, datasets used, and evaluation methods. The review also looks at the future direction of the field and any significant limitations. The review covers the time period from 2015 to 2022 and includes papers from five major scientific databases. Chatbots and question-answering systems were found to be used in 10 main fields, with the most common use in general, social networking, and e-commerce areas. Twitter was the second most commonly used dataset, with most research also using their own original datasets. Accuracy, precision, recall, and F1 were the most common evaluation methods. Future work aims to improve the performance and understanding of user behavior and emotions, and address limitations such as the volume, diversity, and quality of datasets. This review includes research on different spoken languages and models and techniques.

translated by 谷歌翻译

Intent Recognition in Conversational Recommender Systems

Sahar Moradizeyveh

分类：自然语言处理 | 机器学习

2022-12-06

Any organization needs to improve their products, services, and processes. In this context, engaging with customers and understanding their journey is essential. Organizations have leveraged various techniques and technologies to support customer engagement, from call centres to chatbots and virtual agents. Recently, these systems have used Machine Learning (ML) and Natural Language Processing (NLP) to analyze large volumes of customer feedback and engagement data. The goal is to understand customers in context and provide meaningful answers across various channels. Despite multiple advances in Conversational Artificial Intelligence (AI) and Recommender Systems (RS), it is still challenging to understand the intent behind customer questions during the customer journey. To address this challenge, in this paper, we study and analyze the recent work in Conversational Recommender Systems (CRS) in general and, more specifically, in chatbot-based CRS. We introduce a pipeline to contextualize the input utterances in conversations. We then take the next step towards leveraging reverse feature engineering to link the contextualized input and learning model to support intent recognition. Since performance evaluation is achieved based on different ML models, we use transformer base models to evaluate the proposed approach using a labelled dialogue dataset (MSDialogue) of question-answering interactions between information seekers and answer providers.

translated by 谷歌翻译

Intellectual Property Evaluation Utilizing Machine Learning

Jinxin Ding , Yuxin Huang , Keyang Ni , Xueyao Wang , Yinxiao Wang , Yucheng Wang

分类：人工智能

2022-08-18

智力特性在经济发展中越来越重要。为了通过IP评估中的传统方法来解决疼痛点，我们正在以机器学习为核心开发一项新技术。我们已经建立了一个在线平台，并将在大湾地区扩展我们的业务。

translated by 谷歌翻译

"I think this is the most disruptive technology": Exploring Sentiments of ChatGPT Early Adopters using Twitter Data

Mubin Ul Haque , Isuru Dharmadasa , Zarrin Tasnim Sworna , Roshan Namal Rajapakse , Hussain Ahmad

分类：自然语言处理

2022-12-12

Large language models have recently attracted significant attention due to their impressive performance on a variety of tasks. ChatGPT developed by OpenAI is one such implementation of a large, pre-trained language model that has gained immense popularity among early adopters, where certain users go to the extent of characterizing it as a disruptive technology in many domains. Understanding such early adopters' sentiments is important because it can provide insights into the potential success or failure of the technology, as well as its strengths and weaknesses. In this paper, we conduct a mixed-method study using 10,732 tweets from early ChatGPT users. We first use topic modelling to identify the main topics and then perform an in-depth qualitative sentiment analysis of each topic. Our results show that the majority of the early adopters have expressed overwhelmingly positive sentiments related to topics such as Disruptions to software development, Entertainment and exercising creativity. Only a limited percentage of users expressed concerns about issues such as the potential for misuse of ChatGPT, especially regarding topics such as Impact on educational aspects. We discuss these findings by providing specific examples for each topic and then detail implications related to addressing these concerns for both researchers and users.

translated by 谷歌翻译

Urdu Speech and Text Based Sentiment Analyzer

Waqar Ahmad , Maryam Edalati

分类：自然语言处理

2022-07-19

发现别人认为是我们信息收集策略的关键方面。现在，人们可以积极利用信息技术来寻找和理解他人的想法，这要归功于越来越多的意见资源（例如在线评论网站和个人博客）的越来越多。由于其在理解人们的意见方面的关键功能，因此情感分析（SA）是一项至关重要的任务。另一方面，现有的研究主要集中在英语上，只有少量研究专门研究低资源语言。对于情感分析，这项工作根据用户评估提供了一个新的多级乌尔都语数据集。高音扬声器网站用于获取乌尔都语数据集。我们提出的数据集包括10,000项评论，这些评论已被人类专家精心归类为两类：正面，负面。这项研究的主要目的是构建一个手动注释的数据集进行乌尔都语情绪分析，并确定基线结果。采用了五种不同的词典和规则的算法，包括NaiveBayes，Stanza，TextBlob，Vader和Flair，实验结果表明，其精度为70％的天赋优于其他经过测试的算法。

translated by 谷歌翻译

Justification of Recommender Systems Results: A Service-based Approach

Noemi Mauro , Zhongli Filippo Hu , Liliana Ardissono

分类：人工智能

2022-11-07

With the increasing demand for predictable and accountable Artificial Intelligence, the ability to explain or justify recommender systems results by specifying how items are suggested, or why they are relevant, has become a primary goal. However, current models do not explicitly represent the services and actors that the user might encounter during the overall interaction with an item, from its selection to its usage. Thus, they cannot assess their impact on the user's experience. To address this issue, we propose a novel justification approach that uses service models to (i) extract experience data from reviews concerning all the stages of interaction with items, at different granularity levels, and (ii) organize the justification of recommendations around those stages. In a user study, we compared our approach with baselines reflecting the state of the art in the justification of recommender systems results. The participants evaluated the Perceived User Awareness Support provided by our service-based justification models higher than the one offered by the baselines. Moreover, our models received higher Interface Adequacy and Satisfaction evaluations by users having different levels of Curiosity or low Need for Cognition (NfC). Differently, high NfC participants preferred a direct inspection of item reviews. These findings encourage the adoption of service models to justify recommender systems results but suggest the investigation of personalization strategies to suit diverse interaction needs.

translated by 谷歌翻译

Confounds and Overestimations in Fake Review Detection: Experimentally Controlling for Product-Ownership and Data-Origin

Felix Soldner , Bennett Kleinberg , Shane Johnson

分类：自然语言处理

2021-10-28

The popularity of online shopping is steadily increasing. At the same time, fake product reviewsare published widely and have the potential to affect consumer purchasing behavior. In response,previous work has developed automated methods for the detection of deceptive product reviews.However, studies vary considerably in terms of classification performance, and many use data thatcontain potential confounds, which makes it difficult to determine their validity. Two possibleconfounds are data-origin (i.e., the dataset is composed of more than one source) and productownership (i.e., reviews written by individuals who own or do not own the reviewed product). Inthe present study, we investigate the effect of both confounds for fake review detection. Using anexperimental design, we manipulate data-origin, product ownership, review polarity, and veracity.Supervised learning analysis suggests that review veracity (60.26 - 69.87%) is somewhat detectablebut reviews additionally confounded with product-ownership (66.19 - 74.17%), or with data-origin(84.44 - 86.94%) are easier to classify. Review veracity is most easily classified if confounded withproduct-ownership and data-origin combined (87.78 - 88.12%), suggesting overestimations of thetrue performance in other work. These findings are moderated by review polarity.

translated by 谷歌翻译

Happy or grumpy? A Machine Learning Approach to Analyze the Sentiment of Airline Passengers' Tweets

Shengyang Wu , Yi Gao

分类：自然语言处理

2022-09-28

作为最广泛的社交网络服务之一，Twitter截至2022年，Twitter拥有超过3亿个活跃用户。在其许多功能中，Twitter现在是消费者分享他们对产品或体验的看法的首选平台之一商业航空公司提供的服务。这项研究旨在通过分析使用机器学习方法提及航空公司的推文的观点来衡量客户满意度。从Twitter的API检索相关推文，并通过令牌化和矢量化处理。之后，这些处理后的向量被传递到预训练的机器学习分类器中以预测情感。除了情感分析外，我们还对收集的推文执行词汇分析，以模拟关键字的频率，这些频率提供了有意义的上下文以促进情感的解释。然后，我们应用时间序列方法，例如鲍林（Bollinger）频段来检测情绪数据中的异常。使用从2022年1月到7月的历史记录，我们的方法被证明能够捕捉乘客情绪的突然变化。这项研究有可能发展为可以帮助航空公司以及其他几家面向客户的企业的应用程序，有效地检测到客户情绪的突然变化，并采取足够的措施来抵消他们。

translated by 谷歌翻译

A Survey of Recommender System Techniques and the Ecommerce Domain

Imran Hossain , Md Aminul Haque Palash , Anika Tabassum Sejuty , Noor A Tanjim , MD Abdullah AL Nasim , Sarwar Saif , Abu Bokor Suraj

分类：人工智能

2022-08-15

在这个大数据时代，当前一代很难从在线平台中包含的大量数据中找到正确的数据。在这种情况下，需要一个信息过滤系统，可以帮助他们找到所需的信息。近年来，出现了一个称为推荐系统的研究领域。推荐人变得重要，因为他们拥有许多现实生活应用。本文回顾了推荐系统在电子商务，电子商务，电子资源，电子政务，电子学习和电子生活中的不同技术和发展。通过分析有关该主题的最新工作，我们将能够详细概述当前的发展，并确定建议系统中的现有困难。最终结果为从业者和研究人员提供了对建议系统及其应用的必要指导和见解。

translated by 谷歌翻译