Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cl 95% Match Benchmark Paper AI researchers,NLP practitioners,Market researchers,Data scientists,Product managers 2 weeks ago

Can Large Language Models be Effective Online Opinion Miners?

large-language-models › evaluation
📄 Abstract

Abstract: The surge of user-generated online content presents a wealth of insights into customer preferences and market trends. However, the highly diverse, complex, and context-rich nature of such contents poses significant challenges to traditional opinion mining approaches. To address this, we introduce Online Opinion Mining Benchmark (OOMB), a novel dataset and evaluation protocol designed to assess the ability of large language models (LLMs) to mine opinions effectively from diverse and intricate online environments. OOMB provides extensive (entity, feature, opinion) tuple annotations and a comprehensive opinion-centric summary that highlights key opinion topics within each content, thereby enabling the evaluation of both the extractive and abstractive capabilities of models. Through our proposed benchmark, we conduct a comprehensive analysis of which aspects remain challenging and where LLMs exhibit adaptability, to explore whether they can effectively serve as opinion miners in realistic online scenarios. This study lays the foundation for LLM-based opinion mining and discusses directions for future research in this field.
Authors (4)
Ryang Heo
Yongsik Seo
Junseong Lee
Dongha Lee
Submitted
May 21, 2025
arXiv Category
cs.CL
arXiv PDF

Key Contributions

Introduces the Online Opinion Mining Benchmark (OOMB), a novel dataset and evaluation protocol designed to assess LLMs' ability to mine opinions from complex online content. OOMB includes detailed annotations for (entity, feature, opinion) tuples and opinion summaries, enabling evaluation of both extractive and abstractive LLM capabilities.

Business Value

Provides businesses with a powerful tool to understand customer sentiment and market dynamics from vast amounts of online data, enabling better product development and marketing strategies.