Evaluating the quality of social media data in big data architecture

Research output: Contribution to journalArticleScientificpeer-review

27 Citations (Scopus)

Abstract

The use of freely available online data is rapidly increasing, as companies have detected the possibilities and the value of these data in their businesses. In particular, data from social media are seen as interesting as they can, when properly treated, assist in achieving customer insight into business decision making. However, the unstructured and uncertain nature of this kind of big data presents a new kind of challenge: how to evaluate the quality of data and manage the value of data within a big data architecture? This paper contributes to addressing this challenge by introducing a new architectural solution to evaluate and manage the quality of social media data in each processing phase of the big data pipeline. The proposed solution improves business decision making by providing real-time, validated data for the user. The solution is validated with an industrial case example, in which the customer insight is extracted from social media data in order to determine the customer satisfaction regarding the quality of a product.
Original languageEnglish
Pages (from-to)2028-2043
JournalIEEE Access
Volume3
DOIs
Publication statusPublished - 2015
MoE publication typeA1 Journal article-refereed

Fingerprint

Industry
Decision making
Customer satisfaction
Pipelines
Big data
Processing

Keywords

  • architecture
  • big data
  • computer architecture
  • meta data
  • online services
  • social network services

Cite this

@article{c3e3ac5aadca4095ba116b22a0a126bf,
title = "Evaluating the quality of social media data in big data architecture",
abstract = "The use of freely available online data is rapidly increasing, as companies have detected the possibilities and the value of these data in their businesses. In particular, data from social media are seen as interesting as they can, when properly treated, assist in achieving customer insight into business decision making. However, the unstructured and uncertain nature of this kind of big data presents a new kind of challenge: how to evaluate the quality of data and manage the value of data within a big data architecture? This paper contributes to addressing this challenge by introducing a new architectural solution to evaluate and manage the quality of social media data in each processing phase of the big data pipeline. The proposed solution improves business decision making by providing real-time, validated data for the user. The solution is validated with an industrial case example, in which the customer insight is extracted from social media data in order to determine the customer satisfaction regarding the quality of a product.",
keywords = "architecture, big data, computer architecture, meta data, online services, social network services",
author = "Anne Immonen and Pekka P{\"a}{\"a}kk{\"o}nen and Eila Ovaska",
year = "2015",
doi = "10.1109/ACCESS.2015.2490723",
language = "English",
volume = "3",
pages = "2028--2043",
journal = "IEEE Access",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronic Engineers IEEE",

}

Evaluating the quality of social media data in big data architecture. / Immonen, Anne; Pääkkönen, Pekka; Ovaska, Eila.

In: IEEE Access, Vol. 3, 2015, p. 2028-2043.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Evaluating the quality of social media data in big data architecture

AU - Immonen, Anne

AU - Pääkkönen, Pekka

AU - Ovaska, Eila

PY - 2015

Y1 - 2015

N2 - The use of freely available online data is rapidly increasing, as companies have detected the possibilities and the value of these data in their businesses. In particular, data from social media are seen as interesting as they can, when properly treated, assist in achieving customer insight into business decision making. However, the unstructured and uncertain nature of this kind of big data presents a new kind of challenge: how to evaluate the quality of data and manage the value of data within a big data architecture? This paper contributes to addressing this challenge by introducing a new architectural solution to evaluate and manage the quality of social media data in each processing phase of the big data pipeline. The proposed solution improves business decision making by providing real-time, validated data for the user. The solution is validated with an industrial case example, in which the customer insight is extracted from social media data in order to determine the customer satisfaction regarding the quality of a product.

AB - The use of freely available online data is rapidly increasing, as companies have detected the possibilities and the value of these data in their businesses. In particular, data from social media are seen as interesting as they can, when properly treated, assist in achieving customer insight into business decision making. However, the unstructured and uncertain nature of this kind of big data presents a new kind of challenge: how to evaluate the quality of data and manage the value of data within a big data architecture? This paper contributes to addressing this challenge by introducing a new architectural solution to evaluate and manage the quality of social media data in each processing phase of the big data pipeline. The proposed solution improves business decision making by providing real-time, validated data for the user. The solution is validated with an industrial case example, in which the customer insight is extracted from social media data in order to determine the customer satisfaction regarding the quality of a product.

KW - architecture

KW - big data

KW - computer architecture

KW - meta data

KW - online services

KW - social network services

U2 - 10.1109/ACCESS.2015.2490723

DO - 10.1109/ACCESS.2015.2490723

M3 - Article

VL - 3

SP - 2028

EP - 2043

JO - IEEE Access

JF - IEEE Access

SN - 2169-3536

ER -