Early prediction of movie box office success based on Wikipedia activity big data.

Use of socially generated "big data" to access information about collective states of the minds in human societies has become a new paradigm in the emerging field of computational social science. A natural application of this would be the prediction of the society's reaction to a new...

Full description

Saved in:
Bibliographic Details
Main Authors: Márton Mestyán, Taha Yasseri, János Kertész
Format: article
Language:EN
Published: Public Library of Science (PLoS) 2013
Subjects:
R
Q
Online Access:https://doaj.org/article/cd37cbf5cadb43a686e840404d7c5eff
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Use of socially generated "big data" to access information about collective states of the minds in human societies has become a new paradigm in the emerging field of computational social science. A natural application of this would be the prediction of the society's reaction to a new product in the sense of popularity and adoption rate. However, bridging the gap between "real time monitoring" and "early predicting" remains a big challenge. Here we report on an endeavor to build a minimalistic predictive model for the financial success of movies based on collective activity data of online users. We show that the popularity of a movie can be predicted much before its release by measuring and analyzing the activity level of editors and viewers of the corresponding entry to the movie in Wikipedia, the well-known online encyclopedia.