Open Access
MPAA Rating Prediction Based on Deep Learning
Caner    BALIM1*, Ugur    GUREL2
1Afyon Kocatepe University, Afyon, Turkey
2Eskisehir Osmangazi University, Eskisehir, Turkey
* Corresponding author: cbalim@aku.edu.tr

Presented at the 3rd International Symposium on Innovative Approaches in Scientific Studies (Engineering and Natural Sciences) (ISAS2019-ENS), Ankara, Turkey, Apr 19, 2019

SETSCI Conference Proceedings, 2019, 4, Page (s): 612-614 , https://doi.org/

Published Date: 01 June 2019    | 704     9

Abstract

The Motion Picture Association of America(MPAA) is used in the United States to rate a film's compatibility via its content. These systems evaluate ratings according to the scenes in the movies. In this work, a deep learning based classification technique is proposed to differentiate between MPAA ratings (G, PG, PG-13, R, NC-17 ) via subtitles. Movie subtitles which generally locate bottom of the screen to show character dialogs in movies or series. The syntax of each subtitle files are learned through Word2Vector. We then constructed a binary classifier based on the preceding representation dataset. We studied the performance of different classifiers with stratified ten-fold cross-validation. The model has been validated with experiments on English subtitle dataset. According to the experiment results, our proposed method was achieved more than 59\% accuracy rate.

Keywords - Film rating, Word2vec, Deep Learning, Machine Learning, Classification

References

[1] https://www.mpaa.org/

[2] T. Kim, J. Hong, and P. Kang. (2015). Box office forecasting using machine learning algorithms based on SNS data.

[3] K. Ozkan, O. N. Atak, and S. Işik. "Using movie posters for prediction of box-office revenue with deep learning approach." 2018 26th Signal Processing and Communications Applications Conference (SIU). IEEE, 2018.

[4] K. S. Sivaraman and G. Somappa (2016). MovieScope: Movie trailer classification using Deep Neural Networks.

[5] S. K. Jain, Movies Genres Classifier using Neural Network, in: Proceedings of the International Symposium on Computer and Information Sciences, 2009, pp. 610–615.

[6] https://www.imdb.com/

[7] https://www.opensubtitles.org

[8] M. F. Porter, (1980). An algorithm for suffix stripping. Program, 14(3), 130-137.

[9] J. R. Finkel, T. Grenager, and C. Manning. 2005. Incorporating Nonlocal Information into Information Extraction Systems by Gibbs Sampling. Proceedings of the 43nd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 363-370.

[10] T. Mikolov,, Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems (pp. 3111-3119).

[11] R. Rehurek and P. Sojka 2010. Software Framework for Topic Modelling with Large Corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pp. 45-50

SETSCI 2024
info@set-science.com
Copyright © 2024 SETECH
Tokat Technology Development Zone Gaziosmanpaşa University Taşlıçiftlik Campus, 60240 TOKAT-TÜRKİYE