FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning
HTTP adaptive video streaming is a widespread and sought-after technology on the Internet that allows clients to dynamically switch between different stream qualities presented in the bitrate ladder to optimize overall received video quality. Currently, there exist several approaches of different co...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
FRUCT
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/2c4ff9718dd84364864e573ee8909ef5 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:2c4ff9718dd84364864e573ee8909ef5 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:2c4ff9718dd84364864e573ee8909ef52021-11-20T15:59:33ZFAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning2305-72542343-073710.23919/FRUCT53335.2021.9599963https://doaj.org/article/2c4ff9718dd84364864e573ee8909ef52021-10-01T00:00:00Zhttps://www.fruct.org/publications/fruct30/files/Zab.pdfhttps://doaj.org/toc/2305-7254https://doaj.org/toc/2343-0737HTTP adaptive video streaming is a widespread and sought-after technology on the Internet that allows clients to dynamically switch between different stream qualities presented in the bitrate ladder to optimize overall received video quality. Currently, there exist several approaches of different complexity for building such a ladder. The simplest method is to use a static bitrate ladder, and the more complex one is to compute a per-title encoding ladder. The main drawback of these approaches is that they do not provide bitrate ladders for scenes with different visual complexity within the video. Moreover, most modern methods require additional computationally-intensive test encodings of the entire video to construct the convex hull, used to calculate the bitrate ladder. This paper proposes a new fast per-scene encoding approach called FAUST based on 1) quick entropy-based scene detection and 2) prediction of optimized bitrate ladder for each scene using an artificial neural network. The results show that our model reduces the mean absolute error to 0.15, the mean square error to 0.08, and the bitrate to 13.5% while increasing the difference in video multimethod assessment fusion to 5.6 points.Anatoliy ZabrovskiyPrateek AgrawalChristian TimmererRadu ProdanFRUCTarticleper-title encodingper-shot encodingscene detectionvideo entropyneural networksvideo encodingvideo transcodinghttp adaptive streamingmpeg-dashTelecommunicationTK5101-6720ENProceedings of the XXth Conference of Open Innovations Association FRUCT, Vol 30, Iss 1, Pp 292-304 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
per-title encoding per-shot encoding scene detection video entropy neural networks video encoding video transcoding http adaptive streaming mpeg-dash Telecommunication TK5101-6720 |
spellingShingle |
per-title encoding per-shot encoding scene detection video entropy neural networks video encoding video transcoding http adaptive streaming mpeg-dash Telecommunication TK5101-6720 Anatoliy Zabrovskiy Prateek Agrawal Christian Timmerer Radu Prodan FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning |
description |
HTTP adaptive video streaming is a widespread and sought-after technology on the Internet that allows clients to dynamically switch between different stream qualities presented in the bitrate ladder to optimize overall received video quality. Currently, there exist several approaches of different complexity for building such a ladder. The simplest method is to use a static bitrate ladder, and the more complex one is to compute a per-title encoding ladder. The main drawback of these approaches is that they do not provide bitrate ladders for scenes with different visual complexity within the video. Moreover, most modern methods require additional computationally-intensive test encodings of the entire video to construct the convex hull, used to calculate the bitrate ladder. This paper proposes a new fast per-scene encoding approach called FAUST based on 1) quick entropy-based scene detection and 2) prediction of optimized bitrate ladder for each scene using an artificial neural network. The results show that our model reduces the mean absolute error to 0.15, the mean square error to 0.08, and the bitrate to 13.5% while increasing the difference in video multimethod assessment fusion to 5.6 points. |
format |
article |
author |
Anatoliy Zabrovskiy Prateek Agrawal Christian Timmerer Radu Prodan |
author_facet |
Anatoliy Zabrovskiy Prateek Agrawal Christian Timmerer Radu Prodan |
author_sort |
Anatoliy Zabrovskiy |
title |
FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning |
title_short |
FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning |
title_full |
FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning |
title_fullStr |
FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning |
title_full_unstemmed |
FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning |
title_sort |
faust: fast per-scene encoding using entropy-based scene detection and machine learning |
publisher |
FRUCT |
publishDate |
2021 |
url |
https://doaj.org/article/2c4ff9718dd84364864e573ee8909ef5 |
work_keys_str_mv |
AT anatoliyzabrovskiy faustfastpersceneencodingusingentropybasedscenedetectionandmachinelearning AT prateekagrawal faustfastpersceneencodingusingentropybasedscenedetectionandmachinelearning AT christiantimmerer faustfastpersceneencodingusingentropybasedscenedetectionandmachinelearning AT raduprodan faustfastpersceneencodingusingentropybasedscenedetectionandmachinelearning |
_version_ |
1718419412725268480 |