Describe: Video Scene Information Detection Based on Entity Recognition