NOTICE: This software (or technical data) was produced for the U.S. Government under contract, and is subject to the Rights in Data-General Clause 52.227-14, Alt. IV (DEC 2007). Copyright 2023 The MITRE Corporation. All Rights Reserved.
Overview
There are numerous video and image exploitation capabilities available today. The Open Media Processing Framework (OpenMPF) provides a framework for chaining, combining, or replacing individual components for the purpose of experimentation and comparison.
OpenMPF is a non-proprietary, scalable framework that permits practitioners and researchers to construct video, imagery, and audio exploitation capabilities using the available third-party components. Using OpenMPF, one can extract targeted entities in large-scale data environments, such as face and object detection.
For those developing new exploitation capabilities, OpenMPF exposes a set of Application Program Interfaces (APIs) for extending media analytics functionality. The APIs allow integrators to introduce new algorithms capable of detecting new targeted entity types. For example, a backpack detection algorithm could be integrated into an OpenMPF instance. OpenMPF does not restrict the number of algorithms that can operate on a given media file, permitting researchers, practitioners, and developers to explore arbitrarily complex composites of exploitation algorithms.
A list of algorithms currently integrated into the OpenMPF as distributed processing components is shown here:
Operation | Object Type | Framework |
---|---|---|
Detection/Tracking | Face | LBP-Based OpenCV |
Detection/Tracking | Motion | MOG w/ STRUCK |
Detection/Tracking | Motion | SuBSENSE w/ STRUCK |
Detection/Tracking | License Plate | OpenALPR |
Detection | Speech | Sphinx |
Detection | Speech | Azure Cognitive Services Batch Transcription API |
Detection | Scene | OpenCV |
Detection | Classification | OpenCV DNN (GoogLeNet, Yahoo NSFW, vehicle color) |
Detection/Tracking | Classification | OpenCV DNN (YOLO) |
Detection/Tracking | Classification/Features | TensorRT (COCO classes) |
Detection | Text Region | EAST |
Detection | Text (OCR) | Apache Tika |
Detection | Text (OCR) | Tesseract OCR |
Detection | Text (OCR) | Azure Cognitive Services Computer Vision API (OCR endpoint) |
Detection | Text (OCR) | Azure Cognitive Services Read API |
Detection | Form Structure (with OCR) | Azure Cognitive Services Form Recognizer API |
Detection | Keywords | Boost Regular Expressions |
Detection | Image (from document) | Apache Tika |
Translation | Language | Azure Cognitive Services Translate API |
The OpenMPF exposes data processing and job management web services via a User Interface (UI). These services allow users to upload media, create media processing jobs, determine the status of jobs, and retrieve the artifacts associated with completed jobs. The web services give application developers flexibility to use the OpenMPF in their preferred environment and programming language.