Overview
Media Insights on AWS is a development framework that incorporates AWS machine learning services into multimedia applications for processing videos, images, audio, and text. The solution provides purpose-built APIs that allow developers to plug into these services and build innovative media workflows without handling backend complexities. It automatically streamlines resource provisioning, storage, transcoding, annotations, model optimizations, and other orchestration tasks to simplify workflow creation. Using this AWS Solution, you can leverage computer vision, speech recognition, language analysis, and other artificial intelligence (AI) services to empower rapid prototyping and intelligent media software deployment.
Benefits
This solution manages workflow orchestration and data persistence so that you can focus on applications that extract value from media or automate manual workflows.
You can extend and customize the solution to fit new use cases. Operators are generated, pre-built state machines that you can extend to handle specific use cases.
Components are described by clean interfaces. Operators are small single purpose components that transform or extract metadata from media. You can define custom operators or use any of the included pre-built operators.
Technical details
You can automatically deploy this architecture using the implementation guide and the accompanying AWS CloudFormation template.
Step 1
An Amazon API Gateway resource for the control plane REST API.
Step 2
AWS Lambda and Amazon Simple Queue Service (Amazon SQS) resources to support workflow orchestration and translating user-defined workflows into AWS Step Functions.
Step 3
Amazon DynamoDB tables to store workflow-related data.
Step 4
Step Functions for when a user defines a new workflow using the workflow API.
Step 5
Lambda functions for the Media Insights on AWS operator library. Operators are provisioned for the following services: Amazon Rekognition, Amazon Comprehend, Amazon Translate, Amazon Transcribe, Amazon Polly, and AWS Elemental MediaConvert.
Step 6
An API Gateway resource for the data plane REST API.
Step 7
Amazon Simple Storage Service (Amazon S3), DynamoDB, and DynamoDB Streams for media and metadata data storage.
Step 8
Amazon Kinesis Data Streams resources to provide an interface for external applications to access data in the Media Insights on AWS data plane.
- Publish Date
Related content
This blog explores the Media Insights Engine solution which aims to help developers focus less on the scaffolding and more on the things that make their applications impactful.
Learn about the modular framework the Media Insights Engine is built on, as well as use cases and practical applications for this solution.