Amazon Rekognition Video Moderation - image sampling solution

Amazon Rekognition is an AWS AI service that offers pre-trained APIs to support image and video moderation.

This repo contains two solutions to help you to implement video moderation using Rekognition image API. Both solutions offer a similar interface as the Rekognition Video moderation API, which can invoke asynchronizely and send moderation results to an SNS topic. So you can easily replace your logic to switch to Image moderation API.

You can deploy the two solutions to your AWS account using CDK by following the instruction.

Solution I: All-in-one Lambda function

The Lambda function manages the below logical steps in one place:

Download the video from the S3 bucket to the local disk
Sample images from the video based on a given interval using ffmpeg (default interval: 2 frames per second)
Store the images in S3 bucket in a temperate folder
Call Rekognition Image API to moderate the image frames
Consolidate the result to the Rekognition Video moderation API response format
Send the result to an SNS topic (if provided)

Lambda source code

Deployment logic with Lambda Layer

Invoke the Lambda function from your existing code in async mode as below:

import boto3, json
lambda_client = boto3.client("lambda")

lambda_client.invoke_async(
    FunctionName='TheLambdaFunctionName',
    InvokeArgs=json.dumps(
        {
          "s3_source_bucket": "MyS3Bucket",
          "s3_source_key": "path/to/video.mp4",
          "s3_target_bucket": "OPTIONAL_BUCKET_KEEP_TEMP_FILES", # Optional. if you don't have read access to source bucket
          "s3_target_folder": "temp/folder", # Optional. Temp folder keeps staging files, such as sampled images and moderation result
          "sample_frequency": 2, # Optional. numbers of images per second
          "min_confidence": 50, # Optional. Confidence threshold
          "sns_topic_arn": "arn:aws:sns:us-east-1:122702569249:cm-rek-video-sampling-topic" # Optional
        }
    )
)

This solution is light to deploy but only suitable for short-form videos. It may timeout if the video is too long, with a high resolution, and requires to sample the video in a high frequency. The max timeout setting for Lambda is 15 minutes.

Solution II: Step Functions + Lambda

This solution uses Step Functions state machine to orchestrate Lambda functions. It prevents the timeout issue could happen in the first single Lambda function solution, as the workflow will iterate through the sampled images and call a Lambda function one by one. It is ideal for use cases when you need to moderate large videos in a high frequency.

Invoke Step Functions state machine using Boto3

import boto3, json
sfn_client = boto3.client('stepfunctions')

sfn_client.start_execution(
    stateMachineArn='StepFunctionStateMachineArn',
    name='GiveItAName',
    input=json.dumps(
        {
          "s3_source_bucket": "MyS3Bucket",
          "s3_source_key": "path/to/video.mp4",
          "s3_target_bucket": "OPTIONAL_BUCKET_KEEP_TEMP_FILES", # Optional. if you don't have read access to source bucket
          "s3_target_folder": "temp/folder", # Optional. Temp folder keeps staging files, such as sampled images and moderation result
          "sample_frequency": 2, # Optional. numbers of images per second
          "min_confidence": 50, # Optional. Confidence threshold
          "sns_topic_arn": "arn:aws:sns:us-east-1:122702569249:cm-rek-video-sampling-topic" # Optional
        }
    ),
)

Step Functions source code

Lambda: capture images from video

Lambda: moderate image

Lambda: consolidation

Deployment logic

You can test the Lambda or the Step Functions solutions directly by sending the JSON payload to the Lambda and Step Functions state machine using the AWS console, CLI, or SDK.

Install environment dependencies and set up authentication

💡 You can skip this section if using CloudShell to deploy the CDK package or the other AWS services support bash command from the same AWS account (ex. Cloud9). This section is required if you run from a self-managed environment such as a local desktop.

Install Node.js https://nodejs.org/en/download/
Install Python 3.7+ https://www.python.org/downloads/
Install Git https://github.com/git-guides/install-git
Install Pip

python -m ensurepip --upgrade

Install Python Virtual Environment

pip install virtualenv

Setup the AWS CLI authentication

aws configure

Deploy the CDK package

Clone code

git clone https://github.com/lanazhang/aws-rekognition-video-moderation-image-sampling

cd aws-rekognition-video-moderation-image-sampling/

Install Node CDK package

sudo npm install -g aws-cdk

Create Python Virtual Environment

python3 -m venv .venv

Activate virtual environment

On MacOS or Linux

source .venv/bin/activate

On Windows

  .venv\Scripts\activate.bat

Once the virtualenv is activated, you can install the required dependencies.

pip install -r requirements.txt

Set up environment varaibles - change the values to your target AWS account Id and region.

export CDK_DEFAULT_ACCOUNT=[YOUR_ACCOUNT_ID]
export CDK_DEFAULT_REGION=[YOUR_TARGET_REGION]

Bootstrap CDK - this step will launch a CloudFormation stack to provision the CDK package, which will take ~2 minutes.

cdk bootstrap aws://${CDK_DEFAULT_ACCOUNT}/${CDK_DEFAULT_REGION}

Deploy CDK package - this step will launch both or one of the solutions, which will take ~5 minutes.

# Deloy both solutions
cdk deploy --all --requires-approval never

# Deloy the Lambda solution
cdk deploy LambdaAllInOneStack

# Deloy the Step Functions solution
cdk deploy StepFunctionWorkflowStack

Once the deployment completes, you can find the Lambda or the Step Functions names in the bash console. You can also find them in the CloudFormation console by checking the output in stack LambdaAllInOneStack and StepFunctionWorkflowStack.

Both the Lambda Function and the Step Functions State Machine names are in the pattern of rek-video-image-sampling-XXXX

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
iam_role		iam_role
lambda		lambda
static		static
stepfunctions		stepfunctions
tests		tests
video_sampling		video_sampling
.gitignore		.gitignore
README.md		README.md
app.py		app.py
cdk.json		cdk.json
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
source.bat		source.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon Rekognition Video Moderation - image sampling solution

Solution I: All-in-one Lambda function

Solution II: Step Functions + Lambda

Install environment dependencies and set up authentication

Deploy the CDK package

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Amazon Rekognition Video Moderation - image sampling solution

Solution I: All-in-one Lambda function

Solution II: Step Functions + Lambda

Install environment dependencies and set up authentication

Deploy the CDK package

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages