To establish a baseline you should, at a minimum, monitor the following metrics:
ActivitiesStarted
ActivitiesTimedOut
ExecutionsStarted
ExecutionsTimedOut
LambdaFunctionsStarted
LambdaFunctionsTimedOut
그리고 :
State Machine Metrics
The following metrics are available for Step Functions state machines:
Execution Metrics
The AWS/States namespace includes the following metrics for Step Functions executions:
Metric Description
ExecutionTime The interval, in milliseconds, between the time the execution starts and the time it closes.
ExecutionThrottled The number of StateEntered events in addition to retries.
ExecutionsAborted The number of aborted or terminated executions.
ExecutionsFailed The number of failed executions.
ExecutionsStarted The number of started executions.
ExecutionsSucceeded The number of successfully completed executions.
ExecutionsTimedOut The number of executions that time out for any reason.
Reference