API Plan Tiers

The following API plan tiers are available:

| API Plan Tier | API Credits | Price |
| --- | --- | --- |
| Free Trial | 10 / month | Free |
| Pro | 100 / month | $99 / month |
| Scale | 660 / month | $330 / month |
| Enterprise | Custom | Minimum $10,000 / year |

Please note that the video_translate endpoint can only be used by Scale Plan and Enterprise Plan customers.

What is an API Credit?

HeyGen API Credits are consumed when creating videos, translating videos, or creating active streaming sessions with Interactive Avatars. API credits are consumed at different rates, depending on which endpoint of the HeyGen API is being used:

| API functionality | API Credit cost |
| --- | --- |
| video/generate endpoint footage featuring a Photo Avatar | 1 API credit = 5 minutes |
| video/generate endpoint footage featuring a Video Avatar | 1 API credit = 2 minutes |
| video_translate endpoint footage | 1 API credit = 20 seconds |
| Active streaming API sessions | 1 API credit = 10 minutes of active streaming |
| photo_avatar/photo/generate | 0.25 API credits per call |
| photo_avatar/look/generate | 0.25 API credits per call |
| photo_avatar/train | 4 API credits per call |
| photo_avatar/add_motion | 1 API credit per call |
| photo_avatar/add_sound_effect | Free |
| photo_avatar/upscale | 0.5 API credits per call |
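As a rough illustration of the rates above, the following sketch estimates credit consumption. The dictionary keys for the duration-metered features and both function names are hypothetical labels, not part of the HeyGen API. The sketch also assumes linear (pro-rata) billing, since the text does not say whether partial credits are rounded.

```python
# Seconds of output covered by one API credit (from the rates above).
# The keys are illustrative labels, not real API identifiers.
SECONDS_PER_CREDIT = {
    "video_generate_photo_avatar": 5 * 60,
    "video_generate_video_avatar": 2 * 60,
    "video_translate": 20,
    "streaming": 10 * 60,
}

# Flat cost in credits per call (from the rates above).
CREDITS_PER_CALL = {
    "photo_avatar/photo/generate": 0.25,
    "photo_avatar/look/generate": 0.25,
    "photo_avatar/train": 4,
    "photo_avatar/add_motion": 1,
    "photo_avatar/add_sound_effect": 0,
    "photo_avatar/upscale": 0.5,
}


def duration_credits(kind: str, seconds: float) -> float:
    """Credits for duration-metered usage, ASSUMING linear (pro-rata) billing."""
    return seconds / SECONDS_PER_CREDIT[kind]


def call_credits(endpoint: str, calls: int = 1) -> float:
    """Credits for endpoints that are billed per call."""
    return CREDITS_PER_CALL[endpoint] * calls


print(duration_credits("video_translate", 60))              # 3.0
print(duration_credits("video_generate_video_avatar", 90))  # 0.75
print(call_credits("photo_avatar/train", 2))                # 8
```

Under these assumptions, translating one minute of video costs three times as many credits as ten minutes of active streaming.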

You can review your API Credit consumption in the Usage History section of the API tab in your Space Settings.

Video Length and Resolution Limits

The length of videos that can be translated or generated via API is limited by Tier:

| API Plan Tier | Video Length Limit | Video Resolution Limit |
| --- | --- | --- |
| Free Trial | 3 minutes | 720p |
| Pro | 5 minutes | 1080p |
| Scale | 30 minutes | 4K |
| Enterprise | Custom | 4K |
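A client could check a planned video against these tier limits before submitting it. The sketch below is one way to do that. The names `TierLimit`, `TIER_LIMITS`, and `check_tier` are hypothetical, 4K is assumed to mean a frame height of 2160 pixels, and the video is assumed to be landscape so that height is the limiting side.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class TierLimit:
    max_seconds: Optional[int]  # None means a custom (negotiated) limit
    max_height_px: int          # vertical resolution limit


# Limits from the tier list above. ASSUMPTION: "4K" is treated as 2160 px high.
TIER_LIMITS = {
    "free": TierLimit(3 * 60, 720),
    "pro": TierLimit(5 * 60, 1080),
    "scale": TierLimit(30 * 60, 2160),
    "enterprise": TierLimit(None, 2160),
}


def check_tier(tier: str, seconds: float, height_px: int) -> List[str]:
    """Return a list of limit violations; an empty list means the video fits."""
    limit = TIER_LIMITS[tier]
    problems = []
    if limit.max_seconds is not None and seconds > limit.max_seconds:
        problems.append(f"length {seconds}s exceeds {limit.max_seconds}s")
    if height_px > limit.max_height_px:
        problems.append(f"height {height_px}px exceeds {limit.max_height_px}px")
    return problems


print(check_tier("pro", 240, 1080))  # []
print(check_tier("free", 200, 720))  # ['length 200s exceeds 180s']
```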

Video/Generate Endpoint Limits

Resources

HeyGen limits the size and format of resources, such as images, audio, or video, that can be uploaded to the Video/Generate endpoint. Invalid resources may cause render errors.

| Resource Type | Supported Formats | Max File Size | Resolution |
| --- | --- | --- | --- |
| Video | MP4 | 100 MB | < 2K |
| Image | JPG, PNG | 50 MB | < 2K |
| Audio | WAV, MP3 | 50 MB | N/A |

Please ensure:

  • Your resource URL is public and accessible to anyone.
  • The provided asset is in one of the supported formats.
  • The asset's file extension matches its actual format.
  • The asset file is not corrupted or malformed.
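The format and size limits above lend themselves to a local pre-flight check before a resource URL is submitted. The following is a minimal sketch under stated assumptions: `check_resource` and `RESOURCE_LIMITS` are hypothetical names, "MB" is taken to mean 1024 × 1024 bytes (the text does not say), and `.jpeg` is accepted as an alias for JPG. It checks only the file extension and size, so it cannot detect a corrupted file or an extension that does not match the actual content.

```python
import os
from typing import List

MB = 1024 * 1024  # ASSUMPTION: the unit is not defined in the text above

# Allowed extensions and maximum size per resource type, from the limits above.
# ".jpeg" is an assumed alias for JPG.
RESOURCE_LIMITS = {
    "video": ({".mp4"}, 100 * MB),
    "image": ({".jpg", ".jpeg", ".png"}, 50 * MB),
    "audio": ({".wav", ".mp3"}, 50 * MB),
}


def check_resource(kind: str, filename: str, size_bytes: int) -> List[str]:
    """Return a list of problems; an empty list means the basic checks pass."""
    extensions, max_bytes = RESOURCE_LIMITS[kind]
    problems = []
    ext = os.path.splitext(filename)[1].lower()
    if ext not in extensions:
        problems.append(f"unsupported {kind} format: {ext or '(none)'}")
    if size_bytes > max_bytes:
        problems.append(f"{kind} larger than {max_bytes // MB} MB")
    return problems


print(check_resource("video", "intro.MP4", 80 * MB))  # []
print(check_resource("image", "logo.gif", 60 * MB))
# ['unsupported image format: .gif', 'image larger than 50 MB']
```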

Avatar Input

Avatar input can be either text or audio:

  • The input text (the text the avatar speaks) should be fewer than 5,000 characters.
  • The duration of audio input for avatars should not exceed 10 minutes (600 seconds).

Invalid inputs will result in video render failures.
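Because invalid inputs only surface as render failures, it can help to validate them on the client first. This sketch uses the hypothetical name `check_avatar_input`, takes the audio limit as the 10 minutes stated above, and assumes characters are counted the way Python's `len` counts them, which may differ from how the service counts.

```python
from typing import List, Optional

MAX_TEXT_CHARS = 5000        # input text must be shorter than this
MAX_AUDIO_SECONDS = 10 * 60  # audio must not exceed 10 minutes


def check_avatar_input(text: Optional[str] = None,
                       audio_seconds: Optional[float] = None) -> List[str]:
    """Validate exactly one kind of avatar input; return a list of problems."""
    if (text is None) == (audio_seconds is None):
        raise ValueError("provide exactly one of text or audio_seconds")
    problems = []
    # ASSUMPTION: len() approximates the service's character count.
    if text is not None and len(text) >= MAX_TEXT_CHARS:
        problems.append(f"text has {len(text)} characters, limit is under {MAX_TEXT_CHARS}")
    if audio_seconds is not None and audio_seconds > MAX_AUDIO_SECONDS:
        problems.append(f"audio is {audio_seconds}s, limit is {MAX_AUDIO_SECONDS}s")
    return problems


print(check_avatar_input(text="Hello there"))  # []
print(check_avatar_input(audio_seconds=601))   # ['audio is 601s, limit is 600s']
```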

Frame Rate & Dimension

  • The frame rate of video output containing avatars is 25fps.
  • The width and height given in the dimension parameter must each be greater than 128 and less than 4096.
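The dimension bounds are exclusive at both ends, which is easy to get wrong. A minimal check, using the hypothetical helper name `valid_dimension`:

```python
def valid_dimension(width: int, height: int) -> bool:
    """True if both sides are strictly between 128 and 4096 pixels."""
    return all(128 < side < 4096 for side in (width, height))


print(valid_dimension(1280, 720))   # True
print(valid_dimension(128, 720))    # False: 128 is not greater than 128
print(valid_dimension(4096, 2160))  # False: 4096 is not less than 4096
```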

Scenes

Streaming endpoint

Repeat Task

  • Up to 1,000 characters at a time can be sent to the Interactive Avatar to repeat text.
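Longer scripts therefore need to be split before they are sent. The sketch below shows one possible approach: it packs whole words into chunks of at most 1,000 characters and hard-splits any single word that is longer than the limit. The name `chunk_text` is hypothetical, the sketch collapses runs of whitespace into single spaces, and sending each chunk to the streaming endpoint is left out.

```python
from typing import List


def chunk_text(text: str, limit: int = 1000) -> List[str]:
    """Split text into chunks of at most `limit` characters, on word boundaries."""
    chunks: List[str] = []
    current = ""
    for word in text.split():
        # A single word longer than the limit is cut into limit-sized pieces.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


print(chunk_text("a b c", limit=3))  # ['a b', 'c']
```

Splitting on sentence boundaries instead of word boundaries would likely sound more natural when spoken, at the cost of a slightly more involved splitter.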