m.heygen.com
robots.txt

Robots Exclusion Standard data for m.heygen.com

Resource Scan

Scan Details

Site Domain m.heygen.com
Base Domain heygen.com
Scan Status Ok
Last Scan 2025-12-15T22:11:55+00:00
Next Scan 2025-12-29T22:11:55+00:00

Last Scan

Scanned 2025-12-15T22:11:55+00:00
URL https://m.heygen.com/robots.txt
Domain IPs 216.150.1.1, 216.150.16.1
Response IP 216.150.1.193
Found Yes
Hash 0d2e7e87cce64c75678d0c7950b3fb4cbc6fe892468b16bbde79ff8d6455d875
SimHash 2248dd39edf3

Groups

twitterbot

Rule Path
Allow /

*

Rule Path
Allow /share
Allow /videos
Allow /embeds
Disallow /home
Disallow /login
Disallow /guest
Disallow /signup
Disallow /get-started
Disallow /avatars
Disallow /labs
Disallow /templates
Disallow /video-translation
Disallow /voices
Disallow /projects

Other Records

Field Value
sitemap https://app.heygen.com/sitemap.xml

Comments

  • All pages are allowed for crawling, since SEO meta data is static and not related to user-specific data.
  • Explicitly allow Twitterbot to ensure Video Card rendering for /videos, /embeds, /share
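
The groups and the sitemap record listed in this report can be checked with a short script. The sketch below reassembles a robots.txt from the scanned data and evaluates it with Python's standard urllib.robotparser. The reassembled file text, the helper names build_parser and can_fetch, and the sample paths /videos/abc and /pricing are assumptions made for illustration; the original file's exact layout and comment placement are not shown in the report. Python's parser applies the first matching rule in a group, whereas crawlers following RFC 9309 use the longest match; none of the rules here overlap, so both approaches give the same answers for this file.

```python
from urllib.robotparser import RobotFileParser

# Reassembled from the scan report. The directive order inside each group is
# assumed to match the order in which the report lists the rules.
ROBOTS_TXT = """\
User-agent: twitterbot
Allow: /

User-agent: *
Allow: /share
Allow: /videos
Allow: /embeds
Disallow: /home
Disallow: /login
Disallow: /guest
Disallow: /signup
Disallow: /get-started
Disallow: /avatars
Disallow: /labs
Disallow: /templates
Disallow: /video-translation
Disallow: /voices
Disallow: /projects

Sitemap: https://app.heygen.com/sitemap.xml
"""


def build_parser(text=ROBOTS_TXT):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def can_fetch(agent, path, parser=None):
    """Return True if `agent` may fetch `path` on m.heygen.com."""
    parser = parser or build_parser()
    return parser.can_fetch(agent, "https://m.heygen.com" + path)


if __name__ == "__main__":
    parser = build_parser()
    # Hypothetical sample paths, chosen only to exercise each kind of rule.
    for agent, path in [
        ("Googlebot", "/login"),
        ("Googlebot", "/videos/abc"),
        ("Googlebot", "/pricing"),
        ("Twitterbot", "/login"),
    ]:
        print(agent, path, can_fetch(agent, path, parser))
    print(parser.site_maps())
```

Under these assumptions, a generic crawler is refused /login, is allowed /videos/abc, and is allowed any path that no rule mentions (such as /pricing), while Twitterbot is allowed everywhere because its own group takes precedence over the * group. RobotFileParser.site_maps() requires Python 3.8 or later.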