family.20thcenturystudios.com
robots.txt

Robots Exclusion Standard data for family.20thcenturystudios.com

Resource Scan

Scan Details

Site Domain family.20thcenturystudios.com
Base Domain 20thcenturystudios.com
Scan Status Ok
Last Scan2024-04-09T18:59:10+00:00
Next Scan 2024-05-09T18:59:10+00:00

Last Scan

Scanned2024-04-09T18:59:10+00:00
URL https://family.20thcenturystudios.com/robots.txt
Domain IPs 23.49.60.106, 23.49.60.112, 2600:1413:b000:1e::17d1:2e53, 2600:1413:b000:1e::17d1:2e5a
Response IP 23.52.171.160
Found Yes
Hash 52e8527ee6c4bb3fedf95fe2b933be65e60bc92197b64b6beb25964b3e22d4b6
SimHash ea04ec60e333

Groups

*

Rule Path
Disallow /7046/
Disallow /bh6/
Disallow /products/
Disallow /_xd/
Disallow /_did/
Disallow /www.shutterstock.com/
Disallow /youtu.be/
Disallow /mobile/
Disallow /s3/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.20thcenturystudios.com/sitemap.xml