cdn.za.com
robots.txt

Robots Exclusion Standard data for cdn.za.com

Resource Scan

Scan Details

Site Domain cdn.za.com
Base Domain cdn.za.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-26T10:24:37+00:00
Next Scan 2026-01-24T10:24:37+00:00

Last Successful Scan

Scanned2023-12-14T00:56:03+00:00
URL https://cdn.za.com/robots.txt
Domain IPs 104.21.28.19, 172.67.170.44, 2606:4700:3035::6815:1c13, 2606:4700:3035::ac43:aa2c
Response IP 172.67.170.44
Found Yes
Hash 224c5bd0e99e00d47c3487baf02055ba9c57ca5f17e08caa84f8b757292e1a71
SimHash f85c5917cfe1

Groups

*

Rule Path
Disallow /a/
Disallow /action/
Disallow /c/
Disallow /crt/
Disallow /h/
Disallow /i/
Disallow /ii/
Disallow /m/
Disallow /r/
Disallow /v/
Disallow /wp/

twitterbot

Rule Path
Allow /a/
Allow /action/
Allow /c/
Allow /crt/
Allow /h/
Allow /i/
Allow /ii/
Allow /m/
Allow /r/
Allow /v/
Allow /wp/

Comments

  • Dear Robots (and legal human guardians),
  • The Google AMP Cache is roboted to crawlers. We recommend that search engines
  • process cache links according to the guidelines in https://goo.gl/G40cwD.
  • If you only access the Google AMP Cache for user initiated requests,
  • please contact us at amphtml-robots@googlegroups.com.