cdn.za.com
robots.txt

Robots Exclusion Standard data for cdn.za.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cdn.za.com
Base Domain	cdn.za.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-10-26T10:24:37+00:00
Next Scan	2026-01-24T10:24:37+00:00

Last Successful Scan

Scanned	2023-12-14T00:56:03+00:00
URL	https://cdn.za.com/robots.txt
Domain IPs	104.21.28.19, 172.67.170.44, 2606:4700:3035::6815:1c13, 2606:4700:3035::ac43:aa2c
Response IP	172.67.170.44
Found	Yes
Hash	224c5bd0e99e00d47c3487baf02055ba9c57ca5f17e08caa84f8b757292e1a71
SimHash	f85c5917cfe1

Groups

*

Rule	Path
Disallow	/a/
Disallow	/action/
Disallow	/c/
Disallow	/crt/
Disallow	/h/
Disallow	/i/
Disallow	/ii/
Disallow	/m/
Disallow	/r/
Disallow	/v/
Disallow	/wp/

Rule

Path

Disallow

/a/

Disallow

/action/

Disallow

/c/

Disallow

/crt/

Disallow

/h/

Disallow

/i/

Disallow

/ii/

Disallow

/m/

Disallow

/r/

Disallow

/v/

Disallow

/wp/

twitterbot

Rule	Path
Allow	/a/
Allow	/action/
Allow	/c/
Allow	/crt/
Allow	/h/
Allow	/i/
Allow	/ii/
Allow	/m/
Allow	/r/
Allow	/v/
Allow	/wp/

Rule

Path

Allow

/a/

Allow

/action/

Allow

/c/

Allow

/crt/

Allow

/h/

Allow

/i/

Allow

/ii/

Allow

/m/

Allow

/r/

Allow

/v/

Allow

/wp/

Back to top

Comments

Dear Robots (and legal human guardians),
The Google AMP Cache is roboted to crawlers. We recommend that search engines
process cache links according to the guidelines in https://goo.gl/G40cwD.
If you only access the Google AMP Cache for user initiated requests,
please contact us at amphtml-robots@googlegroups.com.

Back to top

cdn.za.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

twitterbot

Comments

cdn.za.com
robots.txt