
Robots Exclusion Standard data for spider-gen.com

Resource Scan

Scan Details

Site Domain spider-gen.com
Base Domain spider-gen.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-05-28T04:04:35+00:00
Next Scan 2025-06-27T04:04:35+00:00

Last Successful Scan

Scanned 2025-04-06T04:03:16+00:00
URL https://spider-gen.com/robots.txt
Redirect https://greenwichalumni.co.uk/robots.txt
Redirect Domain greenwichalumni.co.uk
Redirect Base greenwichalumni.co.uk
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Redirect IPs 104.21.19.148, 172.67.186.132, 2606:4700:3030::ac43:ba84, 2606:4700:3037::6815:1394
Response IP 172.67.186.132
Found Yes
Hash f680bfcf7abcf8c9cf9e2b5314105a705107ec148f5d616f5db058b26db8000a
SimHash 4100d8408b92

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
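
The two rules in the * group overlap, so which one applies to a given path depends on the matching precedence. The sketch below illustrates the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). It handles plain path prefixes only, not the * and $ wildcards, and the names RULES and is_allowed are hypothetical, chosen for illustration rather than taken from the scan.

```python
# Rules from the "*" group listed above, as (directive, path) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: rules are plain prefixes; wildcards are not handled.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for candidate in ("/", "/wp-admin/", "/wp-admin/admin-ajax.php"):
        print(candidate, is_allowed(candidate))
```

Under this precedence, /wp-admin/admin-ajax.php stays crawlable while the rest of /wp-admin/ is blocked. Parsers that apply the first matching rule in file order instead would reach a different result for that path.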

Other Records

Field Value
sitemap https://greenwichalumni.co.uk/sitemap_index.xml