geglobalresearch.com
robots.txt
Robots Exclusion Standard data for geglobalresearch.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | geglobalresearch.com |
Base Domain | geglobalresearch.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-06-30T20:41:48+00:00 |
Next Scan | 2025-09-28T20:41:48+00:00 |
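
The report does not state the rescan interval, but it can be read off the two timestamps above. A minimal sketch, assuming only the Python standard library (the variable names are illustrative, not part of the report):

```python
from datetime import datetime

# Timestamps copied from the "Last Scan" and "Next Scan" rows above.
last_scan = datetime.fromisoformat("2025-06-30T20:41:48+00:00")
next_scan = datetime.fromisoformat("2025-09-28T20:41:48+00:00")

# Difference between the two timezone-aware datetimes.
interval = next_scan - last_scan
print(interval.days)  # 90
```

The two rows are exactly 90 days apart, which suggests (but does not confirm) a fixed 90-day retry schedule after a failed scan.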
Last Successful Scan
Field | Value |
---|---|
Scanned | 2025-02-08T19:27:06+00:00 |
URL | https://geglobalresearch.com/robots.txt |
Redirect | https://www.geglobalresearch.com/robots.txt |
Redirect Domain | www.geglobalresearch.com |
Redirect Base | geglobalresearch.com |
Domain IPs | 104.21.10.21, 172.67.162.33, 2606:4700:3030::ac43:a221, 2606:4700:3033::6815:a15 |
Redirect IPs | 104.21.10.21, 172.67.162.33, 2606:4700:3030::ac43:a221, 2606:4700:3033::6815:a15 |
Response IP | 104.21.10.21 |
Found | Yes |
Hash | 1d6f5b0369f7eef1f3c3e09016af1f295dd7b02565599ca6475bd148ea40f2c1 |
SimHash | 9066df60a917 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /visit/ |
Disallow | */visit/* |
Disallow | /login/ |
Disallow | /search/ |
Disallow | /wp-admin/ |
Disallow | /*?s= |
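
To illustrate how the rules above apply to concrete URL paths, here is a minimal sketch of a wildcard-aware matcher. It assumes the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sample paths are invented for illustration. Python's built-in `urllib.robotparser` is not used here because it does not expand `*` wildcards.

```python
import re

# Disallow paths copied from the rule table above (group "*").
DISALLOW = ["/visit/", "*/visit/*", "/login/", "/search/", "/wp-admin/", "/*?s="]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' becomes '.*'; a trailing '$' pins the end of the path;
    all other characters are matched literally.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches from the start of `path`."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)

# Sample paths are assumptions chosen to exercise each rule.
for path in ["/visit/", "/en/visit/lab", "/?s=turbine", "/visitors/", "/about/"]:
    print(path, is_disallowed(path))
```

Because the group contains only Disallow rules, the sketch skips the longest-match precedence that would be needed if Allow rules were present.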
Comments