greentechmedia.com
robots.txt
Robots Exclusion Standard data for greentechmedia.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | greentechmedia.com |
Base Domain | greentechmedia.com |
Scan Status | Ok |
Last Scan | 2024-09-30T04:22:44+00:00 |
Next Scan | 2024-10-07T04:22:44+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-30T04:22:44+00:00 |
URL | https://greentechmedia.com/robots.txt |
Redirect | https://www.greentechmedia.com/robots.txt |
Redirect Domain | www.greentechmedia.com |
Redirect Base | greentechmedia.com |
Domain IPs | 104.21.33.48, 172.67.158.221, 2606:4700:3030::ac43:9edd, 2606:4700:3031::6815:2130 |
Redirect IPs | 104.21.33.48, 172.67.158.221, 2606:4700:3030::ac43:9edd, 2606:4700:3031::6815:2130 |
Response IP | 104.21.33.48 |
Found | Yes |
Hash | a361b8c3c2d7ce53194962b1f99e01c81221b185e1c755c9401c34e3f9d72bcc |
SimHash | c5699020c2d1 |
Groups
*
Rule | Path |
---|---|
Disallow | /confirm/ |
Disallow | /register/ |
Disallow | */preview_code_* |
Disallow | /*/*/*/submitted |
Disallow | /1062325/ |
Disallow | /search/results |
Disallow | /gtm_admin/ |
Disallow | /account/ |
Disallow | /checkout/ |
Disallow | /research/cart |
Disallow | /squared/finish-registration |
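
Two of the rules above (`*/preview_code_*` and `/*/*/*/submitted`) use `*` wildcards, so a plain prefix comparison would not evaluate them correctly. The following is a minimal sketch of how a crawler might test a URL path against this group, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` are illustrative and do not come from the scan. Because the group lists no Allow rules, the sketch does not implement longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/confirm/", "/register/", "*/preview_code_*", "/*/*/*/submitted",
    "/1062325/", "/search/results", "/gtm_admin/", "/account/",
    "/checkout/", "/research/cart", "/squared/finish-registration",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the start, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for sample in ("/account/settings", "/a/b/c/submitted", "/articles/solar"):
        print(sample, is_disallowed(sample))
```

Compiling the patterns on every call keeps the sketch short; a real crawler would likely compile them once and reuse them.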
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://www.greentechmedia.com/site/sitemap-index.xml |
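
As an illustration of how a client could consume the records above, the sketch below feeds a reconstructed robots.txt to Python's standard `urllib.robotparser`. The text in `ROBOTS_TXT` is rebuilt from the tables in this report and is an assumption: it is not the original file byte for byte, so its ordering and hash will differ. Only the literal-prefix rules are included, because `urllib.robotparser` matches by prefix and does not interpret `*` wildcards. The user agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan tables; NOT the original file contents.
ROBOTS_TXT = """\
User-agent: *
Disallow: /confirm/
Disallow: /register/
Disallow: /1062325/
Disallow: /search/results
Disallow: /gtm_admin/
Disallow: /account/
Disallow: /checkout/
Disallow: /research/cart
Disallow: /squared/finish-registration
Crawl-delay: 5

Sitemap: https://www.greentechmedia.com/site/sitemap-index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

agent = "ExampleBot"  # hypothetical crawler name
base = "https://www.greentechmedia.com"

# Seconds to wait between requests, taken from the crawl-delay record.
delay = parser.crawl_delay(agent)

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
sitemaps = parser.site_maps()

blocked = not parser.can_fetch(agent, base + "/account/settings")
allowed = parser.can_fetch(agent, base + "/articles/")

print(delay, sitemaps, blocked, allowed)
```

A polite crawler would sleep for `delay` seconds between requests and start discovery from the URLs in `sitemaps`.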