theyesmancan.com
robots.txt
Robots Exclusion Standard data for theyesmancan.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | theyesmancan.com |
Base Domain | theyesmancan.com |
Scan Status | Ok |
Last Scan | 2024-11-12T16:30:49+00:00 |
Next Scan | 2024-11-26T16:30:49+00:00 |
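
The two timestamps above imply a rescan interval, which a short calculation can confirm. This is only a sketch based on the two values listed; whether the scanner always uses the same interval is an assumption, not something the record states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2024-11-12T16:30:49+00:00")
next_scan = datetime.fromisoformat("2024-11-26T16:30:49+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval.days)  # 14
```

For this record the gap is exactly 14 days, with the time of day unchanged.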
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-12T16:30:49+00:00 |
URL | https://theyesmancan.com/robots.txt |
Redirect | https://www.theyesmancan.com/robots.txt |
Redirect Domain | www.theyesmancan.com |
Redirect Base | theyesmancan.com |
Domain IPs | 162.159.140.127, 172.66.0.125, 2606:4700:7::7d, 2a06:98c1:58::7d |
Redirect IPs | 162.159.140.127, 172.66.0.125, 2606:4700:7::7d, 2a06:98c1:58::7d |
Response IP | 162.159.140.127 |
Found | Yes |
Hash | 9a9205adff7caa1187f74387eef8240069f940496bd97a90a24da40cda645afb |
SimHash | 2d114c44c7d1 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /Umbraco/ |
Disallow | /umbraco/ |
Disallow | /admin/ |
Disallow | /Admin/ |
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Disallow | /profile/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.theyesmancan.com/sitemap.xml |
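
To see how a crawler would interpret these records, the rules can be fed to Python's standard `urllib.robotparser`. The sketch below is a minimal illustration, not the site's actual file: `ROBOTS_TXT` is reconstructed from the tables above (the original file's exact text, ordering, and comments are not shown in this record), and the test paths such as `/admin/login` and `/about` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction (an assumption) of the robots.txt from the parsed records above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Umbraco/
Disallow: /umbraco/
Disallow: /admin/
Disallow: /Admin/
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /profile/

Sitemap: https://www.theyesmancan.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.theyesmancan.com"


def allowed(path: str) -> bool:
    """Return True if a generic crawler ("*") may fetch the given path."""
    return parser.can_fetch("*", BASE + path)


# Hypothetical paths, chosen only to exercise the rules.
for path in ["/admin/login", "/Umbraco/", "/ADMIN/", "/about"]:
    print(path, "allowed" if allowed(path) else "disallowed")

print(parser.site_maps())
```

In this parser, rule paths are matched as case-sensitive prefixes, which is presumably why the file lists both `/Umbraco/` and `/umbraco/`, and both `/admin/` and `/Admin/`. A differently cased path such as `/ADMIN/` is not covered by any of the listed rules.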