archdaily.com
robots.txt
Robots Exclusion Standard data for archdaily.com
Resource Scan
Scan Details
Site Domain | archdaily.com |
Base Domain | archdaily.com |
Scan Status | Ok |
Last Scan | 2024-11-13T11:14:06+00:00 |
Next Scan | 2024-11-20T11:14:06+00:00 |
Last Scan
Scanned | 2024-11-13T11:14:06+00:00 |
URL | https://archdaily.com/robots.txt |
Redirect | https://www.archdaily.com/robots.txt |
Redirect Domain | www.archdaily.com |
Redirect Base | archdaily.com |
Domain IPs | 2600:9000:2795:3a00:4:2b2a:34c0:93a1, 2600:9000:2795:4200:4:2b2a:34c0:93a1, 2600:9000:2795:5600:4:2b2a:34c0:93a1, 2600:9000:2795:8400:4:2b2a:34c0:93a1, 2600:9000:2795:c200:4:2b2a:34c0:93a1, 2600:9000:2795:c600:4:2b2a:34c0:93a1, 2600:9000:2795:d800:4:2b2a:34c0:93a1, 2600:9000:2795:da00:4:2b2a:34c0:93a1, 3.168.132.101, 3.168.132.105, 3.168.132.21, 3.168.132.82 |
Redirect IPs | 2600:9000:2761:1c00:4:2b2a:34c0:93a1, 2600:9000:2761:2800:4:2b2a:34c0:93a1, 2600:9000:2761:3a00:4:2b2a:34c0:93a1, 2600:9000:2761:4600:4:2b2a:34c0:93a1, 2600:9000:2761:5a00:4:2b2a:34c0:93a1, 2600:9000:2761:7a00:4:2b2a:34c0:93a1, 2600:9000:2761:9000:4:2b2a:34c0:93a1, 2600:9000:2761:a600:4:2b2a:34c0:93a1, 3.164.85.118, 3.164.85.13, 3.164.85.34, 3.164.85.45 |
Response IP | 18.165.140.12 |
Found | Yes |
Hash | c4a005d959e2e508fc832c814479dec4a0143eef6da4226dcd60595dfd5a33d7 |
SimHash | c17c1c52cf93 |
Groups
*
Rule | Path |
---|---|
Disallow | *?replytocom |
Allow | / |
Disallow | /cl/ |
Disallow | /catalog/cl/ |
Disallow | /cn/ |
Disallow | /br/ |
Disallow | /catalog/br/ |
Disallow | /us/ |
Disallow | /mx/ |
Disallow | /catalog/mx/ |
Disallow | /co/ |
Disallow | /catalog/co/ |
Disallow | /pe/ |
Disallow | /catalog/pe/ |
Disallow | /1021178/AD |
Other Records
Field | Value |
---|---|
sitemap | https://www.archdaily.com/sitemap.xml |