duck.com
robots.txt
Robots Exclusion Standard data for duck.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | duck.com |
Base Domain | duck.com |
Scan Status | Ok |
Last Scan | 2024-05-02T17:51:43+00:00 |
Next Scan | 2024-05-16T17:51:43+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-02T17:51:43+00:00 |
URL | https://duck.com/robots.txt |
Redirect | https://duckduckgo.com/robots.txt |
Redirect Domain | duckduckgo.com |
Redirect Base | duckduckgo.com |
Domain IPs | 20.43.161.105 |
Redirect IPs | 20.43.161.105 |
Response IP | 20.43.161.105 |
Found | Yes |
Hash | bd82319c3644f1adbeeb16be5e714fa91d46fa0ff258f9a21124cc3d56a16be3 |
SimHash | 86106a1287d6 |
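
The report does not name the algorithm behind the Hash field. The value is 64 hexadecimal digits long, which is the length of a SHA-256 digest, so the sketch below assumes SHA-256 computed over the raw response body. Both the algorithm and the exact bytes hashed are assumptions, and `body_digest` is a hypothetical helper name. The SimHash field is not covered here.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the scan's Hash field is SHA-256 over the raw bytes;
    the report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Example with a small in-memory body rather than a live fetch.
    sample = b"User-agent: *\nDisallow: /lite\n"
    print(body_digest(sample))
```

Comparing the digest of a freshly fetched body with the stored Hash value would be one way to tell whether the file changed between scans, if the assumption about the algorithm holds.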
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /lite |
Disallow | /html |
Disallow | /*? |
Disallow | /chrome_newtab |
Disallow | /email/ |
Allow | /email/$ |
Allow | /email/privacy-guarantees |
Allow | /email/privacy-terms |
Disallow | /2012-privacy-policy |
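
The rules in the `*` group combine prefix paths, a `*` wildcard, and a `$` end anchor, so the effective result for a given path is not obvious at a glance. The sketch below evaluates them using the longest-match convention described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). It assumes crawlers follow that convention; `pattern_to_regex` and `is_allowed` are hypothetical helper names, and pattern length is used as a simple stand-in for specificity.

```python
import re

# Rules copied from the "*" group in the table above, in the listed order.
RULES = [
    ("disallow", "/lite"),
    ("disallow", "/html"),
    ("disallow", "/*?"),
    ("disallow", "/chrome_newtab"),
    ("disallow", "/email/"),
    ("allow", "/email/$"),
    ("allow", "/email/privacy-guarantees"),
    ("allow", "/email/privacy-terms"),
    ("disallow", "/2012-privacy-policy"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            length = len(pattern)
            # Longest pattern wins; on equal length, prefer Allow.
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/lite", "/?q=test", "/email/", "/email/settings",
              "/email/privacy-terms"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions, `/email/` itself and the two privacy pages stay crawlable while every other path below `/email/` is blocked, and `/*?` blocks any URL that carries a query string.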
Other Records
Field | Value |
---|---|
sitemap | https://duckduckgo.com/sitemap.xml |
Comments