startpage.com
robots.txt

Robots Exclusion Standard data for startpage.com

Resource Scan

Scan Details

Site Domain startpage.com
Base Domain startpage.com
Scan Status Ok
Last Scan2025-10-10T10:32:15+00:00
Next Scan 2025-11-09T10:32:15+00:00

Last Scan

Scanned2025-10-10T10:32:15+00:00
URL https://startpage.com/robots.txt
Redirect https://www.startpage.com/robots.txt
Redirect Domain www.startpage.com
Redirect Base startpage.com
Domain IPs 67.63.52.233
Redirect IPs 67.63.52.232
Response IP 67.63.52.231
Found Yes
Hash 795bee4d79cfc811f19fa32f2ce9e9aaac481b367c34a2d85982f4ba2761624e
SimHash 212c4164d795

Groups

*

Rule Path
Allow /sp/cdn/images/
Allow /sp/cdn/favicons/
Disallow /cgi-bin/
Disallow /do/
Disallow /sp/
Disallow /av/

Warnings

  • `noindex` is not a known field.