haagcertifiedinspector.com
robots.txt

Robots Exclusion Standard data for haagcertifiedinspector.com

Resource Scan

Scan Details

Site Domain haagcertifiedinspector.com
Base Domain haagcertifiedinspector.com
Scan Status Ok
Last Scan2024-11-19T04:27:37+00:00
Next Scan 2024-12-03T04:27:37+00:00

Last Scan

Scanned2024-11-19T04:27:37+00:00
URL https://haagcertifiedinspector.com/robots.txt
Domain IPs 104.21.92.179, 172.67.197.20, 2606:4700:3031::ac43:c514, 2606:4700:3037::6815:5cb3
Response IP 104.21.92.179
Found Yes
Hash e2480191399c2b563705a8237e43e1f2d7f768c9e90070198bd96edeb4fdca3e
SimHash 25209bceceb6

Groups

googlebot

Rule Path
Allow /

*

Product Comment
* applies to all robots
Rule Path Comment
Allow /?*un=*&pw=* -
Allow /?*pw=*&un=* -
Allow /secur/frontdoor.jsp?*sid=* -
Allow /secur/contentDoor?*sid=* -
Allow /secur/myDomainDoor?*sid=* -
Allow /secur/LoginInterstitial.apexp -
Allow /secur/login_portal.jsp?*pw=* -
Allow /sserv/login.jsp?*pw=* -
Allow /login.jsp?*pw=* -
Allow /login/login.jsp?*pw=* -
Allow /secur/login_page.jsp?*pw=* -
Disallow / disallow indexing of all pages

Comments

  • robots.txt for sfdc appservers.
  • For use by salesforce.com