cafestoworkfrom.com
robots.txt

Robots Exclusion Standard data for cafestoworkfrom.com

Resource Scan

Scan Details

Site Domain cafestoworkfrom.com
Base Domain cafestoworkfrom.com
Scan Status Ok
Last Scan2026-02-26T09:59:13+00:00
Next Scan 2026-03-05T09:59:13+00:00

Last Scan

Scanned2026-02-26T09:59:13+00:00
URL https://cafestoworkfrom.com/robots.txt
Domain IPs 104.21.20.158, 172.67.193.46, 2606:4700:3030::6815:149e, 2606:4700:3037::ac43:c12e
Response IP 104.21.20.158
Found Yes
Hash 259168b429bdd8bc89f076ffd9b8dd5c3a2dc7c86ee3bc408fdef96bddc0f5ac
SimHash 2e0842f2c7e3

Groups

*

Rule Path
Allow /
Disallow /cookies
Disallow /privacy
Disallow /terms
Disallow /contact
Allow /*

Other Records

Field Value
sitemap https://cafestoworkfrom.com/sitemap.xml

Comments

  • https://www.robotstxt.org/robotstxt.html
  • Disallow policy pages
  • Allow everything else
  • Sitemap