businessprofiles.com
robots.txt

Robots Exclusion Standard data for businessprofiles.com

Resource Scan

Scan Details

Site Domain businessprofiles.com
Base Domain businessprofiles.com
Scan Status Ok
Last Scan 2025-04-22T10:36:25+00:00
Next Scan 2025-04-29T10:36:25+00:00

Last Scan

Scanned 2025-04-22T10:36:25+00:00
URL https://businessprofiles.com/robots.txt
Domain IPs 172.66.40.96, 172.66.43.160, 2606:4700:3108::ac42:2860, 2606:4700:3108::ac42:2ba0
Response IP 172.66.40.96
Found Yes
Hash 91eb541dcf285f08c2bc537923fad1cfe78022f39157c1139a58d4831c9664ff
SimHash 229549156570

Groups

*

Rule Path
Disallow /edit/
Disallow /terms/
Allow /
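
As a rough illustration of how the group above applies to individual URLs, the sketch below feeds the three rules to Python's standard urllib.robotparser. The robots.txt text is reconstructed from the table, not copied from the scanned file, so its exact layout is an assumption. The crawler name "ExampleBot" and the path /company/example are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above. The scanned file's exact text
# is not shown in the report, so this layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /edit/
Disallow: /terms/
Allow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
for path in ("/", "/edit/profile", "/terms/", "/company/example"):
    url = "https://businessprofiles.com" + path
    print(path, parser.can_fetch("ExampleBot", url))
```

Under these rules, paths beneath /edit/ and /terms/ should be reported as blocked and everything else as allowed. Python's parser applies the first matching rule in file order, while some crawlers use the longest matching path instead. For this rule set both approaches should agree.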

Other Records

Field Value
sitemap https://businessprofiles.com/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: