cantercadd.com
robots.txt

Robots Exclusion Standard data for cantercadd.com

Resource Scan

Scan Details

Site Domain cantercadd.com
Base Domain cantercadd.com
Scan Status Ok
Last Scan2025-04-20T07:09:43+00:00
Next Scan 2025-04-27T07:09:43+00:00

Last Scan

Scanned2025-04-20T07:09:43+00:00
URL https://www.cantercadd.com/robots.txt
Domain IPs 104.21.46.165, 172.67.140.182, 2606:4700:3032::ac43:8cb6, 2606:4700:3036::6815:2ea5
Response IP 104.21.46.165
Found Yes
Hash c9e25da26cf8d189d68ca1ef0bae579f07e79b04d30c2fc504b775d817e3f7f1
SimHash 353c1c4377b1

Groups

*

Rule Path
Disallow /sitehome/*

Other Records

Field Value
crawl-delay 1

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/4811792.xml.gz

Comments

  • Default robots.txt file (which asks all bots to crawl slowly but still index everything).
  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove