cp-carrillo.com
robots.txt

Robots Exclusion Standard data for cp-carrillo.com

Resource Scan

Scan Details

Site Domain cp-carrillo.com
Base Domain cp-carrillo.com
Scan Status Ok
Last Scan 2024-10-02T15:27:21+00:00
Next Scan 2024-11-01T15:27:21+00:00

Last Scan

Scanned 2024-10-02T15:27:21+00:00
URL https://www.cp-carrillo.com/robots.txt
Domain IPs 67.207.213.76
Response IP 67.207.213.76
Found Yes
Hash 4e2d5be3010625d68a1ff557be4cac112e464a170bf6c83f5183d3c4ba462346
SimHash a2118c4286d2
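
A content hash like the one above can be used to detect whether robots.txt changed between scans. The following is a minimal sketch only: the scan does not name the hash algorithm, and SHA-256 is assumed here because the value is 64 hexadecimal characters long. It is also an assumption that the digest is taken over the raw response body. The helper names `sha256_hex` and `has_changed` and the sample body are hypothetical.

```python
import hashlib

# Hash value reported by the scan above.
REPORTED_HASH = "4e2d5be3010625d68a1ff557be4cac112e464a170bf6c83f5183d3c4ba462346"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hex: str = REPORTED_HASH) -> bool:
    """Report whether the body's digest differs from a previously stored one."""
    return sha256_hex(body) != previous_hex.lower()


# Hypothetical body used only for illustration; it is not the fetched file.
sample = b"User-agent: *\nDisallow: /admin\n"
print(len(sha256_hex(sample)))  # digest length in hex characters
print(has_changed(sample))
```

An exact hash only says that something changed, not how much, which is presumably why the scan also records a SimHash; that value is not reproduced here because its construction is not described.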

Groups

ahrefsbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /

*

Rule Path
Disallow /admin
Disallow /cgi-bin
Disallow /modlogan
Disallow /webalizer
Disallow /cart.html
Disallow /checkout.html
Disallow /account.html
Disallow /account/*
Disallow /wishlist.html
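
The groups above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch, assuming the file looks like the reconstruction below: the text is rebuilt from the listed rules and is not the fetched file, and `ExampleBot` is a hypothetical crawler name that matches none of the named groups.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above; the real file's exact
# layout (ordering, comments, blank lines) is an assumption.
ROBOTS_TXT = """\
User-agent: ahrefsbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: weborama-fetcher
Disallow: /

User-agent: *
Disallow: /admin
Disallow: /cgi-bin
Disallow: /modlogan
Disallow: /webalizer
Disallow: /cart.html
Disallow: /checkout.html
Disallow: /account.html
Disallow: /account/*
Disallow: /wishlist.html
"""

BASE = "https://www.cp-carrillo.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A named group with "Disallow: /" blocks that crawler everywhere.
print(parser.can_fetch("GPTBot", BASE + "/"))
# Other crawlers fall through to the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/"))
print(parser.can_fetch("ExampleBot", BASE + "/cart.html"))
```

The standard-library parser matches rules by path prefix and does not expand `*` inside a path, so the `/account/*` rule is not exercised here; a crawler that supports wildcards may treat it differently.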

Other Records

Field Value
sitemap https://www.cp-carrillo.com/sitemap_index.xml

Comments

  • robots.txt generated at https://www.cp-carrillo.com
  • end of file