thecafesucrefarine.com
robots.txt

Robots Exclusion Standard data for thecafesucrefarine.com

Resource Scan

Scan Details

Site Domain thecafesucrefarine.com
Base Domain thecafesucrefarine.com
Scan Status Ok
Last Scan 2024-09-24T13:54:40+00:00
Next Scan 2024-10-01T13:54:40+00:00

Last Scan

Scanned 2024-09-24T13:54:40+00:00
URL https://thecafesucrefarine.com/robots.txt
Domain IPs 104.26.2.103, 104.26.3.103, 172.67.71.40, 2606:4700:20::681a:267, 2606:4700:20::681a:367, 2606:4700:20::ac43:4728
Response IP 172.67.71.40
Found Yes
Hash eaa3a38d62c4050867881fb0e649c5a1ce50bf663679c172d286807e553f15a3
SimHash 6320da608196
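
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch of that kind of change detection, assuming SHA-256 over the raw response body (the function names `fingerprint` and `has_changed` are hypothetical, not part of the scanner):

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (assumption: SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash


# Illustrative bodies only; these are not the site's actual file contents.
old_body = b"User-agent: *\nDisallow: /wp-admin\n"
new_body = b"User-agent: *\nDisallow: /wp-admin\nCrawl-delay: 15\n"
stored = fingerprint(old_body)
```

An exact digest only reports that some byte changed. The separate SimHash field is presumably there to measure how similar two versions are, which this sketch does not attempt.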

Groups

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin
Allow /wp-admin/admin-ajax.php
Allow /

Other Records

Field Value
crawl-delay 15
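
The groups and records above can be checked against a parser. The sketch below rebuilds a robots.txt from the scan data and queries it with Python's standard `urllib.robotparser`. It rests on several assumptions: the original file layout is not shown in the report, so the text here is a reconstruction; the `crawl-delay` record is placed in the `*` group, although the report lists it without a group; and `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan data above; the real file may be laid out
# differently. Placing Crawl-delay in the "*" group is an assumption.
ROBOTS_TXT = """\
User-agent: anthropic-ai
Disallow: /

User-agent: claude-web
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: facebookbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: piplbot
Disallow: /

User-agent: *
Disallow: /wp-admin
Allow: /wp-admin/admin-ajax.php
Allow: /
Crawl-delay: 15
"""

BASE = "https://thecafesucrefarine.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Named crawlers are blocked from the whole site.
gptbot_home = parser.can_fetch("GPTBot", BASE + "/")

# Any other crawler (hypothetical "ExampleBot") falls through to the "*" group.
other_page = parser.can_fetch("ExampleBot", BASE + "/some-recipe/")
other_admin = parser.can_fetch("ExampleBot", BASE + "/wp-admin/options.php")
other_delay = parser.crawl_delay("ExampleBot")
```

`urllib.robotparser` applies the first rule that matches in file order, whereas crawlers that follow RFC 9309 pick the longest matching path. The two approaches can disagree on `/wp-admin/admin-ajax.php`, so that path is deliberately left out of the checks here.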