curlec.com
robots.txt

Robots Exclusion Standard data for curlec.com

Resource Scan

Scan Details

Site Domain curlec.com
Base Domain curlec.com
Scan Status Ok
Last Scan2025-10-05T23:56:54+00:00
Next Scan 2025-11-04T23:56:54+00:00

Last Scan

Scanned2025-10-05T23:56:54+00:00
URL https://curlec.com/robots.txt
Domain IPs 108.157.254.127, 108.157.254.37, 108.157.254.44, 108.157.254.9
Response IP 108.157.254.37
Found Yes
Hash 7b3ce9e2c20e5e3077553bbf211b17f57bc1bc850e7aa6a4bafddaf146e52696
SimHash 2359d6508c01

Groups

*

Rule Path
Disallow /draft/
Disallow /popupdemo/
Disallow /rquery
Disallow /themedemo/
Disallow /payment-link/p*
Disallow /blog/tag/
Disallow /blog/page/
Allow /

Other Records

Field Value
crawl-delay 1

ia_archiver

Rule Path
Disallow /

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot
claudebot
duckassistbot
apple-extended
baiduspider
yandex

Rule Path
Disallow /draft/
Disallow /blog/page/
Allow /

Other Records

Field Value
sitemap https://curlec.com/sitemap.xml