blog.cuatrecasas.com
robots.txt
Robots Exclusion Standard data for blog.cuatrecasas.com
Resource Scan
Scan Details
Site Domain | blog.cuatrecasas.com |
Base Domain | cuatrecasas.com |
Scan Status | Ok |
Last Scan | 2025-09-24T07:06:49+00:00 |
Next Scan | 2025-10-24T07:06:49+00:00 |
Last Scan
Scanned | 2025-09-24T07:06:49+00:00 |
URL | https://blog.cuatrecasas.com/robots.txt |
Domain IPs | 13.39.11.93 |
Response IP | 13.37.145.162 |
Found | Yes |
Hash | 63eef2614fc52a77eea0f1a85029fbe3e9c3d025f7c24fc4d0f4fda6a61c373d |
SimHash | 3c143150c544 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /bundles/ |
Disallow | /bundles_old/ |
Disallow | /erecruiting/ |
Disallow | /images/ |
Allow | /images/cache/ |
Disallow | /img/ |
Disallow | /media_repository/ |
Allow | /summernote/ |
Allow | /resources/ |
Disallow | /web/ |
Allow | /web/assets/ |
Allow | /web/vendor/ |
oai-searchbot
chatgpt-user
perplexitybot
bingbot
googlebot
google-extended
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /bundles/ |
Disallow | /bundles_old/ |
Disallow | /erecruiting/ |
Disallow | /images/ |
Allow | /images/cache/ |
Disallow | /img/ |
Disallow | /media_repository/ |
Allow | /summernote/ |
Allow | /resources/ |
Disallow | /web/ |
Allow | /web/assets/ |
Allow | /web/vendor/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.cuatrecasas.com/sitemap.xml |
Comments