sempreinsalute.com
robots.txt

Robots Exclusion Standard data for sempreinsalute.com

Resource Scan

Scan Details

Site Domain sempreinsalute.com
Base Domain sempreinsalute.com
Scan Status Ok
Last Scan2025-03-30T01:08:08+00:00
Next Scan 2025-04-29T01:08:08+00:00

Last Scan

Scanned2025-03-30T01:08:08+00:00
URL https://sempreinsalute.com/robots.txt
Domain IPs 104.21.95.208, 172.67.148.102, 2606:4700:3031::6815:5fd0, 2606:4700:3036::ac43:9466
Response IP 104.21.95.208
Found Yes
Hash 982054a3942ab0c0a07baa53ae78dffc0330fc0bb2c29241067a6ddd2d19aa1f
SimHash e800c8008f92

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/

Other Records

Field Value
sitemap https://www.sempreinsalute.com/sitemap_index.xml