protectioncivile.org
robots.txt

Robots Exclusion Standard data for protectioncivile.org

Resource Scan

Scan Details

Site Domain protectioncivile.org
Base Domain protectioncivile.org
Scan Status Ok
Last Scan2025-04-07T16:00:43+00:00
Next Scan 2025-05-07T16:00:43+00:00

Last Scan

Scanned2025-04-07T16:00:43+00:00
URL https://protectioncivile.org/robots.txt
Domain IPs 104.21.95.208, 172.67.148.102, 2606:4700:3031::6815:5fd0, 2606:4700:3036::ac43:9466
Response IP 104.21.95.208
Found Yes
Hash facc17a206aa50ec5741d32b0e17d6f031e74b1009b69e7e675b200d0211b4ba
SimHash 4501df175711

Groups

scrapy

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.protectioncivile.org/page-sitemap.xml