theplan.it
robots.txt

Robots Exclusion Standard data for theplan.it

Resource Scan

Scan Details

Site Domain theplan.it
Base Domain theplan.it
Scan Status Ok
Last Scan2024-11-01T19:45:24+00:00
Next Scan 2024-11-08T19:45:24+00:00

Last Scan

Scanned2024-11-01T19:45:24+00:00
URL https://theplan.it/robots.txt
Redirect https://www.theplan.it/robots.txt
Redirect Domain www.theplan.it
Redirect Base theplan.it
Domain IPs 104.26.8.215, 104.26.9.215, 172.67.71.252, 2606:4700:20::681a:8d7, 2606:4700:20::681a:9d7, 2606:4700:20::ac43:47fc
Redirect IPs 104.26.8.215, 104.26.9.215, 172.67.71.252, 2606:4700:20::681a:8d7, 2606:4700:20::681a:9d7, 2606:4700:20::ac43:47fc
Response IP 104.26.9.215
Found Yes
Hash cd4b7e0734e48e516b2d0b51eb3f8204784b003cb4304d5b38d69e2243da9a4d
SimHash 6d1438554993

Groups

*

Rule Path
Disallow /eng/chinese
Disallow /chi
Disallow /editorial_committee
Disallow /sitemap_generator
Disallow /ordinimaggioli

Other Records

Field Value
sitemap https://www.theplan.it/sitemap-index.xml