proturin.altervista.org
robots.txt

Robots Exclusion Standard data for proturin.altervista.org

Resource Scan

Scan Details

Site Domain proturin.altervista.org
Base Domain proturin.altervista.org
Scan Status Ok
Last Scan2024-11-13T19:23:44+00:00
Next Scan 2024-11-20T19:23:44+00:00

Last Scan

Scanned2024-11-13T19:23:44+00:00
URL https://proturin.altervista.org/robots.txt
Domain IPs 104.21.75.99, 172.67.220.154
Response IP 104.21.75.99
Found Yes
Hash 6cee1ce547815159a703083a807c8eed2dda14a228bc4cd41b49bf6b48c383fa
SimHash 4d4cda424791

Groups

*

Rule Path
Allow /
Disallow /*.pdf$
Disallow /image/
Disallow /Alberghi/
Disallow /guestbook/

mediapartners-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

yandex

Rule Path
Allow /

Other Records

Field Value
sitemap http://proturin.altervista.org/sitemap.xml

Warnings

  • `host` is not a known field.