gazzettalavoro.net
robots.txt

Robots Exclusion Standard data for gazzettalavoro.net

Resource Scan

Scan Details

Site Domain gazzettalavoro.net
Base Domain gazzettalavoro.net
Scan Status Ok
Last Scan 2024-05-26T05:40:13+00:00
Next Scan 2024-06-02T05:40:13+00:00

Last Scan

Scanned 2024-05-26T05:40:13+00:00
URL https://gazzettalavoro.net/robots.txt
Redirect https://www.gazzettalavoro.net/robots.txt
Redirect Domain www.gazzettalavoro.net
Redirect Base gazzettalavoro.net
Domain IPs 2600:1901:0:11f5::, 34.149.210.165
Redirect IPs 2600:1901:0:11f5::, 34.149.210.165
Response IP 34.149.210.165
Found Yes
Hash 99786279851f2d38a9f9bc2296876f0d3bae50395aec1aff89d9ccbce0dbb6da
SimHash 0054c360c771

Groups

*

Rule Path
Allow /$
Allow /ads.txt$
Allow /amp/*
Allow /editor/*
Disallow /*

mediapartners-google

Rule Path
Allow /

grapeshot

Rule Path
Disallow
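
The groups above can be read as a small rule set. The following is a minimal Python sketch of how a crawler might evaluate them, assuming the longest-match semantics of RFC 9309: the matching rule with the longest path wins, Allow wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data. User agents are matched by exact lowercase name here, which is a simplification of what real crawlers do.

```python
import re

# Rules transcribed from the Groups listing above: (rule, path) per user agent.
GROUPS = {
    "*": [
        ("allow", "/$"),
        ("allow", "/ads.txt$"),
        ("allow", "/amp/*"),
        ("allow", "/editor/*"),
        ("disallow", "/*"),
    ],
    "mediapartners-google": [("allow", "/")],
    "grapeshot": [("disallow", "")],  # empty path, as listed in the scan
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent`.

    Assumption: exact (case-insensitive) group lookup, falling back to '*'.
    Paths matched by no rule are allowed.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True
    for rule, pattern in rules:
        if not pattern:
            # An empty path matches nothing, so the rule has no effect.
            continue
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on a tie, Allow beats Disallow.
            if length > best_len or (length == best_len and rule == "allow"):
                best_len, allowed = length, rule == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("*", "/"),
        ("*", "/amp/story"),
        ("*", "/news/article"),
        ("mediapartners-google", "/news/article"),
        ("grapeshot", "/news/article"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, the `*` group permits only the home page, `/ads.txt`, and anything under `/amp/` or `/editor/`, while `mediapartners-google` and `grapeshot` are unrestricted: the first because of `Allow /`, the second because a Disallow with an empty path blocks nothing.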