ilsussidiario.net
robots.txt

Robots Exclusion Standard data for ilsussidiario.net

Resource Scan

Scan Details

Site Domain ilsussidiario.net
Base Domain ilsussidiario.net
Scan Status Ok
Last Scan2024-05-29T20:09:15+00:00
Next Scan 2024-06-05T20:09:15+00:00

Last Scan

Scanned2024-05-29T20:09:15+00:00
URL https://ilsussidiario.net/robots.txt
Domain IPs 173.249.13.50
Response IP 173.249.13.50
Found Yes
Hash e036afc977a515651619295e12f99588b5bae85c74d08d605beaaa1968fc3fa8
SimHash 620d51540535

Groups

*

Rule Path
Disallow

msnbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

yandex

Rule Path
Disallow

baiduspider

Rule Path
Disallow

grapeshot

Rule Path
Disallow /
Allow /wp-admin/*.css$
Allow /wp-includes/*.js$
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /etc/
Disallow /?s=*
Disallow /Ricerca/

Other Records

Field Value
sitemap https://www.ilsussidiario.net/sitemap-list.xml
sitemap https://www.ilsussidiario.net/gnewssitemap.xml