spaziomilan.it
robots.txt

Robots Exclusion Standard data for spaziomilan.it

Resource Scan

Scan Details

Site Domain spaziomilan.it
Base Domain spaziomilan.it
Scan Status Ok
Last Scan2024-09-21T18:02:40+00:00
Next Scan 2024-09-28T18:02:40+00:00

Last Scan

Scanned2024-09-21T18:02:40+00:00
URL https://spaziomilan.it/robots.txt
Domain IPs 138.201.136.29
Response IP 138.201.136.29
Found Yes
Hash 6a055891d0efb4d4beb90c85f86f662e0200454820a9256dc25ae0354be6c484
SimHash 6d704fd656b2

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content
Disallow /xmlrpc.php
Disallow /trackback/

ninjabot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

*

Rule Path
Allow /wp-content/uploads/
Allow /tag/
Allow /category
Allow *.css$
Allow *.js$

Other Records

Field Value
sitemap https://www.spaziomilan.it/sitemap.xml
sitemap https://www.spaziomilan.it/sitemap-news.xml