sempremilan.it
robots.txt

Robots Exclusion Standard data for sempremilan.it

Resource Scan

Scan Details

Site Domain sempremilan.it
Base Domain sempremilan.it
Scan Status Ok
Last Scan2024-11-14T05:07:49+00:00
Next Scan 2024-11-21T05:07:49+00:00

Last Scan

Scanned2024-11-14T05:07:49+00:00
URL https://sempremilan.it/robots.txt
Domain IPs 35.197.243.217
Response IP 35.197.243.217
Found Yes
Hash 5204509cb8a2c5574c03461fa13df3ae8bb7fbad27be5b5eea8fd02000cdffe4
SimHash 21000cf1653d

Groups

*

Rule Path Comment
Disallow /wp-admin -
Allow /wp-admin/admin-ajax.php -
Disallow /wp-login -
Disallow /xmlrpc.php -
Disallow /wp-content/themes/*/$ prevents just the theme dir being crawled, which they have been since mid-2024 for some reason
Disallow /trackback -
Disallow */trackback -

Other Records

Field Value
sitemap https://sempremilan.it/sitemap/sitemap.xml