ilgiornalediudine.com
robots.txt
Robots Exclusion Standard data for ilgiornalediudine.com
Resource Scan
Scan Details
Site Domain | ilgiornalediudine.com |
Base Domain | ilgiornalediudine.com |
Scan Status | Ok |
Last Scan | 2024-09-04T03:18:47+00:00 |
Next Scan | 2024-10-04T03:18:47+00:00 |
Last Scan
Scanned | 2024-09-04T03:18:47+00:00 |
URL | https://ilgiornalediudine.com/robots.txt |
Domain IPs | 94.23.67.76 |
Response IP | 94.23.67.76 |
Found | Yes |
Hash | 121b9136ac3cdd78abb93b11c334e0e5f43537e3854e8bf79cf2ae90d114f90f |
SimHash | 68104a22f2b9 |
Groups
*
Rule | Path |
---|---|
Allow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /wp-content/plugins/ |
Allow | /tag/ |
Disallow | /author/ |
Allow | /wp-content/uploads |
Disallow | /trackback/ |
Disallow | /feed/ |
Disallow | /comments/ |
Disallow | */trackback/ |
Disallow | */feed/ |
Disallow | */comments/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
Other Records
Field | Value |
---|---|
sitemap | http://www.ilgiornalediudine.com/xmlsitemap.xml |