spazioj.it
robots.txt

Robots Exclusion Standard data for spazioj.it

Resource Scan

Scan Details

Site Domain spazioj.it
Base Domain spazioj.it
Scan Status Ok
Last Scan2024-09-27T21:01:24+00:00
Next Scan 2024-10-04T21:01:24+00:00

Last Scan

Scanned2024-09-27T21:01:24+00:00
URL https://spazioj.it/robots.txt
Domain IPs 138.201.136.29
Response IP 138.201.136.29
Found Yes
Hash 44e0a47bc99c45d37db60167bc0be05212d7f60d3c4f8cb7de7b020b90e82ee3
SimHash 6d744fb656b1

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content
Disallow /xmlrpc.php
Disallow /trackback/

ninjabot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

*

Rule Path
Allow /wp-content/uploads/
Allow /tag/
Allow /category
Allow *.css$
Allow *.js$

Other Records

Field Value
sitemap https://www.spazioj.it/sitemap.xml
sitemap https://www.spazioj.it/sitemap-news.xml