spaziointer.it
robots.txt

Robots Exclusion Standard data for spaziointer.it

Resource Scan

Scan Details

Site Domain spaziointer.it
Base Domain spaziointer.it
Scan Status Ok
Last Scan2024-09-25T18:26:07+00:00
Next Scan 2024-10-02T18:26:07+00:00

Last Scan

Scanned2024-09-25T18:26:07+00:00
URL https://spaziointer.it/robots.txt
Domain IPs 138.201.136.29
Response IP 138.201.136.29
Found Yes
Hash 99abe8fb06f8e847951fa13daddc71ff1cda94b33e7b62819f4505edfb9a070a
SimHash 6d744ff65651

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /trackback/

ninjabot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

*

Rule Path
Allow /wp-content/uploads/
Allow /tag/
Allow /category
Allow *.css$
Allow *.js$
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.spaziointer.it/sitemap_index.xml
sitemap https://www.spaziointer.it/sitemap-news.xml