pressitalia.net
robots.txt

Robots Exclusion Standard data for pressitalia.net

Resource Scan

Scan Details

Site Domain pressitalia.net
Base Domain pressitalia.net
Scan Status Ok
Last Scan2026-04-03T22:11:26+00:00
Next Scan 2026-04-10T22:11:26+00:00

Last Scan

Scanned2026-04-03T22:11:26+00:00
URL https://pressitalia.net/robots.txt
Domain IPs 192.250.229.149
Response IP 192.250.229.149
Found Yes
Hash 406e6cdc3d5e1a815437c78838e156fed29381cd2309b53d291b677d9a0ef260
SimHash e9284940d2b4

Groups

mediapartners-google*

Rule Path
Disallow

*

Rule Path
Disallow /wp-
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Allow /wp-content/uploads/
Disallow /trackback/
Allow /feed/
Disallow /comments/
Disallow */trackback/
Allow */feed/
Disallow */comments/

Other Records

Field Value
sitemap http://www.pressitalia.net/sitemap.xml
sitemap http://www.pressitalia.net/news/google_sitemap.php
sitemap http://www.pressitalia.net/sitemap_index.xml