themag.it
robots.txt

Robots Exclusion Standard data for themag.it

Resource Scan

Scan Details

Site Domain themag.it
Base Domain themag.it
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-11-21T21:45:30+00:00
Next Scan 2025-12-21T21:45:30+00:00

Last Successful Scan

Scanned2025-10-22T05:04:29+00:00
URL https://themag.it/robots.txt
Redirect https://www.themag.it/robots.txt
Redirect Domain www.themag.it
Redirect Base themag.it
Domain IPs 89.46.108.69
Redirect IPs 89.46.108.69
Response IP 89.46.108.69
Found Yes
Hash 92e3ab5438d8cce1d71248b3bd32e054448e128efbcdb8e01b0f0c8399ca9be1
SimHash e8056810ef03

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /trackback/
Disallow /comments/
Disallow /comments/feed/
Disallow /*?*
Disallow /*?

mediapartners-google*

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

Other Records

Field Value
sitemap http://www.themag.it/sitemap.xml.gz
sitemap http://www.themag.it/sitemap.xml
sitemap http://www.themag.it/sitemap-image.xml