mainfatti.it
robots.txt

Robots Exclusion Standard data for mainfatti.it

Resource Scan

Scan Details

Site Domain mainfatti.it
Base Domain mainfatti.it
Scan Status Ok
Last Scan2024-06-05T03:40:34+00:00
Next Scan 2024-06-12T03:40:34+00:00

Last Scan

Scanned2024-06-05T03:40:34+00:00
URL https://mainfatti.it/robots.txt
Redirect https://www.mainfatti.it/robots.txt
Redirect Domain www.mainfatti.it
Redirect Base mainfatti.it
Domain IPs 138.201.211.238
Redirect IPs 138.201.211.238
Response IP 138.201.211.238
Found Yes
Hash cab223cbad1639a2f61b4973d12cdae344ab58f012f9180d5b8df5afaa168af9
SimHash 4a0d40518518

Groups

mediapartners-google

Rule Path
Allow /

*

Rule Path
Disallow /cgi-bin
Disallow /mi-ecredits
Disallow /mi-outnews
Disallow /credits
Disallow /mi-error
Disallow /mi-mob
Disallow /00
Disallow /0t
Disallow /0int
Disallow /0apr
Allow /007

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.mainfatti.it/mi-maps/sitemap.xml
sitemap https://www.mainfatti.it/mi-maps/gnews_map.xml