sgrossi.it
robots.txt

Robots Exclusion Standard data for sgrossi.it

Resource Scan

Scan Details

Site Domain sgrossi.it
Base Domain sgrossi.it
Scan Status Ok
Last Scan2024-05-30T03:58:19+00:00
Next Scan 2024-06-06T03:58:19+00:00

Last Scan

Scanned2024-05-30T03:58:19+00:00
URL https://sgrossi.it/robots.txt
Domain IPs 162.159.136.54, 162.159.137.54, 2606:4700:7::a29f:8836, 2606:4700:7::a29f:8936
Response IP 162.159.137.54
Found Yes
Hash 312ced1b76422347140cd2bd7760894f577aaf86db9112d4b2ed8bdac4325095
SimHash 02149b937bb0

Groups

*

Rule Path
Disallow /wp-admin/
Disallow */feed/
Disallow /comments/
Disallow /author/
Disallow /wp-json/
Disallow /grazie/
Disallow /*aggregate*
Disallow /*.cfm*
Disallow /cdn-cgi/*
Disallow /search/*
Disallow /*?s=*
Allow /wp-content/uploads/

googlebot
googlebot-image
bingbot
mediapartners-google
googlebot-mobile
slurp
yandex
adidxbot
msnbot-media
msnbot

Rule Path
Disallow

semrushbot
semrushbot-sa
mj12bot
ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sgrossi.it/sitemaps.xml