schermata.it
robots.txt

Robots Exclusion Standard data for schermata.it

Resource Scan

Scan Details

Site Domain schermata.it
Base Domain schermata.it
Scan Status Ok
Last Scan2025-09-16T08:42:53+00:00
Next Scan 2025-09-23T08:42:53+00:00

Last Scan

Scanned2025-09-16T08:42:53+00:00
URL https://schermata.it/robots.txt
Redirect https://www.schermata.it/robots.txt
Redirect Domain www.schermata.it
Redirect Base schermata.it
Domain IPs 104.21.21.199, 172.67.200.21, 2606:4700:3031::6815:15c7, 2606:4700:3037::ac43:c815
Redirect IPs 104.21.21.199, 172.67.200.21, 2606:4700:3031::6815:15c7, 2606:4700:3037::ac43:c815
Response IP 104.21.21.199
Found Yes
Hash d209c8c99396f7f22f52d9ac382f4f20e0fe2f62eea2ad7b54b529696d5b78d7
SimHash 711e9050a5e2

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path
Disallow /search
Disallow /it/search
Disallow /index.php/it/search
Disallow /info/getinfo
Disallow /info/get

Other Records

Field Value
sitemap https://www.schermata.it/it/sitemap.xml