centumcellae.it
robots.txt

Robots Exclusion Standard data for centumcellae.it

Resource Scan

Scan Details

Site Domain centumcellae.it
Base Domain centumcellae.it
Scan Status Ok
Last Scan 2026-01-31T02:27:34+00:00
Next Scan 2026-03-02T02:27:34+00:00

Last Scan

Scanned 2026-01-31T02:27:34+00:00
URL https://centumcellae.it/robots.txt
Redirect https://www.centumcellae.it/robots.txt
Redirect Domain www.centumcellae.it
Redirect Base centumcellae.it
Domain IPs 2a00:6d40:4:1::c314:56, 89.46.110.56
Redirect IPs 2a00:6d40:4:1::c314:56, 89.46.110.56
Response IP 89.46.110.56
Found Yes
Hash 05ba6b678890ed95499378d8e3159ad769d372b9bc1caffced5198f71a94d412
SimHash 28189150e9e1

Groups

*

Rule Path
Allow /

twitterbot

Rule Path
Allow /images

facebookexternalhit

Rule Path
Allow /images
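
The three groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the ROBOTS_TXT text is an assumed reconstruction from the group table (the scan does not show the file's exact lines or ordering), and the names build_parser, BASE, and "examplebot" are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the group table in the scan;
# the real file's exact wording and ordering are not shown there.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: twitterbot
Allow: /images

User-agent: facebookexternalhit
Allow: /images
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://www.centumcellae.it"  # redirect target reported by the scan

# "examplebot" is a hypothetical crawler name that falls under the "*" group.
any_bot_home = parser.can_fetch("examplebot", BASE + "/")
twitter_images = parser.can_fetch("twitterbot", BASE + "/images/photo.jpg")
facebook_images = parser.can_fetch("facebookexternalhit", BASE + "/images/photo.jpg")

print(any_bot_home, twitter_images, facebook_images)
```

Because none of the groups contains a Disallow rule, every check here comes back as allowed. The crawler-specific groups only restate access to /images and do not restrict anything else.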

Other Records

Field Value
sitemap http://www.centumcellae.it/sitemap.xml.gz
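
The sitemap record points at a gzip-compressed file (.xml.gz), so a consumer has to decompress it before reading the XML. The sketch below shows one way to do that with the standard library. The sample payload is hypothetical and built in memory, since the scan does not include the sitemap's contents; the function name urls_from_gzipped_sitemap is likewise an illustration, and the namespace is the one defined by the sitemaps.org protocol.

```python
import gzip
import xml.etree.ElementTree as ET

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def urls_from_gzipped_sitemap(payload: bytes) -> list[str]:
    """Decompress a .xml.gz sitemap payload and return its <loc> values."""
    xml_bytes = gzip.decompress(payload)
    root = ET.fromstring(xml_bytes)
    # <loc> appears in both <urlset> and <sitemapindex> documents.
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc") if loc.text]


# Hypothetical sample document; the real sitemap's entries are not in the scan.
sample_xml = b"""<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.centumcellae.it/</loc></url>
  <url><loc>https://www.centumcellae.it/images/</loc></url>
</urlset>"""

sample_payload = gzip.compress(sample_xml)
sample_urls = urls_from_gzipped_sitemap(sample_payload)
print(sample_urls)
```

In real use the payload would be the response body fetched from the sitemap URL. Fetching is left out here so the example runs without network access.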

Comments

  • Certain social media sites are whitelisted to allow crawlers to access page markup when links to /images are shared.