pages.dio.me
robots.txt

Robots Exclusion Standard data for pages.dio.me

Resource Scan

Scan Details

Site Domain pages.dio.me
Base Domain dio.me
Scan Status Ok
Last Scan2025-07-24T07:57:28+00:00
Next Scan 2025-08-23T07:57:28+00:00

Last Scan

Scanned2025-07-24T07:57:28+00:00
URL https://pages.dio.me/robots.txt
Domain IPs 104.18.43.16, 172.64.144.240, 2606:4700:4400::6812:2b10, 2606:4700:4400::ac40:90f0
Response IP 104.18.43.16
Found Yes
Hash 7a99c7f6d0ba7fea850dc8d7c6eace1e81e1ba883ec9d4d67b4e8c2ff7787c25
SimHash 0844da44c793

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

Other Records

Field Value
sitemap https://pages.dio.me/sitemap.xml