diarioguardiao.pt
robots.txt

Robots Exclusion Standard data for diarioguardiao.pt

Resource Scan

Scan Details

Site Domain diarioguardiao.pt
Base Domain diarioguardiao.pt
Scan Status Ok
Last Scan2025-03-29T03:30:14+00:00
Next Scan 2025-04-05T03:30:14+00:00

Last Scan

Scanned2025-03-29T03:30:14+00:00
URL https://diarioguardiao.pt/robots.txt
Domain IPs 104.21.73.46, 172.67.140.149, 2606:4700:3030::6815:492e, 2606:4700:3031::ac43:8c95
Response IP 104.21.73.46
Found Yes
Hash ea3b5fe1e9aca961b3504503c4bfd931f827c18c44fbd0494cdc6594647e305a
SimHash 4101d8000db3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://diarioguardiao.pt/sitemap_index.xml