iol.pt
robots.txt

Robots Exclusion Standard data for iol.pt

Resource Scan

Scan Details

Site Domain iol.pt
Base Domain iol.pt
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-12-23T17:14:12+00:00
Next Scan 2025-01-06T17:14:12+00:00

Last Successful Scan

Scanned2024-12-08T17:13:04+00:00
URL https://iol.pt/robots.txt
Redirect https://www.iol.pt/robots.txt
Redirect Domain www.iol.pt
Redirect Base iol.pt
Domain IPs 193.126.240.131
Redirect IPs 193.126.240.146
Response IP 193.126.240.146
Found Yes
Hash b400462197fb0d6398d38ec6ff08c0a16978b5317e7271d07c9707f67bb80d57
SimHash 6931b074cb93

Groups

*

Rule Path
Disallow /_preview
Disallow /search
Disallow /7811748
Disallow /300x100

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-bard

Rule Path
Disallow /

openai-crawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.iol.pt/sitemap.xml
sitemap https://www.iol.pt/sitemaps/news.xml
sitemap https://www.iol.pt/sitemaps/index.xml
sitemap https://www.iol.pt/sitemaps/videos.xml