selfie.iol.pt
robots.txt

Robots Exclusion Standard data for selfie.iol.pt

Resource Scan

Scan Details

Site Domain selfie.iol.pt
Base Domain iol.pt
Scan Status Ok
Last Scan 2024-05-01T23:02:17+00:00
Next Scan 2024-05-31T23:02:17+00:00

Last Scan

Scanned 2024-05-01T23:02:17+00:00
URL https://selfie.iol.pt/robots.txt
Domain IPs 193.126.240.146
Response IP 193.126.240.146
Found Yes
Hash db6e2ea4e2abeef9b5c38cbaba1296227d4c5a20654ab789c2f0efa462be2e57
SimHash aa31b6f48eb3

Groups

*

Rule Path
Disallow /_preview
Disallow /search
Disallow /7811748
Disallow /300x100
Disallow /320x50
Disallow /LDB1

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-bard

Rule Path
Disallow /

openai-crawler

Rule Path
Disallow /
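The groups above can be checked programmatically. The following is a minimal sketch, assuming the rules are written back out in standard robots.txt syntax; the `ROBOTS_TXT` string is a reconstruction from this report, not the original file (whose exact layout and comments are not shown here), and the `allowed` helper is a hypothetical name introduced for illustration. It uses Python's standard `urllib.robotparser`, which applies simple prefix matching on paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (assumption:
# the real file's ordering, spacing, and comments may differ).
ROBOTS_TXT = """\
User-agent: *
Disallow: /_preview
Disallow: /search
Disallow: /7811748
Disallow: /300x100
Disallow: /320x50
Disallow: /LDB1

User-agent: openai
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: googlebot-bard
Disallow: /

User-agent: openai-crawler
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on selfie.iol.pt?"""
    return parser.can_fetch(agent, "https://selfie.iol.pt" + path)


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/"),
        ("Googlebot", "/search"),
        ("ChatGPT-User", "/"),
        ("openai-crawler", "/some-article"),
    ]:
        print(agent, path, allowed(agent, path))
```

Crawlers that match none of the named groups fall back to the `*` group, so they are only blocked from the six listed path prefixes, while the four named agents are blocked from the whole site.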

Other Records

Field Value
sitemap https://selfie.iol.pt/sitemap.xml
sitemap https://selfie.iol.pt/sitemaps/news.xml
sitemap https://selfie.iol.pt/sitemaps/index.xml
sitemap https://selfie.iol.pt/sitemaps/videos.xml
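
The sitemap records can be read back the same way. This is a small sketch, assuming the four records appear in the file as standard `Sitemap:` lines; `SITEMAP_LINES` is a reconstruction from the table above, and `RobotFileParser.site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table (assumption: the
# original file uses the conventional "Sitemap:" field spelling).
SITEMAP_LINES = [
    "Sitemap: https://selfie.iol.pt/sitemap.xml",
    "Sitemap: https://selfie.iol.pt/sitemaps/news.xml",
    "Sitemap: https://selfie.iol.pt/sitemaps/index.xml",
    "Sitemap: https://selfie.iol.pt/sitemaps/videos.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns the listed URLs in file order, or None if
# the file declares no sitemaps.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```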