selfie.iol.pt
robots.txt

Robots Exclusion Standard data for selfie.iol.pt

Resource Scan

Scan Details

Site Domain selfie.iol.pt
Base Domain iol.pt
Scan Status Ok
Last Scan2024-10-28T23:04:56+00:00
Next Scan 2024-11-27T23:04:56+00:00

Last Scan

Scanned2024-10-28T23:04:56+00:00
URL https://selfie.iol.pt/robots.txt
Domain IPs 193.126.240.146
Response IP 193.126.240.146
Found Yes
Hash 71437fa3813ab77512154c26ebc653147e4451c111cbbd9a9298c94682a93cee
SimHash 2a35b4f4cf93

Groups

*

Rule Path
Disallow /_preview
Disallow /search
Disallow /7811748
Disallow /300x100
Disallow /320x50
Disallow /LDB1

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-bard

Rule Path
Disallow /

openai-crawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://selfie.iol.pt/sitemap.xml
sitemap https://selfie.iol.pt/sitemaps/news.xml
sitemap https://selfie.iol.pt/sitemaps/index.xml
sitemap https://selfie.iol.pt/sitemaps/videos.xml