nowcanal.pt
robots.txt

Robots Exclusion Standard data for nowcanal.pt

Resource Scan

Scan Details

Site Domain nowcanal.pt
Base Domain nowcanal.pt
Scan Status Ok
Last Scan2024-11-10T14:23:54+00:00
Next Scan 2024-11-17T14:23:54+00:00

Last Scan

Scanned2024-11-10T14:23:54+00:00
URL https://nowcanal.pt/robots.txt
Redirect https://www.nowcanal.pt/robots.txt
Redirect Domain www.nowcanal.pt
Redirect Base nowcanal.pt
Domain IPs 195.23.36.47
Redirect IPs 88.157.217.146
Response IP 88.157.217.146
Found Yes
Hash eb54263dc58ff674b214da56459886e03f2bb9aaaee81f48ced69de368365d64
SimHash 3b2558448d51

Groups

openai-crawler

Rule Path
Disallow /

googlebot-bard

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /Content/
Disallow /Scripts/
Disallow /Img/
Disallow /fonts/
Disallow /Error/
Disallow /4196/
Disallow /image
Disallow /CMLogs.aspx
Disallow /Async
Disallow /site/
Disallow /image

Other Records

Field Value
sitemap https://www.nowcanal.pt/sitemap