dominiodebola.pt
robots.txt

Robots Exclusion Standard data for dominiodebola.pt

Resource Scan

Scan Details

Site Domain dominiodebola.pt
Base Domain dominiodebola.pt
Scan Status Ok
Last Scan2024-11-12T02:13:11+00:00
Next Scan 2024-11-19T02:13:11+00:00

Last Scan

Scanned2024-11-12T02:13:11+00:00
URL https://dominiodebola.pt/robots.txt
Redirect https://www.dominiodebola.com/robots.txt
Redirect Domain www.dominiodebola.com
Redirect Base dominiodebola.com
Domain IPs 185.32.188.4
Redirect IPs 104.21.22.107, 172.67.204.97, 2606:4700:3035::6815:166b, 2606:4700:3036::ac43:cc61
Response IP 104.21.22.107
Found Yes
Hash 1620e7e5abbe539ebe2941f6aa942ee4772176c4830579ee70c1fcfd06f5be30
SimHash 9b0dd944c5f3

Groups

openai-crawler

Rule Path
Disallow /

googlebot-bard

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/

Other Records

Field Value
sitemap https://www.dominiodebola.com/sitemap_index.xml
sitemap https://www.dominiodebola.com/news-sitemap.xml