portaldaqueixa.com
robots.txt

Robots Exclusion Standard data for portaldaqueixa.com

Resource Scan

Scan Details

Site Domain portaldaqueixa.com
Base Domain portaldaqueixa.com
Scan Status Ok
Last Scan2024-09-29T10:27:34+00:00
Next Scan 2024-10-06T10:27:34+00:00

Last Scan

Scanned2024-09-29T10:27:34+00:00
URL https://portaldaqueixa.com/robots.txt
Domain IPs 104.26.14.231, 104.26.15.231, 172.67.74.35
Response IP 104.26.14.231
Found Yes
Hash e9a82caa598d463149c72d90a7bc329a40a0013600a2490c591060a0e3c50182
SimHash eb71bb086082

Groups

*

Rule Path
Disallow /admin/
Disallow /attachments/*
Disallow /bo/
Disallow /complaints/follow/*
Disallow /compare/*
Disallow /assets/*
Disallow /docs/*
Disallow /images/
Disallow /user/*
Disallow /manual/*
Disallow /search
Disallow /search/*
Disallow /*?q=*
Disallow /*?p=*
Disallow /*.pdf$
Disallow *.pdf

baiduspider

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yahoo! slurp china

Rule Path
Disallow /

Other Records

Field Value
sitemap https://portaldaqueixa.com/sitemap.xml

Comments

  • SITEMAPS