nos.pt
robots.txt

Robots Exclusion Standard data for nos.pt

Resource Scan

Scan Details

Site Domain nos.pt
Base Domain nos.pt
Scan Status Ok
Last Scan 2024-11-09T14:34:32+00:00
Next Scan 2024-11-16T14:34:32+00:00

Last Scan

Scanned 2024-11-09T14:34:32+00:00
URL https://nos.pt/robots.txt
Redirect https://www.nos.pt/robots.txt
Redirect Domain www.nos.pt
Redirect Base nos.pt
Domain IPs 104.18.18.73, 104.18.19.73, 2606:4700::6812:1249, 2606:4700::6812:1349
Redirect IPs 104.18.18.73, 104.18.19.73, 2606:4700::6812:1249, 2606:4700::6812:1349
Response IP 104.18.18.73
Found Yes
Hash 6df5c563b19bb45de1f93e13f15256097a06807a2b77053ce7ac211fe8ade477
SimHash 20088a6065a1

Groups

*

Rule Path
Allow /
Disallow /login/
Disallow /content/dam/
Disallow /outros/termos-e-condicoes
Disallow /outros/qualidade-de-servico

yandex

Rule Path
Allow /
Disallow /login/
Disallow /content/dam/
Disallow /outros/termos-e-condicoes
Disallow /outros/qualidade-de-servico

baiduspider

Rule Path
Allow /
Disallow /login/
Disallow /content/dam/
Disallow /outros/termos-e-condicoes
Disallow /outros/qualidade-de-servico

bingbot

Rule Path
Allow /
Disallow /login/
Disallow /content/dam/
Disallow /outros/termos-e-condicoes
Disallow /outros/qualidade-de-servico

duckduckbot

Rule Path
Allow /
Disallow /login/
Disallow /content/dam/
Disallow /outros/termos-e-condicoes
Disallow /outros/qualidade-de-servico
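
All five groups above carry the same rule set, and each one lists Allow / before the Disallow lines. How that resolves depends on the parser. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and a tie goes to Allow), under which the Disallow entries still take effect. Parsers that apply the first matching rule in file order would instead allow everything here. The RULES list and the is_allowed helper are illustrative names, not part of the scan data, and only plain prefix matching is handled because these rules contain no wildcards.

```python
# Rules of the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/login/"),
    ("disallow", "/content/dam/"),
    ("disallow", "/outros/termos-e-condicoes"),
    ("disallow", "/outros/qualidade-de-servico"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: the longest matching path prefix wins, a tie goes to
    "allow", and a path matched by no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = kind == "allow"
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and is_allow
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for sample in ("/", "/login/", "/login", "/content/dam/logo.png"):
        print(sample, is_allowed(sample))
```

Under these assumptions /login/ and anything below /content/dam/ are blocked, while /login without the trailing slash matches only the Allow / rule and stays crawlable.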

Other Records

Field Value
sitemap https://www.nos.pt/content/nos/language-masters/pt.sitemap.xml
sitemap https://www.nos.pt/content/nos/language-masters/en.sitemap.xml

Comments

  • 06-12-2023
  • Block not wanted search engines
  • Specify Crawl Delay for minor search engines