portugal-a-programar.pt
robots.txt

Robots Exclusion Standard data for portugal-a-programar.pt

Resource Scan

Scan Details

Site Domain portugal-a-programar.pt
Base Domain portugal-a-programar.pt
Scan Status Ok
Last Scan2025-06-28T10:54:41+00:00
Next Scan 2025-07-28T10:54:41+00:00

Last Scan

Scanned2025-06-28T10:54:41+00:00
URL https://portugal-a-programar.pt/robots.txt
Redirect https://www.portugal-a-programar.pt/robots.txt
Redirect Domain www.portugal-a-programar.pt
Redirect Base portugal-a-programar.pt
Domain IPs 104.21.7.157, 172.67.130.86, 2606:4700:3031::6815:79d, 2606:4700:3031::ac43:8256
Redirect IPs 104.21.7.157, 172.67.130.86, 2606:4700:3031::6815:79d, 2606:4700:3031::ac43:8256
Response IP 172.67.130.86
Found Yes
Hash 682df79fc72e581442d55c68961ab66f7f36fda6316141b669b58872006ed967
SimHash 7430d302a68a

Groups

*

Rule Path
Disallow /startTopic/
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login
Disallow /language/*
Disallow /blogs/?page=
Disallow /forums/topic/*-topic/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*ref%3D
Disallow /*?forumId*
Disallow /*csrfKey%3D
Disallow /*?csrf=
Disallow /*?csrfKey=
Disallow /*?_fromLogin=
Disallow /*?_fromLogout=
Disallow /*?&controller=embed
Disallow /applications/core/interface/imageproxy/imageproxy.php*
Disallow /index.phptopic
Disallow /blog/rss/

Other Records

Field Value
crawl-delay 8

adsbot-google
ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
barkrowler
bytespider
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
meta-externalagent
oai-searchbot
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
seekr
timpibot
youbot

Rule Path
Disallow /

ahrefsbot
dotbot
mj12bot
semrushbot

Rule Path
Disallow /

baiduspider
yisouspider
petalbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.portugal-a-programar.pt/sitemap.php

Comments

  • Block pages with no unique content
  • Legacy URLs
  • Block faceted pages and 301 redirect pages
  • Sitemap URL

Warnings

  • 1 invalid line.