globalpdteams.com
robots.txt

Robots Exclusion Standard data for globalpdteams.com

Resource Scan

Scan Details

Site Domain globalpdteams.com
Base Domain globalpdteams.com
Scan Status Ok
Last Scan2025-09-11T20:49:15+00:00
Next Scan 2025-09-25T20:49:15+00:00

Last Scan

Scanned2025-09-11T20:49:15+00:00
URL https://globalpdteams.com/robots.txt
Domain IPs 44.226.128.212
Response IP 44.226.128.212
Found Yes
Hash 6da188d2b6eab8e71a459873ed94c4a6dcea39e16c865678df462dc5ab0fec03
SimHash 711dc840c6c2

Groups

gptbot

Rule Path
Disallow

anthropicbot

Rule Path
Disallow

claudebot

Rule Path
Disallow

claude-web

Rule Path
Disallow

google-extended

Rule Path
Disallow

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

applebot

Rule Path
Disallow

amazonbot

Rule Path
Disallow

microsoft-extended

Rule Path
Disallow

metabot

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

bytespider

Rule Path
Disallow /

qiniuspider

Rule Path
Disallow /

ai-crawler

Rule Path
Disallow /

scrapybot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://globalpd.com/sitemap_index.xml

Comments

  • AI & Search Engine Bots — Allowed
  • Known AI Scrapers / Less Trusted Bots — Blocked