studyinwarsaw.pl
robots.txt

Robots Exclusion Standard data for studyinwarsaw.pl

Resource Scan

Scan Details

Site Domain studyinwarsaw.pl
Base Domain studyinwarsaw.pl
Scan Status Ok
Last Scan2026-01-24T21:48:56+00:00
Next Scan 2026-01-31T21:48:56+00:00

Last Scan

Scanned2026-01-24T21:48:56+00:00
URL https://studyinwarsaw.pl/robots.txt
Domain IPs 104.21.72.221, 172.67.155.163, 2606:4700:3032::ac43:9ba3, 2606:4700:3035::6815:48dd
Response IP 172.67.155.163
Found Yes
Hash 53caf55a0463e83703d3e44f353abb716a8d2005c3930f009dea5920c4733f48
SimHash 1018c8a0e311

Groups

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Allow /

Comments

  • AI Bots Block