jorgejarai.xyz
robots.txt

Robots Exclusion Standard data for jorgejarai.xyz

Resource Scan

Scan Details

Site Domain jorgejarai.xyz
Base Domain jorgejarai.xyz
Scan Status Ok
Last Scan2025-11-06T03:08:25+00:00
Next Scan 2025-12-06T03:08:25+00:00

Last Scan

Scanned2025-11-06T03:08:25+00:00
URL https://jorgejarai.xyz/robots.txt
Domain IPs 104.21.33.121, 172.67.144.142, 2606:4700:3033::6815:2179, 2606:4700:3033::ac43:908e
Response IP 104.21.33.121
Found Yes
Hash 9d373b7b873b7f646e43fc9a473bc48fc571310449f8427494ba2be2f60015b8
SimHash 101559000113

Groups

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /