jobbatical.com
robots.txt

Robots Exclusion Standard data for jobbatical.com

Resource Scan

Scan Details

Site Domain jobbatical.com
Base Domain jobbatical.com
Scan Status Ok
Last Scan2026-01-30T21:08:25+00:00
Next Scan 2026-03-01T21:08:25+00:00

Last Scan

Scanned2026-01-30T21:08:25+00:00
URL https://www.jobbatical.com/robots.txt
Domain IPs 172.66.40.192, 172.66.43.64, 2606:4700:3108::ac42:28c0, 2606:4700:3108::ac42:2b40
Response IP 172.66.40.192
Found Yes
Hash 317e3083907323eb8f5e7539f4d602030d811cdc964c25badb300d3ace992a0c
SimHash 691c69d28491

Groups

*

Rule Path
Allow /
Disallow /blog/test
Disallow /countries-we-relocate-to/test

diffbot

Rule Path
Allow /

facebookbot

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

ccbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

youbot

Rule Path
Allow /

omgili

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /

claudebot

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.jobbatical.com/sitemap.xml
sitemap https://www.jobbatical.com/de/sitemap.xml
sitemap https://www.jobbatical.com/fr/sitemap.xml
sitemap https://www.jobbatical.com/es/sitemap.xml
sitemap https://www.jobbatical.com/sitemap.xml