japantravel.com
robots.txt

Robots Exclusion Standard data for japantravel.com

Resource Scan

Scan Details

Site Domain japantravel.com
Base Domain japantravel.com
Scan Status Ok
Last Scan2024-10-25T07:04:09+00:00
Next Scan 2024-11-24T07:04:09+00:00

Last Scan

Scanned2024-10-25T07:04:09+00:00
URL https://japantravel.com/robots.txt
Domain IPs 139.162.81.108
Response IP 139.162.81.108
Found Yes
Hash 6ab03c3fd9e379b58f287f572545cfc936c630913043b2a2db107b7f8f3ad664
SimHash 7038d950c617

Groups

twitterbot

Rule Path
Allow /permalink/*

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://en.japantravel.com/sitemap/en/sitemap.xml

Comments

  • Disable most of AI Bots: https://www.cyberciti.biz/web-developer/block-openai-bard-bing-ai-crawler-bots-using-robots-txt-file/