warhistory.org
robots.txt

Robots Exclusion Standard data for warhistory.org

Resource Scan

Scan Details

Site Domain warhistory.org
Base Domain warhistory.org
Scan Status Ok
Last Scan2025-06-26T21:27:47+00:00
Next Scan 2025-07-03T21:27:47+00:00

Last Scan

Scanned2025-06-26T21:27:47+00:00
URL https://warhistory.org/robots.txt
Domain IPs 104.21.44.76, 172.67.197.75, 2606:4700:3031::6815:2c4c, 2606:4700:3035::ac43:c54b
Response IP 104.21.44.76
Found Yes
Hash 625d56ed641a229b73f7be5f6156f208d9fb50c2ddf8627d9ccd4cc32e5df6f4
SimHash 581550558852

Groups

ccbot

Rule Path
Disallow /

chatgpt

Rule Path
Disallow /

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://warhistory.org/sitemap_index.xml