war-alliance.com
robots.txt

Robots Exclusion Standard data for war-alliance.com

Resource Scan

Scan Details

Site Domain war-alliance.com
Base Domain war-alliance.com
Scan Status Ok
Last Scan2024-05-28T08:24:25+00:00
Next Scan 2024-06-04T08:24:25+00:00

Last Scan

Scanned2024-05-28T08:24:25+00:00
URL https://war-alliance.com/robots.txt
Redirect https://www.war-alliance.com/robots.txt
Redirect Domain www.war-alliance.com
Redirect Base war-alliance.com
Domain IPs 199.34.228.77
Redirect IPs 199.34.228.77
Response IP 199.34.228.77
Found Yes
Hash 41cbdb23b3dff9a0ece9bd0cdd9adb2db0bef5f5641ec26d553a44d0180b167d
SimHash 621cdc6e2793

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /https%3A//www.war-alliance.com/files/theme/assetlinks.json
Disallow /http%3A//www.war-alliance.com/files/theme/windows-app-web-link
Disallow /https%3A//www.war-alliance.com/files/theme/app-ads.txt
Disallow /newsfeed.html

Other Records

Field Value
sitemap https://www.war-alliance.com/sitemap.xml