americancivilwar.com
robots.txt

Robots Exclusion Standard data for americancivilwar.com

Resource Scan

Scan Details

Site Domain americancivilwar.com
Base Domain americancivilwar.com
Scan Status Ok
Last Scan2024-11-10T00:32:04+00:00
Next Scan 2024-11-17T00:32:04+00:00

Last Scan

Scanned2024-11-10T00:32:04+00:00
URL https://americancivilwar.com/robots.txt
Domain IPs 2406:da18:9d0:143f:2124:4e9c:36a9:d9de, 52.221.42.138
Response IP 52.221.42.138
Found Yes
Hash 2d5f9a204a16faf7e9daef62e373fc6cd63472f55f94d3afbf4a79f2b9641775
SimHash 08721a908a83

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /Military-Store/
Disallow /Security_Systems/
Disallow /Chow/
Disallow /Virtual/
Disallow /Civil_War_Music/Music_Store/

twiceler

Rule Path
Disallow /

aipbot

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /cgi-bin/

Other Records

Field Value
crawl-delay 10

discovery

Rule Path
Disallow /

crawl

Rule Path
Disallow /

spider

Rule Path
Disallow /

bot*

Rule Path
Disallow /

robot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

teoma

Rule Path
Disallow /cgi-bin/
Disallow /Virtual/
Disallow /civilwar/
Disallow /Chow/

Other Records

Field Value
crawl-delay 10

slurp

Rule Path
Disallow /cgi-bin/

Other Records

Field Value
crawl-delay 30

Warnings

  • 2 invalid lines.