onmilwaukee.com
robots.txt

Robots Exclusion Standard data for onmilwaukee.com

Resource Scan

Scan Details

Site Domain onmilwaukee.com
Base Domain onmilwaukee.com
Scan Status Ok
Last Scan2024-11-13T21:11:47+00:00
Next Scan 2024-11-20T21:11:47+00:00

Last Scan

Scanned2024-11-13T21:11:47+00:00
URL https://onmilwaukee.com/robots.txt
Domain IPs 52.116.12.250
Response IP 52.116.12.250
Found Yes
Hash 9b3e861ede6970b1fca3c9a735e7a26b5bdaab966f51eeb6a08805b346eb3542
SimHash 4818dcd2a283

Groups

*

Rule Path
Disallow /articles/print/
Disallow /articles/email/
Disallow /admin/
Disallow /media/
Disallow /search/

Other Records

Field Value
crawl-delay 10

missigua locator 1.9

Rule Path
Disallow /

aipbot

Rule Path
Disallow /

snoopy

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Warnings

  • 2 invalid lines.