umkaorleans.com
robots.txt

Robots Exclusion Standard data for umkaorleans.com

Resource Scan

Scan Details

Site Domain umkaorleans.com
Base Domain umkaorleans.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-05-26T13:06:56+00:00
Next Scan 2024-08-24T13:06:56+00:00

Last Successful Scan

Scanned2022-11-02T16:16:07+00:00
URL https://umkaorleans.com/robots.txt
Response IP 185.128.239.52
Found Yes
Hash b1a7163dc09714f54c9fd78488f8b1943241267ccf0c0dde70d021f4fe7b736e
SimHash 6b1cd015c731

Groups

*

Rule Path
Allow /
Disallow /contact
Disallow /mail/subscribe
Disallow /mail/valid-*

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

spbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://umkaorleans.com/sitemap-news.xml
sitemap https://umkaorleans.com/sitemap.xml