centralmaine.com
robots.txt

Robots Exclusion Standard data for centralmaine.com

Resource Scan

Scan Details

Site Domain centralmaine.com
Base Domain centralmaine.com
Scan Status Ok
Last Scan2024-05-28T01:03:08+00:00
Next Scan 2024-06-04T01:03:08+00:00

Last Scan

Scanned2024-05-28T01:03:08+00:00
URL https://centralmaine.com/robots.txt
Redirect https://www.centralmaine.com/robots.txt
Redirect Domain www.centralmaine.com
Redirect Base centralmaine.com
Domain IPs 192.0.66.100
Redirect IPs 192.0.66.100, 2a04:fa87:fffd::c000:4264
Response IP 192.0.66.100
Found Yes
Hash 00da26c85665d2643ab1ef3dc8e2bfb044f120706605d45969f5e80519f6e7e4
SimHash 4a00dd207190

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /search/*
Disallow *?s=*
Disallow *%26s%3D*
Disallow *?s$
Disallow *?s&*

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10