begellhouse.com
robots.txt
Robots Exclusion Standard data for begellhouse.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | begellhouse.com |
Base Domain | begellhouse.com |
Scan Status | Ok |
Last Scan | 2024-10-07T22:07:16+00:00 |
Next Scan | 2024-11-06T22:07:16+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-07T22:07:16+00:00 |
URL | https://www.begellhouse.com/robots.txt |
Domain IPs | 169.59.241.41, 2607:f0d0:1f02:45:: |
Response IP | 169.59.241.41 |
Found | Yes |
Hash | 4ec488e8c05537184372446f6c5fa5d96dac0692641193cf32bf698e0b32d32a |
SimHash | 30465d04c192 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /files/ |
Disallow | /ii/ |
Disallow | /js/ |
Disallow | /is/ |
Disallow | /ic/ |
Disallow | /img/ |
Disallow | /st/ |
Disallow | /flash/ |
Disallow | /badmin/ |
Disallow | /captcha/ |
Disallow | /lib/ |
Disallow | /user/ |
Disallow | /cart/ |
Disallow | /order/ |
Disallow | /search/ |
Disallow | /doi/ |
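
As a rough illustration of how a crawler might apply the Disallow rules above, the sketch below rebuilds an equivalent rule set and checks a few paths with Python's standard `urllib.robotparser`. The rebuilt file text is an assumption (the scan lists the parsed rules, not the raw file), and the agent name `example-bot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the rule table for the "*" group.
DISALLOWED = [
    "/files/", "/ii/", "/js/", "/is/", "/ic/", "/img/", "/st/", "/flash/",
    "/badmin/", "/captcha/", "/lib/", "/user/", "/cart/", "/order/",
    "/search/", "/doi/",
]

# Assumption: an equivalent robots.txt, rebuilt from the table; the
# original file's exact layout is not shown in the scan.
robots_lines = ["User-agent: *"] + [f"Disallow: {p}" for p in DISALLOWED]

parser = RobotFileParser()
parser.parse(robots_lines)


def is_allowed(path: str, agent: str = "example-bot") -> bool:
    """Return True if `agent` may fetch `path` under the rebuilt rules."""
    return parser.can_fetch(agent, "https://www.begellhouse.com" + path)


# Hypothetical sample paths, chosen only to exercise the rules.
for sample in ["/journals/", "/cart/checkout", "/doi/example"]:
    print(sample, is_allowed(sample))
```

Matching in `urllib.robotparser` is by path prefix, so anything under a listed directory is blocked while unlisted paths remain fetchable.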
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
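
To show what a crawl-delay of 30 implies in practice, here is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the directive sits inside the `*` group (the scan reports it only under "Other Records", so its position in the real file is unknown) and that the value is in seconds, which is the common convention but is not stated in the scan. The agent name `example-bot` and both helper functions are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: crawl-delay is placed inside the "*" group. Python's
# parser only records the directive when it follows a User-agent line.
robots_lines = [
    "User-agent: *",
    "Crawl-delay: 30",
    "Disallow: /search/",
]

parser = RobotFileParser()
parser.parse(robots_lines)

# crawl_delay() returns the parsed value for the matching group, or None.
delay = parser.crawl_delay("example-bot")


def max_requests_per_hour(delay_seconds: int) -> int:
    """Upper bound on requests per hour when waiting delay_seconds between them."""
    return 3600 // delay_seconds


def request_offsets(count: int, delay_seconds: int) -> list[int]:
    """Offsets in seconds from the first request at which each request may start."""
    return [i * delay_seconds for i in range(count)]


print(delay)
print(max_requests_per_hour(delay))
print(request_offsets(4, delay))
```

Under these assumptions a compliant crawler would make at most 120 requests per hour to the site.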