begellhouse.com
robots.txt

Robots Exclusion Standard data for begellhouse.com

Resource Scan

Scan Details

Site Domain begellhouse.com
Base Domain begellhouse.com
Scan Status Ok
Last Scan2024-10-07T22:07:16+00:00
Next Scan 2024-11-06T22:07:16+00:00

Last Scan

Scanned2024-10-07T22:07:16+00:00
URL https://www.begellhouse.com/robots.txt
Domain IPs 169.59.241.41, 2607:f0d0:1f02:45::
Response IP 169.59.241.41
Found Yes
Hash 4ec488e8c05537184372446f6c5fa5d96dac0692641193cf32bf698e0b32d32a
SimHash 30465d04c192

Groups

*

Rule Path
Disallow /files/
Disallow /ii/
Disallow /js/
Disallow /is/
Disallow /ic/
Disallow /img/
Disallow /st/
Disallow /flash/
Disallow /badmin/
Disallow /captcha/
Disallow /lib/
Disallow /user/
Disallow /cart/
Disallow /order/
Disallow /search/
Disallow /doi/

Other Records

Field Value
crawl-delay 30

Comments

  • robots.txt for www.begellhouse.com
  • TEMP
  • User-agent:*
  • Disallow: /