holbornhub.com
robots.txt
Robots Exclusion Standard data for holbornhub.com
Resource Scan
Scan Details
Site Domain | holbornhub.com |
Base Domain | holbornhub.com |
Scan Status | Ok |
Last Scan | 2024-11-12T07:16:36+00:00 |
Next Scan | 2024-11-19T07:16:36+00:00 |
Last Scan
Scanned | 2024-11-12T07:16:36+00:00 |
URL | https://holbornhub.com/robots.txt |
Redirect | https://www.holbornhub.com/robots.txt |
Redirect Domain | www.holbornhub.com |
Redirect Base | holbornhub.com |
Domain IPs | 209.58.155.78 |
Redirect IPs | 209.58.155.78 |
Response IP | 209.58.155.78 |
Found | Yes |
Hash | 3d04a35a88aeb7c712ed830140ee51c009ba85054985091f3ad247ae4542dcd3 |
SimHash | 4111f4556113 |
Groups
*
Rule | Path |
---|---|
Disallow | /cache/ |
Disallow | /components/ |
Disallow | /modules/ |
Disallow | /templates/ |
Disallow | /pc-openads/ |
Disallow | /images/ads/pb/ |
Disallow | /site/captcha.html* |
Disallow | /00/ |
Disallow | /requirements/ |
Disallow | /search/feedback.html |
Disallow | /search/resolve.html |
Disallow | /different_tmp_directory |
Allow | /templates/pc3/images/category-icons/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.holbornhub.com/pc-sitemap/browse/holborn-assets/sitemap.xml |
Warnings
- `host` is not a known field.