bluesmenchannel.com
robots.txt
Robots Exclusion Standard data for bluesmenchannel.com
Resource Scan
Scan Details
Site Domain | bluesmenchannel.com |
Base Domain | bluesmenchannel.com |
Scan Status | Ok |
Last Scan | 2024-11-11T02:44:34+00:00 |
Next Scan | 2024-11-18T02:44:34+00:00 |
Last Scan
Scanned | 2024-11-11T02:44:34+00:00 |
URL | https://bluesmenchannel.com/robots.txt |
Domain IPs | 195.216.243.24 |
Response IP | 195.216.243.24 |
Found | Yes |
Hash | 02059f5e753c9b99cad06ecb384abe54e75a56e6d10f9e5545c764222daa4565 |
SimHash | 7b83726b00e2 |
Groups
*
Rule | Path |
---|---|
Disallow | /404.htm |
Disallow | /index/sub/ |
Disallow | /ShoutcastServ/ |
Disallow | /*ShoutcastServ |
Disallow | /*.zip |
similarpages/nutch-1.0-dev (similarpages nutch crawler; http://www.similarpages.com; info at similarpages dot com)
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://bluesmenchannel.com/sitemap.xml |
Warnings
- 8 invalid lines.
- `host` is not a known field.