wchsb.com
robots.txt
Robots Exclusion Standard data for wchsb.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wchsb.com |
Base Domain | wchsb.com |
Scan Status | Ok |
Last Scan | 2024-10-14T07:25:00+00:00 |
Next Scan | 2024-11-13T07:25:00+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-14T07:25:00+00:00 |
URL | https://wchsb.com/robots.txt |
Domain IPs | 192.124.249.161 |
Response IP | 192.124.249.161 |
Found | Yes |
Hash | 44d836daba982965d1f93ac35d2246b6c6ee73a3030bded62f5d0cb680a023a8 |
SimHash | a054b9418391 |
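
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan does not state the algorithm or whether the body is normalized before hashing. The following is a minimal Python sketch, assuming a plain SHA-256 over the raw response bytes. The names `sha256_hex` and `matches_scan` are illustrative and not part of any scanner API.

```python
import hashlib

# Hash value reported in the Last Scan table above.
SCAN_HASH = "44d836daba982965d1f93ac35d2246b6c6ee73a3030bded62f5d0cb680a023a8"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_scan(body: bytes, expected: str = SCAN_HASH) -> bool:
    """Compare a robots.txt body against the reported hash.

    Assumption: the scanner hashes the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == expected.lower()
```

To use it, fetch https://wchsb.com/robots.txt and pass the raw response bytes to `matches_scan`. A mismatch may mean the file changed since the scan, or that the scanner computes its hash differently.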
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | *%26s%3D |
Disallow | /search/ |
Disallow | /author/ |
Disallow | /users/ |
Disallow | */trackback |
Disallow | */feed |
Disallow | */rss |
Disallow | */embed |
Disallow | *utm*%3D |
Disallow | *openstat%3D |
Disallow | /*/*.js |
Disallow | /*/*.css |
Allow | / |
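
How these rules combine can be sketched in Python. The sketch assumes the matching semantics of RFC 9309 as commonly implemented by major crawlers: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. It compares paths as literal strings without percent-decoding, which is a simplification. The names `pattern_to_regex` and `is_allowed` are hypothetical helpers, not a library API.

```python
import re

# Rules from the "*" group above, in listed order.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search/"),
    ("disallow", "/author/"),
    ("disallow", "/users/"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/rss"),
    ("disallow", "*/embed"),
    ("disallow", "*utm*%3D"),
    ("disallow", "*openstat%3D"),
    ("disallow", "/*/*.js"),
    ("disallow", "/*/*.css"),
    ("allow", "/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path may be crawled under the given rules."""
    best = None  # (pattern length, is_allow) of the winning rule so far
    for kind, pattern in rules:
        # re.match anchors at the start of the path (prefix matching).
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/about")` is true because only `Allow: /` matches, while `is_allowed("/search/foo")` is false because `/search/` is a longer match than `/`. Note that `/*/*.css` needs at least two slashes, so a top-level `/style.css` is not blocked by it.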
Other Records
Field | Value |
---|---|
sitemap | https://wchsb.com/sitemap.xml |
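
A crawler can read the sitemap record with Python's standard `urllib.robotparser` module. The sketch below parses an abbreviated stand-in for the file (only the `Allow` rule and the sitemap line from the records above, not the full original text) and assumes Python 3.8 or later, where `RobotFileParser.site_maps()` is available.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated stand-in for the scanned file; the real robots.txt
# contains the full rule group listed above.
ROBOTS_TXT = """\
User-agent: *
Allow: /

Sitemap: https://wchsb.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```

The standard-library parser does not implement `*` wildcards in paths, so it is used here only to extract the sitemap record, not to evaluate the Disallow rules.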