pahockey.net
robots.txt
Robots Exclusion Standard data for pahockey.net
Resource Scan
Scan Details
Site Domain | pahockey.net |
Base Domain | pahockey.net |
Scan Status | Ok |
Last Scan | 2024-09-22T11:00:32+00:00 |
Next Scan | 2024-09-29T11:00:32+00:00 |
Last Scan
Scanned | 2024-09-22T11:00:32+00:00 |
URL | http://pahockey.net/robots.txt |
Domain IPs | 34.171.199.74 |
Response IP | 34.171.199.74 |
Found | Yes |
Hash | 30dba173a47a3c1a1003096a41e2ef38c1d2e92b92f2d2abc996c4ab8673d4ff |
SimHash | d00c906baef2 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /p/ |
Disallow | /thumbnail_images/ |
Disallow | /buttons/ |
Disallow | /rosters/*/events.pdf* |
Disallow | /rosters/*/statistics.pdf |
Disallow | /themes/* |
Disallow | *.zip |
Other Records
Field | Value |
---|---|
sitemap | http://www.pahockey.net/sitemap.xml |