hrblock.com
robots.txt
Robots Exclusion Standard data for hrblock.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | hrblock.com |
Base Domain | hrblock.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-11-12T01:30:34+00:00 |
Next Scan | 2025-02-10T01:30:34+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-03-24T16:45:59+00:00 |
URL | https://www.hrblock.com/robots.txt |
Domain IPs | 23.210.98.82 |
Response IP | 104.103.150.107 |
Found | Yes |
Hash | a8133ffac2a1b89dedbb1e5fc4895a3f6169333d93d6675873b580fe57646652 |
SimHash | ac101984e6cb |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /support/software/ |
Disallow | /am/index.html |
Disallow | /cmpgn/affiliate.html |
Disallow | /expat-tax-preparation/ajax/in/locations.html |
Disallow | /lp/fy16/diy-lineup.html |
Disallow | /lp/virtualdropoff/index.html |
Disallow | /offices/healthcare_tpf.html |
Disallow | /pdf/bw/ |
Disallow | /tax-offices/tax-pro-finder.html |
Disallow | /corporate/tax-franchise/pdfs/Franchise-Disclosure-Document.pdf |
Disallow | /directdepositoffer/IVR.pdf |
Disallow | /sc/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.hrblock.com/sitemap.xml |
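
The records above can be checked programmatically. The following is a minimal sketch, assuming the file consists of one `User-agent: *` group followed by a `Sitemap` line. The exact layout of the original file is not preserved in this scan, so the text built here is a reconstruction and will not reproduce the recorded hash. The names `build_robots_txt`, `is_allowed`, and the agent string `ExampleBot` are hypothetical and used only for illustration. The parser is Python's standard `urllib.robotparser`.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the Groups table of the last successful scan (2024-03-24).
DISALLOWED = [
    "/support/software/",
    "/am/index.html",
    "/cmpgn/affiliate.html",
    "/expat-tax-preparation/ajax/in/locations.html",
    "/lp/fy16/diy-lineup.html",
    "/lp/virtualdropoff/index.html",
    "/offices/healthcare_tpf.html",
    "/pdf/bw/",
    "/tax-offices/tax-pro-finder.html",
    "/corporate/tax-franchise/pdfs/Franchise-Disclosure-Document.pdf",
    "/directdepositoffer/IVR.pdf",
    "/sc/",
]
SITEMAP = "https://www.hrblock.com/sitemap.xml"


def build_robots_txt(paths, sitemap):
    """Rebuild a robots.txt body from the scanned records (assumed layout)."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in paths]
    lines += ["", f"Sitemap: {sitemap}"]
    return "\n".join(lines) + "\n"


robots_txt = build_robots_txt(DISALLOWED, SITEMAP)

# Parse the reconstructed text locally; no network request is made.
parser = RobotFileParser()
parser.parse(robots_txt.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, "https://www.hrblock.com" + path)


if __name__ == "__main__":
    for path in ("/sc/example", "/pdf/bw/form.pdf", "/tax-offices/"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
    print("sitemaps:", parser.site_maps())  # site_maps() requires Python 3.8+
```

The standard-library parser matches rules by path prefix, so a directory rule such as `/sc/` covers everything beneath it, while a rule naming a single file covers only paths that start with that exact string. Because the most recent scan failed, the live file may differ from this snapshot.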