thebay.com
robots.txt
Robots Exclusion Standard data for thebay.com
Resource Scan
Scan Details
Site Domain | thebay.com |
Base Domain | thebay.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-09-17T03:57:35+00:00 |
Next Scan | 2024-12-16T03:57:35+00:00 |
Last Successful Scan
Scanned | 2024-03-31T00:50:52+00:00 |
URL | https://www.thebay.com/robots.txt |
Domain IPs | 23.32.29.105, 23.32.29.107 |
Response IP | 23.32.29.105 |
Found | Yes |
Hash | 46b2e4769ebf41b5eef5ec6c80c2d41a661f6057204f5267cdd808358744323f |
SimHash | 2c444b586eb6 |
Groups
*
Rule | Path |
---|---|
Allow | /account/login |
Disallow | /account/ |
Disallow | /search |
Disallow | /cart |
Disallow | /checkout |
Disallow | /orderconfirm |
Disallow | /wishlist |
Disallow | /on/demandware.store/ |
Disallow | /c/*_* |
Disallow | /*_*?cgid |
Disallow | /*cgid* |
Disallow | /*pmin* |
Disallow | /*pmax* |
Disallow | /*prefn2* |
Disallow | /*prefn3* |
Disallow | /*prefn4* |
Disallow | /*prefv2* |
Disallow | /*prefv3* |
Disallow | /*prefv4* |
Other Records
Field | Value |
---|---|
sitemap | https://www.thebay.com/sitemap_index.xml |
Warnings
- 2 invalid lines.
Comments