waterman.com
robots.txt
Robots Exclusion Standard data for waterman.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | waterman.com |
| Base Domain | waterman.com |
| Scan Status | Ok |
| Last Scan | 2026-03-04T05:49:05+00:00 |
| Next Scan | 2026-04-03T05:49:05+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-03-04T05:49:05+00:00 |
| URL | https://waterman.com/robots.txt |
| Redirect | https://www.waterman.com/robots.txt |
| Redirect Domain | www.waterman.com |
| Redirect Base | waterman.com |
| Domain IPs | 104.18.40.61, 172.64.147.195 |
| Redirect IPs | 104.18.40.61, 172.64.147.195 |
| Response IP | 172.64.147.195 |
| Found | Yes |
| Hash | 5a125663b5537059599a20403c8117ce43c91fa72bd39aa283b584762e6cc3aa |
| SimHash | 395449f00e51 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /*cart |
| Disallow | /account/ |
| Disallow | /setpassword/ |
| Disallow | /search? |
| Disallow | /confirmednewpassword/ |
| Disallow | /profile/ |
| Disallow | /orders/ |
| Disallow | *q%3D* |
| Disallow | *srule%3D* |
| Disallow | *format%3Dajax* |
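The rules above can be checked against a URL path with a small matcher. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), treats `*` as "any sequence of characters" and a trailing `$` as an end anchor, and compares paths literally with no percent-decoding, so `q%3D` only matches a URL that contains that encoded form. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and chosen for illustration.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/*cart"),
    ("disallow", "/account/"),
    ("disallow", "/setpassword/"),
    ("disallow", "/search?"),
    ("disallow", "/confirmednewpassword/"),
    ("disallow", "/profile/"),
    ("disallow", "/orders/"),
    ("disallow", "*q%3D*"),
    ("disallow", "*srule%3D*"),
    ("disallow", "*format%3Dajax*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumed precedence: the matching rule with the longest pattern wins,
    and an Allow rule wins when pattern lengths are equal. A path that
    matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/account/login")` and `is_allowed("/en/cart")` evaluate to `False`, while `is_allowed("/search")` (no query string) evaluates to `True`, because the `/search?` rule requires the literal `?`.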
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.waterman.com/sitemap_index.xml |