newarkbound.com
robots.txt
Robots Exclusion Standard data for newarkbound.com
Resource Scan
Scan Details
| Site Domain | newarkbound.com |
| Base Domain | newarkbound.com |
| Scan Status | Ok |
| Last Scan | 2026-02-07T19:04:56+00:00 |
| Next Scan | 2026-03-09T19:04:56+00:00 |
Last Scan
| Scanned | 2026-02-07T19:04:56+00:00 |
| URL | http://newarkbound.com/robots.txt |
| Domain IPs | 5.181.161.88 |
| Response IP | 5.181.161.88 |
| Found | Yes |
| Hash | 6ee16a01812be24f91faa651087247273b3ccb045e9915e9f9728aabab2017ca |
| SimHash | 1339d8438ff1 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /tilda/form* |
| Disallow | /tilda/rec* |
| Disallow | /tilda/click* |
| Disallow | /tilda/scroll* |
| Disallow | /tilda/popup* |
| Disallow | /tilda/cart* |
| Disallow | /tilda/product* |
| Disallow | /tilda/event* |
| Disallow | /*_escaped_fragment_* |
| Disallow |
Other Records
| Field | Value |
|---|---|
| sitemap | http://newarkbound.com/sitemap.xml |
Warnings
- `host` is not a known field.