newarkbound.com
robots.txt

Robots Exclusion Standard data for newarkbound.com

Resource Scan

Scan Details

Site Domain newarkbound.com
Base Domain newarkbound.com
Scan Status Ok
Last Scan2026-02-07T19:04:56+00:00
Next Scan 2026-03-09T19:04:56+00:00

Last Scan

Scanned2026-02-07T19:04:56+00:00
URL http://newarkbound.com/robots.txt
Domain IPs 5.181.161.88
Response IP 5.181.161.88
Found Yes
Hash 6ee16a01812be24f91faa651087247273b3ccb045e9915e9f9728aabab2017ca
SimHash 1339d8438ff1

Groups

*

Rule Path
Disallow /tilda/form*
Disallow /tilda/rec*
Disallow /tilda/click*
Disallow /tilda/scroll*
Disallow /tilda/popup*
Disallow /tilda/cart*
Disallow /tilda/product*
Disallow /tilda/event*
Disallow /*_escaped_fragment_*
Disallow

Other Records

Field Value
sitemap http://newarkbound.com/sitemap.xml

Warnings

  • `host` is not a known field.